14: Unsupervised Learning, etc.

Objectives

Skim the Wikipedia article on cluster analysis
- What is a “clustering”?
- What is the difference between “hard clustering” and “soft clustering”?
- Evaluation: What is the difference between “internal” and “external” evaluation?
- Applications: what are some applications of clustering in two diferent fields of interest to you?
k-means clustering
- Wikipedia article
- Scikit-learn documentation
- Scikit-learn example
- Can you explain what’s wrong in each subplot of the “Unexpected KMeans clusters” figure?

Specific methods:

Going further: