Chapter Eight

Unsupervised Learning

Learning Objectives
  1. Formulate the unsupervised learning problem as density estimation, clustering, dimensionality reduction, or generative modelling, and identify which family a given task belongs to
  2. Derive Lloyd's algorithm for k-means as coordinate descent on the within-cluster sum of squares, prove its monotone convergence, and implement k-means++ initialisation
  3. Derive the EM algorithm for Gaussian mixture models, recognise it as coordinate ascent on the evidence lower bound, and implement it from scratch
  4. Choose between agglomerative linkage criteria (single, complete, average, Ward) on the basis of cluster shape and noise sensitivity
  5. Apply DBSCAN, HDBSCAN, and spectral clustering, choosing parameters from k-distance plots and the eigengap heuristic
  6. Derive PCA from the maximum-variance, minimum-reconstruction-error, and SVD viewpoints, and implement probabilistic PCA
  7. Use kernel PCA, t-SNE, UMAP, autoencoders, Isomap, locally linear embedding, and Laplacian eigenmaps for non-linear dimensionality reduction
  8. Derive Latent Dirichlet Allocation and the collapsed Gibbs sampler, and apply it to a document corpus
  9. Evaluate unsupervised methods using internal indices (silhouette, Davies-Bouldin, Calinski-Harabasz), external indices (mutual information, ARI), and downstream-task performance
  10. Situate unsupervised learning relative to modern self-supervised learning: pretext tasks, contrastive losses, masked modelling

In this chapter

This site is currently in Beta. Please get in touch via chrispaton.org with any suggestions, questions or comments.