Geometry of Data


Modern data are high-dimensional, for example, images with millions of pixels, text corpora with millions of words, gene sequences with billions of base pairs, etc. However, these data tend to concentrate on lower-dimensional, nonlinear subspaces known as manifolds. This class covers the mathematical theory of high-dimensional geometry and manifolds and the application of this geometry to machine learning and data analysis.


Additional Reading

Manfredo do Carmo, Riemannian Geometry

Sigmundur Gudmundsson, Introduction to Riemannian Geometry

Example Jupyter Notebooks

For those of you who are relatively new to Jupyter, here are a few notebooks that you might find useful (from my undergraduate course Foundations of Data Analysis.)