Pattern Recognition and Machine Learning, Christopher M. Bishop, 2006 (Springer)DOI: 10.1007/b139590 - This textbook covers PCA, its principles, and limitations, offering a solid foundation in dimensionality reduction and related machine learning concepts.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This book discusses PCA in the context of representation learning, contrasting it with non-linear autoencoders and highlighting its limitations for complex data.
A Global Geometric Framework for Nonlinear Dimensionality Reduction, Joshua B. Tenenbaum, Vin de Silva, and John C. Langford, 2000Science, Vol. 290 (American Association for the Advancement of Science (AAAS))DOI: 10.1126/science.290.5500.2319 - This paper introduces Isomap, a manifold learning technique that demonstrates the inability of linear methods like PCA to capture the structure of curved manifolds, such as the "Swiss roll".