Pandas User Guide: Memory Usage and Optimized Data Storage, The Pandas Development Team, 2024 - Official pandas documentation sections covering memory usage, categorical data types, and efficient I/O techniques such as chunked reading, all useful for optimizing data structures and data loading in machine learning.
High Performance Python: Practical Performant Programming for Humans, Micha Gorelick, Ian Ozsvald, 2020 (O'Reilly Media) - A widely recognized book offering practical strategies for optimizing Python code, with detailed discussions of memory management, data structures, and profiling techniques relevant to data science and machine learning applications.
NumPy documentation: The N-dimensional array (ndarray), The NumPy Developers, 2024 - Official NumPy documentation explaining the ndarray object, its memory layout, how to measure an array's size in bytes (nbytes), and how converting data types (astype) affects memory consumption, all of which is fundamental for efficient numerical computing in ML.