CUDA C++ Programming Guide, NVIDIA, 2023 (NVIDIA) - Official guide detailing GPU memory hierarchy, coalesced access, and memory optimization techniques crucial for high-performance GPU computing.
Computer Architecture: A Quantitative Approach, John L. Hennessy, David A. Patterson, 2017 (Morgan Kaufmann) - Foundational textbook covering CPU memory hierarchy, cache behavior, and principles of efficient data access in computer systems.
XLA: Optimizing Compilers for Machine Learning, Google, 2023 (Google) - Official documentation explaining XLA's role as an optimizing compiler, including its capabilities in memory layout assignment and hardware-specific optimizations for JAX.
Array memory layout, NumPy Developers, 2023 - Official NumPy documentation explaining the memory layout of ndarray objects, including C-contiguous (row-major) and Fortran-contiguous (column-major) orders.