Apache Iceberg Documentation, The Apache Software Foundation, 2024 (The Apache Software Foundation) - Explains Apache Iceberg table format concepts, including manifests and data skipping mechanisms.
Apache Parquet Format Specification, The Apache Software Foundation, 2024 - Details the structure of Parquet files, including metadata organization and column statistics.
Delta Lake Documentation, The Linux Foundation, 2024 - Covers Delta Lake features for performance, such as data skipping and Z-ordering.
Apache Iceberg: A Table Format for Analytic Datasets, Ryan Blue, Daniel Weeks, Jeremy Klumpp, Russell Spitzer, Fady Essam, Anton Okolnychyi, 2020Proceedings of the VLDB Endowment, Vol. 13 (VLDB Endowment)DOI: 10.14778/3407086.3407153 - Presents the architecture and features of Apache Iceberg, focusing on its metadata management with manifest lists and files for efficient query planning and data skipping.