Apache Iceberg Documentation, The Apache Software Foundation, 2024 - Provides detailed information on Iceberg's architecture, including its metadata layer and features like hidden partitioning.
Delta Lake Documentation, The Linux Foundation, 2024 - Describes Delta Lake's design, transactional guarantees, and capabilities such as time travel and schema evolution.
Apache Hudi Documentation, The Apache Software Foundation, 2024 - Explains Hudi's approach to mutable datasets, streaming ingestion, and table types: Copy-on-Write (COW) and Merge-on-Read (MOR).
Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores, Michael Armbrust, Tathagata Das, Liwen Sun, Burak Yavuz, Shixiong Zhu, Mukul Murthy, Joseph Torres, Herman van Hövell, Adrian Ionescu, Alicja Łuszczak, Michał Świtakowski, Michał Szafrański, Xiao Li, Takuya Ueshin, Mostafa Mokhtar, Peter Boncz, Ali Ghodsi, Sameer Paranjpye, Pieter Senster, Reynold Xin, and Matei Zaharia, 2020, Proceedings of the VLDB Endowment, Vol. 13, No. 12 (VLDB Endowment), DOI: 10.14778/3415478.3415560 - An academic paper describing the design and performance characteristics of Delta Lake, illustrating how transactional capabilities are added to data lakes.