The Google File System, Sanjay Ghemawat, Howard Gobioff, Shun-Tak Leung, 2003, Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, DOI: 10.1145/945445.945451 - A foundational paper describing the design of Google's distributed file system, including its design choices for scalability, fault tolerance, and large sequential data access, which influenced HDFS.
Apache Hadoop HDFS, The Apache Software Foundation, 2025 - Official documentation describing the architecture, design, and operational aspects of the Hadoop Distributed File System.
Amazon Simple Storage Service (S3) Documentation, Amazon Web Services (AWS), 2024 - Official documentation providing comprehensive details on Amazon S3, its features, API, data models, and best practices for cloud object storage.
fsspec: Filesystem interfaces for Python, Martin Durant and other contributors, 2025 - The official documentation for fsspec, a Python library that provides a unified interface for accessing various file systems, including local, HDFS, and cloud object storage like S3.