Database Management Systems, Raghu Ramakrishnan, Johannes Gehrke, 2003 (McGraw-Hill Education) - A standard textbook providing fundamental concepts of distributed databases and query processing, including join algorithms.
Performance Tuning Guide - Join Strategy Hints, The Apache Software Foundation, 2024 (The Apache Software Foundation) - Official documentation explaining join strategies in Apache Spark SQL, including broadcast and shuffle joins, and how to influence them.
Query Profile, Snowflake Inc., 2025 (Snowflake Inc.) - Snowflake's official documentation describing how to analyze query execution plans to understand join strategies and performance.
Skew-Tolerant Join Methods for MapReduce, Bugra Okcan, Vadim Riabov, 2011, Proceedings of the VLDB Endowment, Vol. 4 (VLDB Endowment), DOI: 10.14778/340270.340277 - This paper discusses methods to handle data skew during distributed join operations in large-scale data processing systems.