Big Data: Principles and Best Practices of Scalable Realtime Data Systems, Nathan Marz and James Warren, 2015 (Manning Publications) - The definitive resource for understanding the Lambda Architecture, covering its motivations, layered components, and patterns for fault-tolerant data processing.
Questioning the Lambda Architecture, Jay Kreps, 2014 (O'Reilly Radar) - Introduces the Kappa architecture as an alternative to Lambda, advocating a log-centric approach in which all data processing is unified as stream processing.
Apache Flink Documentation, The Apache Flink Community, 2025 - Provides comprehensive information on Flink's capabilities, including its unified batch and stream processing model, exactly-once semantics, and state management, which enable the Kappa architecture.
Apache Kafka Documentation, The Apache Kafka Community, 2024 - Details Kafka's role as a distributed commit log, its guarantees, and features that position it as the immutable source of truth for modern data architectures like Kappa.
Stream Processing with Apache Flink: Fundamentals, Implementation, and Operation of Streaming Applications, Fabian Hueske and Vasiliki Kalavri, 2019 (O'Reilly Media) - A practical guide to building real-time data pipelines with Apache Flink, covering processing semantics, state management, and the architectural implications for stream-first designs.