Release It! Design and Deploy Production-Ready Software, Michael T. Nygard, 2018 (The Pragmatic Bookshelf) - A foundational book on designing and deploying resilient software systems, covering patterns like retries, timeouts, and graceful degradation for handling failures in distributed environments.
Retrying on errors with exponential backoff, Google Cloud Documentation, 2024 (Google Cloud) - Official Google Cloud documentation providing best practices for implementing retry mechanisms with exponential backoff and jitter for transient errors in distributed systems.