Caching LLM calls, OpenAI, 2023OpenAI Cookbook (OpenAI) - Offers practical examples and guidance on implementing caching specifically for Large Language Model API calls, covering key generation and workflow.
High Performance Python: Practical Performant Programming for Humans, Micha Gorelick, Ian Ozsvald, 2020 (O'Reilly Media) - Includes discussions on various optimization techniques, including memoization and caching, relevant for improving the efficiency and speed of Python applications interacting with external services.