Operationalizing Generative AI, Google Cloud, 2023. Google Cloud Blog (Google Cloud) - Provides a comprehensive overview of the lifecycle and operational considerations for deploying and managing generative AI applications, including specific challenges related to model evolution, data management, evaluation, and cost optimization.
Retrieval-Augmented Generation for Large Language Models: A Survey, Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang, 2023. arXiv preprint arXiv:2312.10997. DOI: 10.48550/arXiv.2312.10997 - This survey provides a comprehensive overview of Retrieval-Augmented Generation (RAG) and includes a section on the operational challenge of managing and updating the external knowledge base to ensure data freshness for RAG systems.
OWASP Top 10 for Large Language Model Applications, OWASP Foundation, 2023. OWASP Project (OWASP Foundation) - Provides a list of the top 10 most critical security risks specifically for Large Language Model applications, offering guidance for identifying, assessing, and mitigating vulnerabilities as part of ongoing security posture maintenance.