Controlled experiments on the web: survey and practical guide, Ron Kohavi, Roger Longbotham, Dan Sommerfield, Randal M. Henne, 2009. Data Mining and Knowledge Discovery, Vol. 18 (Springer). DOI: 10.1007/s10618-008-0114-1 - This foundational survey paper offers a practical guide to designing, running, and analyzing controlled experiments on the web, providing a comprehensive background for A/B testing applications in search.
Online Experimentation in Large-Scale Production Systems, Dmitriy Fabijan, Pavel Dmitriev, Peter Shirobokov, 2019. IEEE Software, Vol. 36 (IEEE). DOI: 10.1109/MS.2019.2929910 - This paper provides an overview of the engineering challenges and best practices for implementing and managing online experimentation platforms in large-scale production environments, relevant to building A/B testing infrastructure for search.