LangSmith Documentation, LangChain, 2024 - Official documentation for LangSmith, detailing its features for tracing, debugging, monitoring, evaluation, and feedback collection for LLM applications.
LangChain Python Documentation, LangChain, 2024 - The primary documentation for the LangChain framework, providing context for building the LLM applications that LangSmith helps manage in production.
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena, Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric P. Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica, 2023, NeurIPS 2023 Datasets and Benchmarks Track, DOI: 10.48550/arXiv.2306.05685 - A research paper that explores the methodology and challenges of using large language models as judges for evaluating the quality of other LLM-generated text, relevant to LangSmith's evaluation framework.