Michelangelo: Uber's Machine Learning Platform, Siby, Praveen and Goel, Anirudh and Jain, Manu and Jain, Amit and Singh, Amit and Sharma, Hien and Kumar, Karan and Panwar, Vaibhav and Srivatsa, Sathish and Shvachko, Alex and Grewal, Navjot and Arora, Sahitya and Kumar, Manish and Singh, Kamesh and Singh, Gaurav and Pudi, Venkata and Nallamothu, Venkata and Jain, Anupam and Gupta, Sanjai, 2017arXiv preprint arXiv:1710.05370 - Describes the architecture of a large-scale machine learning platform, including the necessity and design of a feature store to address training-serving skew and ensure feature consistency.
Asynchronous I/O for External Data Access, Apache Flink Documentation, 2024 - Official documentation explaining Flink's Asynchronous I/O API, which is crucial for high-throughput interaction with external online feature stores.
Designing Machine Learning Systems, Huyen, Chip, 2022 (O'Reilly Media) - Provides a comprehensive explanation of feature stores, covering their architecture, role in preventing training-serving skew, and requirements like point-in-time correctness for ML systems.