TFX: A TensorFlow-Based Production-Scale Machine Learning Platform, Denis Baylor, Jian Li, Andrei Lopatenko, Aurélien Plab, Mikhail Berezovskiy, Daniel Golovin, Robby Neiger, Andrew M. Miller, Stephen Kidd, Michael R. Jones, Alex Sergeev, Max G. E. Bauman, Ted H. Lee, Alexey Smirnov, David S. Berg, Todd Phillips, David F. Blank, Christine Cheng, Josh Smith, D. Sculley, Michael A. Walker, Clemens Mewald, 2017Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM)DOI: 10.1145/3097983.3098021 - Introduces TensorFlow Extended (TFX), a comprehensive ML platform where feature transformation pipelines are a central component for managing data preprocessing and ensuring consistency between training and serving.
Feature Transformations in Feast, Feast Project, 2025 (Feast Project) - Official documentation for Feast, an open-source feature store, detailing how to define and apply feature transformations as part of feature views, demonstrating practical pipeline implementation within a feature store.
Hidden Technical Debt in Machine Learning Systems, D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison, 2015Advances in Neural Information Processing Systems 28, Vol. 28 (NeurIPS) - A foundational paper discussing common pitfalls and sources of technical debt in real-world ML systems, including the challenge of ensuring consistent data transformations across training and serving environments.