Synthetic Data Generation: A Review, Carla Pereira, Rui Mendes, Pedro Rodrigues, 2023Applied Sciences, Vol. 13 (MDPI)DOI: 10.3390/app13042738 - This paper reviews various methods and applications of synthetic data generation, providing a broad overview of the field and its tool categories.
Practical MLOps: Operationalizing Machine Learning Models, Noah Gift, Alfredo Deza, 2021 (O'Reilly Media) - This book provides guidance on selecting and integrating machine learning tools and frameworks, covering aspects like open-source choices and practical evaluation criteria, aligning with the "Evaluating Potential Tools" section.
The Synthetic Data Vault: An Ecosystem for Privately Sharing High-Quality Synthetic Data, Neha Gupta, Kalyan Veeramachaneni, Paul Van Horn, Vivek F. Farias, 2020ACM SIGMOD International Conference on Management of Data (SIGMOD) (ACM (Association for Computing Machinery))DOI: 10.1145/3318464.3380720 - This paper introduces the Synthetic Data Vault (SDV), a prominent ecosystem for generating synthetic data, serving as an example of a sophisticated tool and its underlying research.