A Kernel Two-Sample Test, Arthur Gretton, Karsten M. Borgwardt, Malte J. Rasch, Bernhard Schölkopf, Alexander Smola, 2012, Journal of Machine Learning Research, Vol. 13 - Foundational paper introducing Maximum Mean Discrepancy (MMD) as a kernel-based two-sample test for comparing distributions.
What Makes a Good Synthetic Dataset? A Review of the Metrics Used for Evaluating Synthetic Tabular Data, Sudeendra Shrikumar, Gulten Arslan, Rahul Krishnan, 2022, Transactions on Machine Learning Research (ML open-source software community) - A comprehensive review of metrics and considerations for evaluating the quality of synthetic tabular data, including MMD, energy distance, and classifier-based approaches.