DVC Documentation, Iterative.ai, 2024 - Official documentation for Data Version Control (DVC), a tool for versioning datasets and machine learning models alongside code.
Git Large File Storage (LFS), GitHub, Inc., 2024 - Official documentation for Git LFS, an extension that handles large files efficiently by replacing them with text pointers inside Git.
Data Management for Machine Learning: A Survey, Wang, Jie; Kraska, Tim; Wu, Eugene, 2020. The VLDB Journal, Vol. 29 (Springer Berlin Heidelberg). DOI: 10.1007/s00778-019-00569-8 - A survey article that reviews data management in machine learning, including data versioning techniques and their challenges.