Use volumes, Docker Documentation, 2024 (Docker) - Official guide to Docker volumes, explaining their purpose, usage, and how they provide persistent storage for containers, which is essential for external model loading.
Best practices for writing Dockerfiles, Docker Documentation, 2024 - Provides guidelines for optimizing Docker images, including minimizing size and build times, which is relevant to the increased image size disadvantage of embedding models.
Introducing MLOps: How to go from Model to Production, Mark Treveil, Nicolas Omont, Aurélien Géron, Clemens Mewald, Michel Tessier, Alexey Grigorev, 2022 (O'Reilly Media) - A comprehensive book on MLOps practices that includes discussions on model serving, deployment strategies, and managing model artifacts in production environments.
MLOps: A guide to operations for machine learning, Google Cloud Documentation, 2024 (Google Cloud) - Google Cloud's official MLOps guide, covering principles and practices for deploying and managing ML systems, including considerations for packaging and serving models with containerization.