Cluster Autoscaler, Kubernetes Authors, 2024 (The Linux Foundation) - Official documentation explaining the purpose, architecture, and configuration of the Kubernetes Cluster Autoscaler.
Taints and Tolerations, Kubernetes Authors, Current - Official guide to using taints and tolerations for node isolation and selective pod scheduling.
Resource Management for Pods and Containers, Kubernetes Authors, Current (The Kubernetes Project) - Describes how to specify resource requests and limits for the containers in a pod, the same mechanism used to request GPUs.
NVIDIA GPU Operator, NVIDIA, 2024 (NVIDIA) - Official documentation for deploying and managing NVIDIA GPUs in Kubernetes using the GPU Operator.
Running GPUs on GKE, Google Cloud, Current (Google Cloud) - Guide to configuring GPU-enabled nodes and running GPU workloads on Google Kubernetes Engine.