Meta’s AI Research SuperCluster: A new AI supercomputer to advance AI, Kevin Lee, Shubho Sengupta, 2022Meta AI Blog (Meta) - Describes the architecture of a large-scale, on-premise supercomputer built for AI, detailing high-performance interconnects like InfiniBand and distributed training setup.
Cloud TPU Architecture, Google Cloud, 2024 (Google LLC) - Provides technical specifications and architectural insights into Google's Tensor Processing Units (TPUs), specifically designed for large-scale machine learning workloads in the cloud.