Highlights
IEEE MetroInd 2026: Characterizing GPU Capacity and Computational Cost in Production AI Inference
My paper on a replica-centric capacity-planning framework for GPU and Multi-Instance GPU (MIG) based AI inference platforms was accepted at IEEE MetroInd 4.0 & IoT 2026 in Rome, Italy.