Senior Program Manager
Program Manager II
Bachelor in Computer Science
Presents the system architecture of Deep Learning Inference Service, Bing's platform for deep neural network model inference. This paper introduces core concepts such as intelligent model placement, heterogeneous resource management, resource isolation, and efficient routing. Highlighted as one of Topbot's top 2019 AI advances in machine learning infrastructure.