{Seminar} @ CDS: #102, February 28th, 02:00: “Systems + AI Research at Microsoft”

When

28 Feb 25    
2:00 PM - 3:00 PM

Event Type


CDS Systems + ML Seminar Series


Speaker : Anjaly Parayil, Mayukh Das, Ayush Choure, Alind Khare

Affiliation : Microsoft M365 Research

Title : Systems + AI Research at Microsoft

Date & Time : February 28, 2025 (Friday), 02:00 – 03:00 PM

Venue : #102, CDS Seminar Hall


ABSTRACT

At Microsoft, we operate one of the largest productivity clouds and we need to keep pace with paradigm shifts such as the massive growth in AI workloads, sustainability push, the need for self-managing cloud environments and the complex challenges that arise out of its sheer scale. To solve these challenges, at M365 Research group in Microsoft, we have built a cross-domain research team focusing on applied research on “ML for Systems” to bring a step function improvement in Cloud Efficiency and Reliability. In this talk, we will discuss our collaboration with top research institutions to drive innovation and leverage the immense scientific knowledge and expertise to bring new ideas into practice. In this introductory session, we’ll briefly explore several challenges encountered in large-scale production environments, for example, scheduling and routing strategies for large language models (LLMs), intelligent monitoring to achieve near-perfect reliability, and cloud capacity allocation issues, etc.


SPEAKER’S BIO

Anjaly Parayil is a Senior Researcher at M365 Research leading applied research at the intersection of efficiency and reliability of cloud services. In particular, she works at the intersection of machine learning and systems to ensure continuous availability of cloud services as well as for the efficiency of Cloud infrastructure running various workloads, including the newly emerged Large Language Model workloads. Previously, she served as a Postdoctoral Researcher at the US Army Research Laboratory’s Computational and Information Sciences Directorate, specializing in reinforcement learning and Bayesian inferencing. Anjaly earned her doctorate from the Indian Institute of Science’s Department of Aerospace Engineering, with a thesis on uncertain systems and multi-agent control that received the Prof. A. K. Rao Medal for Best Ph.D. Thesis. Her work has resulted in over 25 publications and multiple patent filings.

Mayukh Das is a Senior Researcher at Microsoft driving applied AI research for Cloud Efficiency. In particular, he works on varied decision-making problems for configuration tuning for performance optimization of cloud services, for capacity provisioning, for power and energy optimization, and, operational efficiency of ML workloads. He completed his PhD from UT Dallas and his thesis work was focused on Reinforcement Learning and Probabilistic Modeling in Noisy domains. Prior to Microsoft he was at Samsung Research solving Edge-AI problems. He serves on the program committee of various conferences including AAAI, ICML, NeurIPS, SDM etc. and has served as a track chair at CODS-COMAD ‘24. Mayukh has authored 25+ publications in AI/ML and holds 7+ patents.

Ayush Choure is a Principal Researcher at Microsoft, working on reliability and efficiency problems at M365 Research. He has a PhD in Theoretical Computer Science from IIT Bombay, specializing in geometry and probability theory.

Alind Khare is a Senior Researcher at Microsoft, leading applied research on building efficient serving infrastructure for large language models (LLMs) at production scale. His work focuses on optimizing the serving of multimodal and reasoning-based LLMs. He earned his PhD from Georgia Tech, where he explored innovations across machine learning and systems (SysML)—developing hardware-aware neural architecture search techniques and building real-time ML serving systems with deadline guarantees. His research has been published in top conferences, including NSDI, ECCV, NeurIPS, ICLR, KDD, and MLSys.


Host Faculty: Prof. Yogesh Simmhan


ALL ARE WELCOME