Research
In my research, I identify and solve fundamental problems that limit the performance and availability of large-scale cloud networks. To do so, I develop techniques grounded in a broad set of domains, including graph theory, optimization, and formal methods, while closely tying them to real-world systems and their practical constraints.
So far, my approach has led to the development of:
-
Scalable algorithms and optimization methods with formal guarantees, which have resulted in both conceptual advances and tangible impact in areas such as resource allocation and clock synchronization.
-
Systems and methods that identify, explain, and effectively address performance gaps in both handcrafted and learning-enabled heuristics.
Currently, I am extending this agenda by redesigning large-scale training and inference pipelines to power the next generation of AI models.