Research

My broad area of research is distributed systems and networking. I seek to build systems that both advance the state-of-the-art as well as the state of practice.

I am currently CEO & Co-founder at Feldera, where we are building a powerful incremental compute platform for AI, ML and data teams. It is powered by our award-winning research that allows us to incrementally execute arbitrarily complex SQL programs.

Prior to Feldera, I was a senior researcher at VMware Research (2016-2023), where I led research efforts to improve the scalability, reliability and extensibility of large-scale cluster managers. Anvil is a framework to build formally verified Kubernetes controllers and Sieve automatically tests Kubernetes controllers for reliability issues. DCM makes it easy to build scalable and flexible cluster managers using declarative programming.

As a PhD student (2012-2016), I invented techniques to deliver predictable performance for certain classes of distributed systems. I’m grateful that some of that work has been impactful. C3 ships with ElasticSearch and OpenSearch as the Adaptive Replica Selection feature, and influenced the design of Spotify’s ELS.

I’m a fervent champion of open-source software. My time with the ns-3 network simulator project was a formative part of my career, where I was an active contributor and maintainer between 2009 and 2016. My largest contribution to the project was ns-3’s integration with Click. I was excited to learn that ns-3 and its predecessors were awarded the 2020 ACM SIGCOMM Networking Systems Award.

Selected projects

Most of my projects are open-source and available on my Github page.

Anvil: Verifying liveness for cluster management controllers.
[Best paper award at OSDI ‘24] [code]
Sieve: Automatically testing Kubernetes controllers for distributed systems-ey bugs.
[OSDI ‘22] [HotOS ‘21] [KubeCon NA ‘21 talk] [code]
Declarative Cluster Managers (DCM): Combines incremental view maintenance, SQL and constraint programming to build scalable, flexible and powerful cluster managers (we built a high-performance Kubernetes Scheduler with it, among other things).
[VLDB ‘23] [OSDI ‘20]] [HotOS ‘19] [code]
Elmo: Scalable and flexible multicast at line-rate using source-routing. Check out Mellanox’s implementation of Elmo on their Spectrum-2 ASIC.
[SIGCOMM ‘19] [P4 Summit]
Rapid: widely used cluster membership protocols go haywire in the presence of complex failure scenarios (e.g. high packet loss). Rapid instead guarantees stable and strongly consistent membership at scale. Check out its use to scale Akka Cluster to 10K nodes.
[ATC ‘18] [code] [blog] [Community efforts: go-rapid, swift-rapid]
Wisp: decentralized, end-to-end rate limiting and request scheduling for micro-services.
[SoCC ‘17]
C3: a replica selection algorithm for distributed data stores that is robust to performance variability among replicas. It currently ships with ElasticSearch and has influenced the design of Spotify’s Expected Latency Selector.
[NSDI ‘15] [code]
Odin: a software-defined WiFi network, centered around a programmable virtual access point primitive. The project has seen many forks by researchers (a notable effort being the Wi-5 project).
[ATC ‘14] [HotSDN ‘12] [code]

Selected Publications

A full list of my publications can be seen on my Google Scholar page

Anvil: Verifying Liveness of Cluster Management Controllers
Xudong Sun, Wenjie Ma, Jiawei Tyler Gu, Zicheng Ma, Tej Chajed, Jon Howell, Andrea Lattuada, Oded Padon, Lalith Suresh, Adriana Szekeres, Tianyin Xu
OSDI 2024. Awarded Best Paper.
R3: Record-Replay-Retroaction for Database-Backed Applications
Qian Li, Peter Kraft, Michael Cafarella, Çağatay Demiralp, Goetz Graefe, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia
VLDB 2023.
Scaling a Declarative Cluster Manager Architecture with Query Optimization Techniques
Kexin Rong, Mihai Budiu, Athinagoras Skiadopoulos, Lalith Suresh, Amy Tai
VLDB 2023.
Automatic Reliability Testing For Cluster Management Controllers
Xudong Sun, Wenqing Luo, Tyler Gu, Aishwarya Ganesan, Ramnatthan Alagappan, Michael Gasch, Lalith Suresh, Tianyin Xu
OSDI 2022.
NSight: How to diagnose nanosecond network delays in rich end-host stacks
Roni Haecki, Radhika Niranjan Mysore, Lalith Suresh, Gerd Zellweger, Bo Gan, Timothy Merrifield, Sujata Banerjee, Mothy Roscoe
NSDI 2022.
DBOS: A DBMS-Oriented Operating System
Athinagoras Skiadopoulos, Qian Li, Peter Kraft, Kostis Kaffes, Daniel Hong, Shana Matthew, David Bestor, Michael Cafarella, Vijay Gadepally, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Tim Kraska, Michael Stonebraker, Lalith Suresh, Matei Zaharia.
VLDB 2022 (Vision paper). (To appear)
Reasoning about modern datacenter infrastructures using partial histories
Xudong Sun, Lalith Suresh, Aishwarya Ganesan, Ramnatthan Alagappan, Michael Gasch, Lilia Tang, and Tianyin Xu.
HotOS, 2021.
Building Scalable and Flexible Cluster Managers using Declarative Programming
Lalith Suresh, Joao Loff, Faria Kalim, Sangeetha Abdu Jyothi, Nina Narodytska, Leonid Ryzhyk, Sahan Gamage, Brian Oki, Pranshu Jain, Michael Gasch.
OSDI, 2020.
Elmo: Source Routed Multicast for Public Clouds
Muhammad Shahbaz, Lalith Suresh, Nick Feamster, Jen Rexford, Ori Rottenstreich, Mukesh Hira.
IEEE/ACM Transactions on Networking, 2020.
Hillview: A trillion-cell spreadsheet for big data
Mihai Budiu, Parikshit Gopalan, Lalith Suresh, Udi Wieder, Han Kruiger, Marcos K. Aguilera.
VLDB, 2019.
Elmo: Source Routed Multicast for Public Clouds
Muhammad Shahbaz, Lalith Suresh, Nick Feamster, Jen Rexford, Ori Rottenstreich, Mukesh Hira.
ACM SIGCOMM, 2019.
Synthesizing Cluster Management Code for Distributed Systems
Lalith Suresh, João Loff, Nina Narodytska, Leonid Ryzhyk, Mooly Sagiv, and Brian Oki.
HotOS, 2019.
Stable and Consistent Membership at Scale with Rapid
Lalith Suresh, Dahlia Malkhi, Parikshit Gopalan, Ivan Porto Carreiro, Zeeshan Lokhandwala.
USENIX ATC, 2018.
Remote regions: a simple abstraction for remote memory
Marcos K. Aguilera, Nadav Amit, Irina Calciu, Xavier Deguillard, Jayneel Gandhi, Stanko Novakovic, Arun Ramanathan, Pratap Subrahmanyam, Lalith Suresh, Kiran Tati, Rajesh Venkatasubramanian, Michael Wei.
USENIX ATC, 2018.
Kraken: Online and Elastic Resource Reservations for Cloud Datacenters
Carlo Fuerst, Stefan Schmid, Lalith Suresh, Paolo Costa.
IEEE/ACM Transactions on Networking, 2018.
Remote memory in the age of fast networks
Marcos K. Aguilera, Nadav Amit, Irina Calciu, Xavier Deguillard, Jayneel Gandhi, Pratap Subrahmanyam, Lalith Suresh, Kiran Tati, Rajesh Venkatasubramanian, Michael Wei.
SoCC 2017. (Vision paper)
Distributed Resource Management across Process Boundaries
Lalith Suresh, Peter Bodik, Ishai Menache, Marco Canini, Florin Ciucu.
SoCC 2017.
Rein: Taming Tail Latency in Key-Value Stores via Multiget Scheduling
Waleed Reda, Marco Canini, Lalith Suresh, Dejan Kostic, Sean Braithwaite.
EuroSys 2017.
Kraken: Online and Elastic Resource Reservations for Cloud Datacenters
Carlo Fuerst, Stefan Schmid, Lalith Suresh, Paolo Costa.
INFOCOM, 2017.
C3: Cutting Tail Latency in Cloud Data Stores via Adaptive Replica Selection
Lalith Suresh, Marco Canini, Stefan Schmid, Anja Feldmann
USENIX NSDI 2015.
Programmatic Orchestration of WiFi Networks
Julius Schulz-Zander, Lalith Suresh, Nadi Sarrar, Anja Feldmann, Thomas Hühn, Ruben Merz
USENIX ATC 2014.
Towards Programmable Enterprise WLANs with Odin
Lalith Suresh, Julius Schulz-Zander, Ruben Merz, Anja Feldmann, Teresa Vazao
HotSDN 2012.

Students

I’ve had the privilege of working with some fantastic PhD interns while at VMware Research:

Athinagoras Skiadopoulos (Stanford)
Xudong Sun (UIUC)
Faria Kalim (UIUC)
João Loff (IST - Lisbon)
Muhammad Shahbaz (Princeton University)
Michael Tong (University of Chicago)

Recent professional service

Program committee:

SIGCOMM 22
OSDI 21
ATC 20
NSDI 20
SOCC 19
ATC 18
ICDCS 18
HotCloud 17

Artifact Evaluation Committee co-chair:

OSDI 2021