
Presentation at CoNEXT ’24: “FAJITA: Stateful Packet Processing at 100 Million pps”

At CoNEXT ’24, Mariano presented our FAJITA paper. This work shows that a commodity server running a chain of stateful network functions can process more than 170 M packets per second (equivalent to 1.4 Tbps if payloads are stored in a disaggregated fashion, as in our earlier Ribosome work [NSDI ’23])! Another interesting and perhaps unexpected result is that, unless the number of so-called “elephant flows” is very small, spreading incoming packets among the cores using plain Receive Side Scaling (RSS) outperforms existing approaches that perform fine-grained flow accounting and load balancing. This happens because the potential gains are dwarfed by the slowdowns from the additional memory accesses.
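To see why plain RSS spreads load well once there are enough flows, here is a minimal Python sketch of hash-based dispatch (SHA-256 stands in for the Toeplitz hash that real NICs compute in hardware; the addresses and flow counts are illustrative):

```python
import hashlib
import random
from collections import Counter

NUM_CORES = 16

def rss_core(five_tuple):
    """Pick a receive core by hashing the flow 5-tuple, as RSS does.
    Real NICs use a Toeplitz hash in hardware; SHA-256 stands in here."""
    digest = hashlib.sha256(repr(five_tuple).encode()).digest()
    return int.from_bytes(digest[:4], "big") % NUM_CORES

# Simulate many distinct flows hitting the dispatcher.
random.seed(0)
flows = {("10.0.%d.%d" % (random.randrange(256), random.randrange(256)),
          "192.168.0.1", random.randrange(1024, 65536), 443, "tcp")
         for _ in range(10_000)}

load = Counter(rss_core(f) for f in flows)
# With many flows, per-core load stays close to even without any
# per-flow accounting or rebalancing.
```

Since the same flow always hashes to the same core, stateful processing also gets flow affinity for free, without the bookkeeping that fine-grained load balancers pay for.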

This is joint work with Hamid Ghasemirahni, Alireza Farshin (now at Nvidia), Mariano Scazzariello (now at RISE), Gerald Q. Maguire Jr., Dejan Kostić, and Marco Chiesa.

Our recording of Mariano’s talk is below:

Data centers increasingly utilize commodity servers to deploy low-latency Network Functions (NFs). However, the emergence of multi-hundred-gigabit-per-second network interface cards (NICs) has drastically increased the performance expected from commodity servers. Additionally, recently introduced systems that store packet payloads in temporary off-CPU locations (e.g., programmable switches, NICs, and RDMA servers) further increase the load on NF servers, making packet processing even more challenging.

This paper demonstrates the bottlenecks and challenges of state-of-the-art stateful packet processing frameworks and proposes a system, called FAJITA, to tackle these challenges and accelerate stateful packet processing on commodity hardware. FAJITA proposes an optimized processing pipeline for stateful network functions to minimize memory accesses and overcome the overheads of accessing shared data structures, while ensuring efficient batch processing at every stage of the pipeline. Furthermore, FAJITA provides a performant architecture to deploy high-performance network function service chains containing stateful elements with different state granularities. FAJITA improves the throughput and latency of high-speed stateful network functions by ~2.43x compared to the most performant state-of-the-art solutions, enabling commodity hardware to process up to ~178 Million 64-B packets per second (pps) using 16 cores.
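The overheads of shared data structures are easiest to see by contrast with a share-nothing design. The following is an illustrative sketch (an assumed design for exposition, not FAJITA's actual code): because RSS delivers all packets of a flow to the same core, each core can own a private flow table and process batches without any locks:

```python
import zlib
from collections import defaultdict

NUM_CORES = 4

class CoreLocalNF:
    """A stateful NF (here, a per-flow packet counter) whose state is
    owned by exactly one core, so no locking is ever needed."""
    def __init__(self):
        self.flow_table = defaultdict(int)

    def process_batch(self, batch):
        # Processing packets in batches keeps the flow table hot in
        # cache and amortizes per-packet overheads.
        for flow_id in batch:
            self.flow_table[flow_id] += 1

def dispatch(flow_id):
    # Stand-in for RSS: the same flow always maps to the same core.
    return zlib.crc32(flow_id.encode()) % NUM_CORES

cores = [CoreLocalNF() for _ in range(NUM_CORES)]

# Group incoming packets into per-core batches, then process each batch.
packets = ["flowA"] * 3 + ["flowB"] * 2 + ["flowC"]
batches = defaultdict(list)
for flow_id in packets:
    batches[dispatch(flow_id)].append(flow_id)
for core_id, batch in batches.items():
    cores[core_id].process_batch(batch)

total = sum(sum(nf.flow_table.values()) for nf in cores)
```

A shared flow table would instead need atomic operations or locks on every update, which is exactly the kind of memory-access overhead the pipeline is designed to avoid.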

NetConfEval accepted at CoNEXT 2024

Can Large Language Models facilitate network configuration? In our recently accepted CoNEXT 2024 paper, we investigate the opportunities and challenges of operating network systems using recent LLMs.

We devise a benchmark for evaluating the capabilities of different LLMs on a variety of networking tasks and show different ways of integrating such models within existing systems. Our results show that different models work better on different tasks. Translating high-level human-language requirements into formal specifications (e.g., API function calling) can be done with small models. However, generating code that controls network systems is only doable with larger LLMs, such as GPT-4.
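To make the function-calling case concrete, here is a hypothetical sketch (the API name, parameters, and model output below are illustrative, not part of NetConfEval): the model translates a human-language requirement into a JSON function call, which the system then validates and dispatches against a fixed API:

```python
import json

# Hypothetical network-control API the model is allowed to call.
def add_route(prefix: str, next_hop: str) -> str:
    """Install a static route (stub for a real controller action)."""
    return f"route {prefix} via {next_hop} installed"

API = {"add_route": add_route}

# What a function-calling LLM might emit for the requirement
# "route traffic for 10.0.0.0/24 through 192.168.1.1":
llm_output = json.dumps(
    {"name": "add_route",
     "arguments": {"prefix": "10.0.0.0/24", "next_hop": "192.168.1.1"}})

# The system side: parse the call and dispatch it to the real function.
call = json.loads(llm_output)
result = API[call["name"]](**call["arguments"])
```

The model only has to produce well-formed JSON naming a known function, which is a much easier target for small models than generating correct control code end to end.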

This is a fundamental first step in our SEMLA project, which looks at ways to integrate LLMs into system development.

GitHub code: link

Hugging Face: link

Paper PDF: link

Best paper at ACM CoNEXT 2023

We are hugely honored that our “Millions of Low-Latency Insertions on ASIC switches” paper received the Best Paper Award at ACM CoNEXT 2023! More details are available in our earlier post.

From left to right: Tommaso Caiazzi, Mariano Scazzariello, Marco Chiesa, Olivier Bonaventure (TPC co-chair)

SEMLA: New Vinnova-funded project on LLMs for cybersecurity

Our “SEMLA: Securing Enterprises via Machine-Learning-based Automation” project proposal has been selected for funding by Vinnova. The project cost is 12 MSEK, with Prof. Marco Chiesa as the PI. Other project partners include members from the Computer Security group at KTH, the Connected Intelligence unit at RISE, Red Hat, and Saab.

The SEMLA project seeks to make the development of software systems more resilient, secure, and cost-effective. SEMLA leverages recent advancements in machine learning (ML) and artificial intelligence (AI) to automate critical yet common and time-consuming tasks in software development that often lead to catastrophic security vulnerabilities.

Switcharoo accepted at CoNEXT 2023

Today’s network functions require keeping state at the granularity of each individual flow. Storing such state on network devices is highly challenging due to the complexity of the involved data structures. As such, the state is often stored on inefficient CPU-based servers as opposed to high-speed ASIC network switches. In our newly accepted CoNEXT paper, we demonstrate the possibility of performing tens of millions of low-latency flow state insertions on ASIC switches, showing that our implementation requires up to 75x less memory than existing probabilistic data structures in a common datacenter scenario. A PDF of the paper will soon be available. This was joint work between Mariano Scazzariello, Tommaso Caiazzi (from Roma Tre University), and Marco Chiesa.
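To give a flavor of exact-match flow-state insertion, here is a minimal software sketch of a two-choice (cuckoo-style) hash table, the kind of exact data structure such work builds on. This is purely illustrative: the paper's ASIC data-plane design operates under far tighter constraints than this Python model.

```python
import zlib

SLOTS = 1024

# Two independent hash functions give every key two candidate slots
# (crc32's second argument is the starting CRC value, used here as a seed).
def h1(key): return zlib.crc32(key) % SLOTS
def h2(key): return zlib.crc32(key, 0xABCD) % SLOTS

table = [None] * SLOTS  # each slot holds one (key, value) entry or None

def insert(key, value, max_kicks=32):
    """Insert an entry, evicting ('kicking') occupants to their
    alternate slot when a collision occurs."""
    entry = (key, value)
    idx = h1(key)
    for _ in range(max_kicks):
        if table[idx] is None:
            table[idx] = entry
            return True
        table[idx], entry = entry, table[idx]   # evict the occupant
        k = entry[0]
        idx = h2(k) if idx == h1(k) else h1(k)  # send it to its other slot
    return False  # give up: table is too full

def lookup(key):
    """Exact-match lookup: check both candidate slots."""
    for idx in (h1(key), h2(key)):
        if table[idx] is not None and table[idx][0] == key:
            return table[idx][1]
    return None
```

Unlike probabilistic structures such as Bloom filters, this table stores keys exactly, so lookups never return false positives; the hard part, and the focus of the paper, is performing the insertions at high rate and low latency inside the switch itself.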