{"id":27,"date":"2024-12-18T12:02:50","date_gmt":"2024-12-18T11:02:50","guid":{"rendered":"https:\/\/www.kth.se\/blogs\/dai\/?page_id=27"},"modified":"2026-03-30T13:19:00","modified_gmt":"2026-03-30T11:19:00","slug":"publications","status":"publish","type":"page","link":"https:\/\/www.kth.se\/blogs\/dai\/publications\/","title":{"rendered":"Publications"},"content":{"rendered":"<div class=\"post-content-wrapper\"><ol>\n<li>&#8220;<a href=\"https:\/\/dejankosticgithub.github.io\/documents\/publications\/queuemem-nsdi26.pdf\"><strong>Queue-Mem: Energy-Efficient Hardware Storage for Advanced Network Function Acceleration<\/strong><\/a>&#8221;, Mariano Scazzariello, Tommaso Caiazzi, Hamid Ghasemirahni, Dejan Kosti\u0107, and Marco Chiesa, <em>Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (<strong>NSDI<\/strong>)<\/em>, May 2026. <em>To appear<\/em><\/li>\n<li>&#8220;<a href=\"https:\/\/openreview.net\/forum?id=F7rUng23nw\"><strong>KVComm: Enabling Efficient LLM Communication through Selective KV Sharing<\/strong><\/a>&#8221;, Xiangyu Shi, Marco Chiesa, Gerald Q. Maguire Jr., and Dejan Kosti\u0107, <em>Proceedings of the Fourteenth International Conference on Learning Representations (<strong>ICLR<\/strong>)<\/em>, April 2026.<\/li>\n<li>&#8220;<a href=\"https:\/\/kth.diva-portal.org\/smash\/record.jsf?pid=diva2%3A2025782&amp;dswid=-3153\"><strong>Cloud abstractions for AI workloads<\/strong><\/a>&#8221;, Marco Canini, Theophilus A. 
Benson, Ricardo Bianchini, \u00cd\u00f1igo Goiri, Dejan Kosti\u0107, Peter Pietzuch, and Simon Peter, <em>Proceedings of the 16th ACM SIGOPS Asia-Pacific Workshop on Systems (<strong>APSys<\/strong>)<\/em>, October 2025.<\/li>\n<li>&#8220;<a href=\"https:\/\/kth.diva-portal.org\/smash\/record.jsf?pid=diva2%3A2025739&amp;dswid=-2424\"><strong>Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning<\/strong><\/a>&#8221;, Laura Puccioni, Alireza Farshin, Mariano Scazzariello, Changjie Wang, Marco Chiesa, and Dejan Kosti\u0107, <em>Proceedings of the Second International Workshop on Large Language Models for Code (<strong>LLM4Code<\/strong>)<\/em>, May 2025.<\/li>\n<li>&#8220;<a href=\"https:\/\/kth.diva-portal.org\/smash\/record.jsf?pid=diva2%3A2025736&amp;dswid=3848\"><strong>Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues<\/strong><\/a>&#8221;, Daniele Cipollone, Changjie Wang, Mariano Scazzariello, Simone Ferlin, Maliheh Izadi, Dejan Kosti\u0107, and Marco Chiesa, <em>Proceedings of the Second International Workshop on Large Language Models for Code (<strong>LLM4Code<\/strong>)<\/em>, May 2025.<\/li>\n<li>&#8220;<a href=\"https:\/\/kth.diva-portal.org\/smash\/record.jsf?pid=diva2%3A1980009&amp;dswid=4796\"><strong>Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference<\/strong><\/a>&#8221;, Mohammad Siavashi, Faezeh Keshmiri Dindarloo, Dejan Kosti\u0107, and Marco Chiesa, <em>Proceedings of the 5th Workshop on Machine Learning and Systems (<strong>EuroMLSys<\/strong>)<\/em>, March 2025.<\/li>\n<\/ol>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>&#8220;Queue-Mem: Energy-Efficient Hardware Storage for Advanced Network Function Acceleration&#8221;, Mariano Scazzariello, Tommaso Caiazzi, Hamid Ghasemirahni, Dejan Kosti\u0107, and Marco Chiesa, Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI), May 2026. 
To appear &#8220;KVComm: Enabling Efficient LLM Communication through Selective KV Sharing&#8221;, Xiangyu Shi, Marco Chiesa, Gerald Q. Maguire Jr., and [&hellip;]<\/p>\n","protected":false},"author":621,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"inline_featured_image":false,"footnotes":""},"class_list":["post-27","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/pages\/27","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/users\/621"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/comments?post=27"}],"version-history":[{"count":2,"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/pages\/27\/revisions"}],"predecessor-version":[{"id":40,"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/pages\/27\/revisions\/40"}],"wp:attachment":[{"href":"https:\/\/www.kth.se\/blogs\/dai\/wp-json\/wp\/v2\/media?parent=27"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}