03-31, 18:00–19:30 (CET), Mees I
Revealing the Unstable Foundations of eBPF-Based Kernel Extensions
Shawn Zhong (University of Wisconsin-Madison); Jing Liu (Microsoft Research); Andrea Arpaci-Dusseau (University of Wisconsin-Madison); Remzi Arpaci-Dusseau (University of Wisconsin-Madison)
Dandelion Hashtable: Cracking the Billion Memory Requests Per Second Barrier — Effortlessly
Antonios Katsarakis, Vasilis Gavrielatos (Huawei Research); Nikos Ntarmos (Huawei Research UK)
MEPipe: Democratizing LLM Training with Memory-Efficient Slice-Level Pipeline Scheduling on Cost-Effective Accelerators
Zhenbo Sun, Shengqi Chen, Yuanwei Wang (Tsinghua University); Jian Sha (Ant Group); Guanyu Feng (Zhipu AI); Wenguang Chen (Tsinghua University)
Towards Interference-aware Application Co-locations
Ioannis A. Papadakis (National Technical University of Athens); Nectarios Koziris (National Technical University of Athens, Greece); Georgios Goumas (National Technical University of Athens)
Beyond Layers: Container Registries for Files Distribution and On-Demand Image Partitioning
Giovanni Bartolomeo, Navidreza Asadi, Wolfgang Kellerer (Technical University of Munich); Jorg Ott (Technische Universität München); Nitinder Mohan (TU Delft)
Boosting Rematerialization Training via Execution Mode Splitting Modeling on Convex Optimized Dynamic Programming
Yu Tang, Lujia Yin (National University of Defense Technology); Qiao Li, Yiming Zhang (Xiamen University)
The LAW Behind ALRs: Redefining Crash-Tolerant Reads
Antonios Katsarakis (Huawei Research); Emmanouil Giortamis (TU Munich); Vasilis Gavrielatos (Huawei Research); Pramod Bhatotia (TU Munich); Aleksandar Dragojevic (None); Boris Grot (University of Edinburgh); Vijay Nagarajan (University of Utah); Panagiota Fatourou (FORTH ICS and University of Crete, Greece)
DPack: Efficiency-Oriented Privacy Budget Scheduling
Pierre Tholoniat, Kelly Kostopoulou (Columbia University)
Adios to Busy-Waiting for Microsecond-scale Memory Disaggregation
Wonsup Yoon, Jisu Ok, Sue Moon, Youngjin Kwon (KAIST)
Reproducible Fault Injection at the Operating System Level
Sebastião Amaro, Miguel Matos (IST Lisbon & INESC-ID); Pedro Fonseca (Purdue University)
HawkSet: Automatic, Application-Agnostic, and Efficient Concurrent PM Bug Detection
João Oliveira (INESC-ID, IST); João Gonçalves (INESC-ID & IST U. Lisboa); Miguel Matos (IST Lisbon & INESC-ID)
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion
Jiayi Yao (The Chinese University of Hong Kong, Shenzhen); Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng (University of Chicago); Qizheng Zhang (Stanford University); Kuntai Du (University of Chicago); Shan Lu (University of Chicago / Microsoft Research); Junchen Jiang (University of Chicago)
Evolving XFS with Zoned Storage and Intelligent Data Placement
Hans Holmberg, Christoph Hellwig (Western Digital Research)
NeuStream: Bridging Deep Learning Serving and Stream Processing
Haochen Yuan (Peking University); Yuanqing Wang (Peking University and Microsoft Research); Wenhao Xie (Peking University); Yu Cheng (Peking University and Microsoft Research); Ziming Miao, Lingxiao Ma, Jilong Xue (Microsoft Research); Zhi Yang (Peking University)
GraphGen+: Advancing Distributed Subgraph Generation and Graph Learning On Industrial Graphs
Yue Jin, Yongchao Liu, Chuntao Hong (Ant Group)
CAPSys: Contention-aware task placement for data stream processing
Yuanli Wang, Lei Huang, Zikun Wang, Vasiliki Kalavri, Abraham Matta (Boston University)
SeBS-Flow: Benchmarking Serverless Cloud Function Workflows
Larissa Schmid (Karlsruhe Institute of Technology); Marcin Copik (ETH Zurich); Alexandru Calotoiu (ETH Zurich, Switzerland); Laurin Brandner (ETH Zürich); Anne Koziolek (KIT Karlsruhe Institute of Technology); Torsten Hoefler (ETH Zurich)
Towards VM Rescheduling Optimization Through Deep Reinforcement Learning
Xianzhong Ding (University of California, Merced); Yunkai Zhang (University of California, Berkeley); Binbin Chen (ByteDance Inc); Donghao Ying (University of California, Berkeley); Tieying Zhang (ByteDance); Jianjun Chen (Bytedance); Lei Zhang (ByteDance); Alberto Cerpa, Wan Du (University of California, Merced)
HyperAlloc: Efficient VM Memory De/Inflation via Hypervisor-Shared Page-Frame Allocators
Lars Wrenger, Kenny Albes, Marco Wurps (Leibniz Universität Hannover); Christian Dietrich (Technische Universität Braunschweig); Daniel Lohmann (Leibniz Universität Hannover)
Kirsch: It's 2025. Does your OS know what's on your SoC?
Roman Meier, Zikai Liu, Ben Fiedler, Timothy Roscoe (ETH Zurich)
ConsenStress: A Framework to Torture Test Consensus Protocols
Pasindu Tennage, Shailesh Mishra (EPFL); Alberto Sonnino (Mysten Labs, UCL); Eleftherios Kokoris Kogias (Mysten Labs); Philipp Jovanovic (UCL); Bryan Ford (EPFL)
Distributed Graph Neural Network Inference With Just-In-Time Compilation For Industry-Scale Graphs
Xiabao Wu, Yongchao Liu, Wei Qin, Chuntao Hong (Ant Group, China)
Comprehensive Deadlock Prevention for GPU Collective Communication
Lichen Pan (School of Computer Science, Peking University); Juncheng Liu (OneFlow Research); Yongquan Fu (Science and Technology Laboratory of Parallel and Distributed Processing; College of Computer, National University of Defense Technology, Changsha, Hunan province, Chin); Jinhui Yuan (OneFlow Research); Rongkai Zhang, Pengze Li, Zhen Xiao (School of Computer Science, Peking University)
Understanding the Linux Kernel, Visually
Hanzhi Liu, Yanyan Jiang, Chang Xu (Nanjing University)
Achilles: Efficient TEE-Assisted BFT Consensus via Rollback Resilient Recovery
Jianyu Niu (Southern University of Science and Technology); Xiaoqing Wen (University of British Columbia); Guanlong Wu (Southern University of Science and Technology); Shengqi Liu (Southern University of Science and Technology.); Jiangshan Yu (The University of Sydney); Yinqian Zhang (Southern University of Science and Technology)
Socarrat: Building Cost-Effective Secure WORM Devices Following the Reverse File System Approach
Gorka Guardiola Múzquiz, Enrique Soriano-Salvador (Universidad Rey Juan Carlos)
Eva: Cost-Efficient Cloud-Based Cluster Scheduling
Tzu-Tao Chang, Shivaram Venkataraman (University of Wisconsin-Madison)
Dandelion: Small Clusters, Massive Throughput—The Future of Distributed Transactions
Antonios Katsarakis, Vasilis Gavrielatos (Huawei Research); Chris Jensen (University of Cambridge); Nikos Ntarmos (Huawei Research UK)
Maestro: VM memory overcommit balancing platform
Adamos Ttofari, Lukas Humbel (Huawei)
Towards Efficient Large Language Model Serving for Multi-turn Conversations
Jianxiong Liao, Zhi Zhou (Sun Yat-sen University)
A Hardware-Software Co-Design for Efficient Secure Containers
Jiacheng Shi, Yang Yu, Jinyu Gu, Yubin Xia (Shanghai Jiao Tong University)
A House United Within Itself: SLO-Awareness for On-Premises Containerized ML Inference Clusters via Faro
Beomyeol Jeon (University of Illinois Urbana-Champaign); Chen Wang, Diana Arroyo, Alaa Youssef (IBM Research); Indranil Gupta (University of Illinois Urbana-Champaign)
Fast State Restoration in LLM Serving with HCache
Shiwei Gao, Youmin Chen, Jiwu Shu (Tsinghua University)
Optimizing Task Scheduling in Cloud VMs with Accurate vCPU Abstraction
Edward Guo (Hofstra University); Weiwei Jia (The University of Rhode Island); Xiaoning Ding (New Jersey Institute of Technology); Jianchen Shan (Hofstra University)
RoCE BALBOA: Towards FPGA-enhanced RDMA
Maximilian Jakob Heer, Benjamin Ramhorst, Jonas Dann, Gustavo Alonso (ETH Zurich)
Bounded Resource Reclamation
Viktor Reusch, Till Miemietz (Barkhausen Institut)
Towards Energy-Efficient Split Computing: A Hardware-Software Co-Design Perspective
Daniel May, Alessandro Tundo (Technische Universität Wien (TU Wien)); Shashikant Ilager (University of Amsterdam (UvA)); Ivona Brandic (Technische Universität Wien (TU Wien))
Solid State Drive Targeted Memory-Efficient Indexing for Universal I/O Patterns and Fragmentation Degrees
Junsu Im, Jeonggyun Kim, Seonggyun Oh, Jinhyung Koo, Juhyung Park, Hoon Sung Chwa (DGIST); Sam H. Noh (Virginia Tech); Sungjin Lee (DGIST)
Ladon: High-Performance Multi-BFT Consensus via Dynamic Global Ordering
Hanzheng Lyu (The University of British Columbia); Shaokang Xie, Jianyu Niu (Southern University of Science and Technology); Chen Feng (University of British Columbia); Yinqian Zhang (Southern University of Science and Technology); Ivan Beschastnikh (University of British Columbia)
HybridFlow: A Flexible and Efficient RLHF Framework
Guangming Sheng (The University of Hong Kong); Chi Zhang, Zilingfeng Ye (ByteDance); Xibin Wu (Bytedance); Wang Zhang (ByteDance); Ru Zhang (Bytedance); Yanghua Peng, Haibin Lin (ByteDance); Chuan Wu (The University of Hong Kong)
Serverless Cold Starts and Where to Find Them
Artjom Joosen (Huawei); Ahmed Hassan (Systems Infrastructure Lab, Huawei); Martin Asenov, Rajkarn Singh (Huawei); Luke Darlow (Sakana AI); Jianfeng Jianfeng (Wang); qiwen deng (Huawei Edinburgh Research Center); Adam Barker (Huawei)
Systems for LLMs Are Old News: Multimodality Is Redefining Everything We Know
Konstantinos Papaioannou, Thaleia Dimitra Doudali (IMDEA Software Institute)
TUNA: Tuning Unstable and Noisy Cloud Applications
Johannes Freischuetz (University of Wisconsin - Madison); Konstantinos Kanellis (University of Wisconsin-Madison); Brian Kroth (Microsoft); Shivaram Venkataraman (University of Wisconsin-Madison)
Maximizing Return On Investment for Sustainable Operations through Smart Workload Migration
Georgia Christofidi (IMDEA Software Institute, Universidad Politécnica de Madrid); Francisco Álvarez Terribas, Jesus Alberto Omaña Iglesias (Telefónica Research); Nicolas Kourtellis (Keysight); Thaleia Dimitra Doudali (IMDEA Software Institute)
An Architecture for Shrinking the TCB of TEEs on Heterogeneous Systems
Nils Asmussen, Carsten Weinhold (Barkhausen Institut)
DuoSwap: adaptive concurrent swapping to compressed memory and NVMe SSD
Yuben Yang (University of Sydney); Baptiste Lepers (Inria); Kimberly Keeton, Khaled Elmeleegy (Google); Willy Zwaenepoel (University of Sydney)
Efficient Deep Learning Inference on IoT Devices
Zhizhuo Liu, Min Liu, Chaonong Xu (China University of Petroleum-Beijing)
JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs
Gyeongchan Yun, Junesoo Kang, Hyunjoon Jeong, Sanghyeon Eom (UNIST); Minsung Jang (Samsung SDS); Young-ri Choi (UNIST (Ulsan National Institute of Science and Technology))
HuffmanEmbed: Using Huffman Coding for Embedding Table Compression in Deep Learning Recommendation Models
Chaoyi Jiang, Abdulla Alshabanah, Hossein Entezari Zarch, Keshav Balasubramanian, Murali Annavaram (University of Southern California)
Lauberhorn: a Smart NIC that is part of the OS
Pengcheng Xu, Timothy Roscoe (ETH Zurich)
SpotHedge: Serving AI Models on Spot Instances
Ziming Mao, Tian Xia, Zhanghao Wu, Wei-Lin Chiang, Tyler Griggs, Romil Bhardwaj, Zongheng Yang (UC Berkeley); Scott Shenker (ICSI AND UC Berkeley); Ion Stoica (UC Berkeley)
RoboRebound: Multi-Robot System Defense with Bounded-Time Interaction
Neeraj Gandhi, Yifan Cai, Andreas Haeberlen, Linh Thi Xuan Phan (University of Pennsylvania)
Automated Reasoning About Memory Accesses on Systems-on-Chip
Ben Fiedler, Samuel Gruetter, Timothy Roscoe (ETH Zurich)
Dynamic Dispatcher Assignment With Flat-Combining
Gangmin Lee, Wonsup Yoon, Sue Moon (KAIST)
Systematic Testing of Persistent Memory Applications
Henrique Fernandes, João Gonçalves, Miguel Matos (IST U. Lisboa & INESC-ID)
Ad-hoc composable cache coherent systems - a fairy tale?
Jasmin Schult, Timothy Roscoe (ETH Zurich)