About me
I’m a third-year PhD student in the Department of CS at University of Chicago, advised by Prof. Junchen Jiang and Prof. Shan Lu. My research interest is Systems/Software engineering for ML. I received my B.S. in CS at University of Wisconsin-Madison, fortunate to be advised by Prof. Shivaram Venkataraman.
Publications
- CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving paper
Yuhan Liu, Hanchen Li, Yihua Cheng, Siddhant Ray, Yuyang Huang, Qizheng Zhang, Kuntai Du, Jiayi Yao, Shan Lu, Ganesh Ananthanarayanan, Michael Maire, Henry Hoffmann, Ari Holtzman, Junchen Jiang
SIGCOMM 2024 - CacheBlend: Fast Large Language Model Serving with Cached Knowledge Fusion paper
Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang
EuroSys 2025 - ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications
Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire
OSDI 2024 - GRACE: Loss-Resilient Real-Time Video through Neural Codecs
Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Kuntai Du, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang
NSDI 2024 - Keeper: Automated Testing and Fixing of Machine Learning Software
Chengcheng Wan, Shicheng Liu, Sophie Xie, Yuhan Liu, Henry Hoffmann, Michael Maire, Shan Lu
TOSEM 2024 - OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation
Kuntai Du, Yuhan Liu, Yitian Hao, Qizheng Zhang, Haodong Wang, Yuyang Huang, Ganesh Ananthanarayanan, Junchen Jiang
SoCC 2023 - Run-Time Prevention of Software Integration Failures of Machine Learning APIs
Chengcheng Wan, Yuhan Liu, Kuntai Du, Henry Hoffmann, Junchen Jiang, Michael Maire, Shan Lu
OOPSLA 2023
Workshops
- Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network paper
Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang
SIGCOMM Workshop on Networks for AI Computing (NAIC)
Preprints
- DroidSpeak: Enhancing Cross-LLM Communication paper
Yuhan Liu, Esha Choukse, Shan Lu, Junchen Jiang, Madan Musuvathi - AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning paper code
Yuhan Liu, Saurabh Agarwal, Shivaram Venkataraman - Accelerating deep learning inference via learned caches paper
Arjun Balasubramanian, Adarsh Kumar, Yuhan Liu, Han Cao, Shivaram Venkataraman, Aditya Akella
Posters
- Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache
Hanchen Li, Yuhan Liu, Yihua Cheng, Kuntai Du, Junchen Jiang
NSDI 2024 Posters - AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning
Yuhan Liu, Saurabh Agarwal and Shivaram Venkataraman.
Women in Machine Learning Workshop 2020 co-located with the NeurIPS conference.
Invited Talks
- Distributed Systems Lab @ University of Pennsylvania, Nov. 2024
Teaching
- Graduate Networking (CMSC 33300), Teaching Assistant, Autumn 2024
- Intro to computer systems (CMSC 15400), Teaching Assistant, Winter 2022
- Intro to database systems (CS 564), Peer mentor, Fall 2020 (At Madison)
Awards
- UU Fellowship (2023): University of Chicago fellowship
- Neubauer Graduate Scholarship (2021): University of Chicago fellowship
- Computing Research Association Outstanding Undergraduate Researcher Awards (2021): Honorable Mention
- Trewartha Honors Senior Thesis award (2020): research grant for senior students carrying out thesis research with honor in CS.
Work Experience
- Microsoft Research, Summer 2024
Research Intern
Mentors: Madan Musuvathi, Esha Choukse, Shan Lu
Contact
yuhanl[at]uchicago.edu