I am a Ph.D. student at the University of Virginia, advised by Prof. Felix Xiaozhu Lin. My research lies at the intersection of systems and machine learning, focusing on efficient inference frameworks that minimize latency and memory consumption. Previously, I worked on low-level optimizations in the Linux kernel.
Efficient Sparsity Management for LLMs (under review)
Wonkyo Choe, Felix Xiaozhu Lin
Proto: A Guided Journey through Modern OS Construction (SOSP’25)
Wonkyo Choe*, Rongxiang Wang*, Afsara Benazir*, Felix Xiaozhu Lin
(* = equal contribution)
[PDF]
RWKV-Lite: Deeply Compressed RWKV for Resource-Constrained Devices (arXiv)
Wonkyo Choe, Yangfeng Ji, Felix Xiaozhu Lin
[PDF]
AnA: An Attentive Autonomous Driving System (ASPLOS’25)
Wonkyo Choe, Rongxiang Wang, Felix Xiaozhu Lin
[PDF]
Efficient NLP Inference at the Edge via Elastic Pipelining (ASPLOS’23)
Liwei Guo, Wonkyo Choe, Felix Xiaozhu Lin
[PDF]
Rethinking Remote Memory Placement on Large-Memory Systems with Path Diversity (APSys’21)
Wonkyo Choe*, Sang-Hoon Kim, Jeongseob Ahn
[PDF]
Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (ATC’21)
Jonghyeon Kim*, Wonkyo Choe, Jeongseob Ahn
[PDF]
A Study of Memory Placement on Hardware-Assisted Tiered Memory Systems (CAL’20)
Wonkyo Choe*, Jonghyeon Kim, Jeongseob Ahn
[PDF]