Publications

Accelerating LLM Serving for Multi-turn Dialogues with Efficient Resource Management

Jinwoo Jeong and Jeongseob Ahn

ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025

GPU-centric Memory Tiering for LLM Serving with NVIDIA Grace Hopper Superchip

Woohyung Choi, Jinwoo Jeong, Hanhwi Jang, and Jeongseob Ahn

IEEE Computer Architecture Letters (CAL), 2025

Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access [ Paper | Slides | Code ]

Jinwoo Jeong, Seungsu Baek, and Jeongseob Ahn

ACM European Conference on Computer Systems (EuroSys), 2023

Gilles Muller Best Artifact Award

Memory Harvesting in Multi-GPU Systems with Hierarchical Unified Virtual Memory

Sangjin Choi*, Taeksoo Kim*, Jinwoo Jeong, Rachata Ausavarungnirun, Myeongjae Jeon, Youngjin Kwon, and Jeongseob Ahn

USENIX Annual Technical Conference (ATC), 2022

Education

Ph.D. Student in Electrical and Computer Engineering

Korea University • 2024 — current

Ph.D. Student in Computer Engineering

Ajou University • 2020 — 2024

B.E. in Software and Computer Engineering

Ajou University • 2016 — 2020