Yuzhen Mao's Homepage

Yuzhen Mao
yuzhenm [at] stanford [dot] edu

I am a Ph.D. student in Computer Science at Stanford University, where I am fortunate to be advised by Prof. Azalia Mirhoseini and Prof. Christos Kozyrakis. My current research focuses on efficient and agentic LLM systems, especially for long-context reasoning.

If we share research interests, feel free to send me an email. I am always open to collaborations.

Selected Publications (* denotes equal contribution)

Decentralized Multi-Agent Systems with Shared Context
Yuzhen Mao, Azalia Mirhoseini
arXiv preprint, 2026
Media coverage: VentureBeat

Simplified Sparse Attention via Gist Tokens
Yuzhen Mao, Michael Y. Li, Emily B. Fox
arXiv preprint, 2026

IceCache: Memory-Efficient KV-cache Management for Long-Sequence LLMs
Yuzhen Mao, Qitong Wang, Martin Ester, Ke Li
International Conference on Learning Representations (ICLR), 2026

Mem-α: Learning Memory Construction via Reinforcement Learning
Yu Wang, Ryuichi Takanobu, Zhiqi Liang, Yuzhen Mao, Yuanzhe Hu, Julian McAuley, Xiaojian Wu
arXiv preprint, 2025

IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs
Yuzhen Mao, Martin Ester, Ke Li
International Conference on Learning Representations (ICLR), 2024

Phenotype prediction from single-cell RNA-seq data using attention-based neural networks
Yuzhen Mao*, Yen-Yi Lin*, Nelson K. Y. Wong, Stanislav Volik, Funda Sar, Colin Collins, Martin Ester
Bioinformatics, 2024

Last-Layer Fairness Fine-tuning is Simple and Effective for Neural Networks
Yuzhen Mao, Zhun Deng, Huaxiu Yao, Ting Ye, Kenji Kawaguchi, James Zou
ICML Workshop on Spurious Correlations, Invariance, and Stability, 2023

Augmenting Knowledge Transfer across Graphs
Yuzhen Mao, Jianhui Sun, Dawei Zhou
IEEE International Conference on Data Mining (ICDM), 2022

Design and source code from Jon Barron. Style adapted from Zhijian Liu and Ligeng Zhu.