BIO

I am a third-year PhD student in the School of Computer Science at Carnegie Mellon University(CMU), specializing in machine learning and computer software engineering. My research focuses on Large Language Model post-training, make the LLMs better (more align with domain specific tasks), faster (more efficient in training and inference), and cheaper (training with less GPU hours and GPU memory utilization).

At CMU, I am advised by Prof. Heather Miller. Previously, I earned my master in Computer Science from New York University advised by Prof. Anna Choromanska and Prof. Parijat Dube. I received my B.S. in Computer Science and Engineering from The Chinese University of Hong Kong(CUHK) where I worked with Prof. David Zhang, Dapeng and Prof. Rui Huang. Before starting my PhD, My research mainly focuses on distributed machine learning system.

  • (This personal website is updated as of March 2025.)

News

Selected Publications

My full publication list can be found on my Google Scholar profile.


Academic Blog

Education

  • Ph.D. in Machine Learning and Software Engineering at Carnegie Mellon University, 2023-present
    • GPA: 4.16/4.0, Rank: top1%
  • M.S. in Computer Engineering at New York University, 2021-2023
    • GPA: 3.93/4.0, Rank: top1%
  • B.S. in Computer Science and Engineering at The Chinese University of Hong Kong, 2016-2020

Work Experience

  • Applied Research Scientist Intern, Amazon AWS AI Labs, Summer 2025
  • Teaching Assistant, Carnegie Mellon University, LTI at SCS, Large Language Model Systems (11-868), Spring 2025
  • Research Assistant, Carnegie Mellon University, S3D at SCS, 2023 ~ present
  • Research Assistant, New York University, Engineering School, 2022 ~ 2023

Service

  • Reviewer, International Conference on Learning Representations (ICLR) — 2025, 2026
  • Reviewer, International Joint Conference on Neural Networks (IJCNN) — 2025
  • Reviewer, The Association for the Advancement of Artificial Intelligence (AAAI) — 2025
  • Reviewer, International Conference on Acoustics, Speech, and Signal Processing (ICASSP) — 2022 – 2025
  • Reviewer, International Conference on Computer Vision (ICCV) Workshops — 2023