Mengxi Wu 武梦溪

I am a Ph.D. Candidate in the Department of Computer Science at the University of Southern California, advised by Prof. Xuezhe Ma. I received my Master of Science in Computer Science from New York University, where I was advised by Prof. Yi-Jen Chiang, Prof. Christopher Musco, and Prof. Yi Fang. Before that, I was an undergraduate student in EECS at the University of Michigan, Ann Arbor.

Research

My research focuses on the foundations of efficiency in large language model (LLM) pre-training. I am interested in methods that scale from small proxy models to trillion-parameter foundation models, as well as the mathematical principles underlying training dynamics. I work on hyperparameter optimization, analytically linking regularization, model architecture, and dataset properties. I also study optimization algorithms and develop new optimizers. Previously, my work included transfer learning with theoretical guarantees, adversarial machine learning for 3D point cloud processing, and streaming algorithms for time-varying volume data.

Publications and Manuscripts (By Years / Selected)
Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths
Xuezhe Ma*, Shicheng Wen*, Linghao Jin*, Bilge Acun*, Ruihang Lai*, Bohan Hou, Will Lin, Hao Zhang, Songlin Yang, Ryan Lee, Mengxi Wu, Jonathan May, Luke Zettlemoyer, Carole-Jean Wu.

Preprint

[Paper] [Code]

Curvature Diversity-Driven Nuclear-Norm Wasserstein Domain Alignment for Point Cloud
Mengxi Wu, Hao Huang, Yi Fang, Mohammad Rostami

Transactions on Machine Learning Research, 2025

[Paper] [Code]

Graph Harmony: Denoising and Nuclear-Norm Wasserstein Adaptation for Enhanced Domain Transfer in Graph-Structured Data
Mengxi Wu, Mohammad Rostami

Transactions on Machine Learning Research, 2024

[Paper] [Code]

Streaming Approach to In Situ Selection of Key Time Steps for Time-Varying Volume Data
Mengxi Wu, Yi-Jen Chiang, Christopher Musco

Eurographics/IEEE Conference on Visualization, 2022

[Paper] [Code]

3D Point Cloud Completion with Geometric-Aware Adversarial Augmentation
Mengxi Wu, Hao Huang, Yi Fang

International Conference on Pattern Recognition, 2022

[Paper] [Code]


Website template from Jiayuan Mao. Last updated: Jan 2026.