About me - Pu Yang
I am an Embodied Intelligence Algorithm Researcher at AgiBot’s Embodied Intelligence Research Center (具身研究中心). I received my Ph.D. in Computational Mathematics from Peking University in 2026, advised by Prof. Bin Dong.
My research sits at the intersection of mathematics and applied artificial intelligence. I am broadly interested in embodied intelligence and robot learning, large language models and synthetic data, reinforcement learning, and inverse problems.
Research Interests
- Embodied intelligence and robot learning
- Large language models and synthetic data
- Reinforcement learning
- Inverse problems and mathematical modeling
Selected Publications
- Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies
arXiv, 2026. Paper - Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping
NeurIPS 2025 Spotlight. Paper - Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement
ICLR 2025. Paper - L2SR: Learning to Sample and Reconstruct for Accelerated MRI via Reinforcement Learning
Inverse Problems, 2024. Paper
