About me - Pu Yang

I am an Embodied Intelligence Algorithm Researcher at AgiBot’s Embodied Intelligence Research Center (具身研究中心). I received my Ph.D. in Computational Mathematics from Peking University in 2026, advised by Prof. Bin Dong.

My research sits at the intersection of mathematics and applied artificial intelligence. I am broadly interested in embodied intelligence and robot learning, large language models and synthetic data, reinforcement learning, and inverse problems.

Research Interests

  • Embodied intelligence and robot learning
  • Large language models and synthetic data
  • Reinforcement learning
  • Inverse problems and mathematical modeling

Selected Publications

  • Learning while Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies
    arXiv, 2026.
  • Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping
    NeurIPS 2025 Spotlight.
  • Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement
    ICLR 2025.
  • L2SR: Learning to Sample and Reconstruct for Accelerated MRI via Reinforcement Learning
    Inverse Problems, 2024.