pengxiang-li-1-1.jpg

Pengxiang LI

Email: pengxiangli1999[at]gmail[dot]com

I am a second-year PhD student in Beijing Institute of Technology (BIT), advised by Dr. Yuwei Wu and Dr. Yunde Jia. I am also a member of the joint PhD program (‘TONG Program’) with Beijing Institute for General Artificial Intelligence (BIGAI), and I am grateful to be advised by Dr. Qing Li and Dr. Zhi Gao. Previously, I got my Bachelor’s degree in Computer Science and Technology from BIT in 2021.
My research interests lie in Multimodal Agents, Non-Euclidean Optimization, and 3DV Understanding. Feel free to reach out if you are interested in my work or have any questions! 🤝


News

2025-01 🌟 One paper on Multimodal Agent Tuning is accepted by ICLR 2025 Spotlight.
2024-10 🌟 One paper on Feedback Learning in VLM is accepted by NeurIPS 2024.
2024-09 🌟 One journal paper on Stereo Matching is accepted by T-CSVT.

Publications

  1. 2025sport.png
    Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning
    Pengxiang Li* , Zhi Gao* , Bofei Zhang , Yapeng Mi , Xiaojian Ma , Chenrui Shi , Tao Yuan , Yuwei WuYunde JiaSong-Chun Zhu , and Qing Li
    arXiv preprint arXiv:2504.21561, 2025
  2. mat.png
    Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Spotlight
    Zhi Gao* , Bofei Zhang* , Pengxiang Li* , Xiaojian Ma , Tao Yuan , Yue FanYuwei WuYunde JiaSong-Chun Zhu , and Qing Li
    International Conference on Learning Representations (ICLR), 2025
  3. 2024sg3d.png
    Task-oriented Sequential Grounding in 3D Scenes
    Zhuofan Zhang , Ziyu ZhuPengxiang LiTengyu LiuXiaojian MaYixin ChenBaoxiong JiaSiyuan Huang , and Qing Li
    arXiv preprint arXiv:2408.04034, 2024
  4. fire.jpg
    FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models
    Neural Information Processing Systems: Datasets and Benchmarks (NeurIPS D&B), 2024
  5. issga.jpg
    Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching
    Pengxiang Li , Chengtang Yao , Yuwei Wu , and Yunde Jia
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024
  6. tutorial.jpg
    Hyperbolic Learning: Theory and Applications
    Pengxiang Li , Peilin Yu , Yangkai Xue , Yuwei Wu , and Zhi Gao
    2023
  7. Decnet.png
    A decomposition model for stereo matching
    Chengtang Yao , Yunde Jia , Huijun Di , Pengxiang Li , and Yuwei Wu
    The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021

Education

2023 - Present

Joint PhD Program with Beijing Institute for General Artificial Intelligence (BIGAI), China

2023 - Present

PhD student in Computer Science, Beijing Institute of Technology (BIT), China

2021 - 2023

MSc in Computer Science, Beijing Institute of Technology (BIT), China

2017 - 2021

BSc in Computer Science, Beijing Institute of Technology (BIT), China