Yudi Shi
I am a 3rd-year PhD student at Shanghai Jiao Tong University, supervised by Weidi Xie.
My research interests lie in Multimodal learning and Video understanding, with a primary focus on video agentic reasoning and multimodal Chain-of-Thought learning. If you’d like to collaborate or have any questions, feel free to reach out to me via email.
🚀News
- [2026.02] Weaver has been opened to Arxiv!
- [2025.06] StreamFormer has been accepted to ICCV 2025!
- [2025.02] AoTD has been accepted to the CVPR 2025!
📄Publications

Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning
Yudi Shi, Shangzhe Di, Qirui Chen, Qinian Wang, Jiayin Cai, Xiaolong Jiang, Yao Hu, Weidi Xie
Arxiv, 2026
[Project Page] [Code] [Paper]

Learning Streaming Video Representation via Multitask Training
Yibin Yan*, Jilan Xu*, Shangzhe Di, Yikun Liu, Yudi Shi, Qirui Chen, Zeqian Li, Yifei Huang, Weidi Xie
ICCV, 2025 (Oral)
[Project Page] [Code] [Paper]

