Yudi Shi

I am a 3rd-year PhD student at Shanghai Jiao Tong University, supervised by Weidi Xie.

My research interests lie in Multimodal learning and Video understanding, with a primary focus on video agentic reasoning and multimodal Chain-of-Thought learning. If you’d like to collaborate or have any questions, feel free to reach out to me via email.

🚀News

  • [2026.02] Weaver has been opened to Arxiv!
  • [2025.06] StreamFormer has been accepted to ICCV 2025!
  • [2025.02] AoTD has been accepted to the CVPR 2025!

📄Publications

Weaver

Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning

Yudi Shi, Shangzhe Di, Qirui Chen, Qinian Wang, Jiayin Cai, Xiaolong Jiang, Yao Hu, Weidi Xie

Arxiv, 2026

[Project Page] [Code] [Paper]

StreamFormer

Learning Streaming Video Representation via Multitask Training

Yibin Yan*, Jilan Xu*, Shangzhe Di, Yikun Liu, Yudi Shi, Qirui Chen, Zeqian Li, Yifei Huang, Weidi Xie

ICCV, 2025 (Oral)

[Project Page] [Code] [Paper]

AoTD

Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation

Yudi Shi, Shangzhe Di, Qirui Chen, Weidi Xie

CVPR, 2025

[Project Page] [Code] [Paper]