Biography
Hi, I'm Xiangpeng. I am currently a Ph.D. student at the University of Technology Sydney (UTS).
My research interests involve Generative AI, Video Generation, and Multi-modal Learning. Specifically, I focus on video world models, video generation, and multi-modal foundation models.
Looking ahead, I am deeply motivated to build unified video models capable of jointly understanding dynamic visual environments and generating coherent future content within a single framework. I believe this direction is a crucial step toward world models, where systems can reason about and interact with the physical world through continuous video understanding and prediction.
I am currently seeking research intern opportunities. If there are suitable positions available, please feel free to reach out. Thank you!
News
Selected Publications

Industry Experience
Select Awards
Academic Service