Beijing University of Posts and Telecommunications (BUPT), Beijing, China
I am an associate professor at BUPT, leading a small research group working on generative and multimodal AI, with recent publications at CVPR, ICLR, ICML, NeurIPS, and IJCV. I received my PhD in Signal Processing from BUPT in 2015, advised by Professor Jun Guo at the Pattern Recognition and Intelligent Systems (PRIS) Laboratory. From 2019 to 2020, I was a visiting scholar at SketchX Lab, headed by Dr. Yi-Zhe Song, at the Centre for Vision, Speech and Signal Processing (CVSSP), University of Surrey. I was also a guest PhD at Aalborg University, Denmark in 2013, and a visiting researcher at Sun Yat-sen University, China in 2014.
My research lies at the intersection of computer vision, generative modeling, and multimodal learning. Recent work focuses on (i) scalable generative models for video and 3D (autoregressive & diffusion), (ii) geometry- and physics-aware world modeling, (iii) chain-of-thought reasoning for embodied tasks such as vision-language navigation, and (iv) post-training of diffusion models with reinforcement learning. Earlier work centered on free-hand sketch as a window into human visual abstraction.
拟招收2027年博士研究生一名(申请-考核、硕博连读),欢迎带简历邮件联系。
常年招收2-4名硕士研究生(保研+考研)、科研实习生若干名(3-6个月及以上),欢迎有科研热情的同学带简历邮件联系。
Updated May. 2026, page created using Bootstrap