I am a second-year Ph.D. candidate at Shanghai Jiao Tong University, supervised by Prof. Jifeng Dai. I received my bachelor’s degree from Beihang University and a double-major bachelor’s degree in economics from Peking University. I won the Best Paper Award at ACM Multimedia 2021 and the Best Video Award at IJCAI 2021. I am currently an intern at OpenGVLab, Shanghai AI Laboratory, and previously interned at CoLab, SenseTime, and Sea AI Lab.

Interests

  • Artificial Intelligence
  • Computer Vision
  • Music Generation

Education

  • Ph.D. (joint program with Shanghai AI Lab), 2022-

    Shanghai Jiao Tong University

  • B.A. in Economics (double major), 2019-2022

    Peking University

  • B.Eng. in Computer Science, 2018-2022

    Beihang University

Working Experience

Shanghai AI Lab
Research Intern
Nov 2022 – Present, Shanghai

Research Intern
Feb 2022 – Oct 2022, Beijing
Fundamental Vision Group

Sea AI Lab
Research Intern
Aug 2021 – Feb 2022, Beijing
Research on music AI


Publications

(2024). Parameter-Inverted Image Pyramid Networks. Preprint.


(2024). Synergizing Spatial Optimization with Large Language Models for Open-Domain Urban Itinerary Planning. Preprint.


(2023). Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. In CVPR 2024.


(2022). Video Background Music Generation: Dataset, Method and Evaluation. In ICCV 2023.


(2021). Video Background Music Generation with Controllable Music Transformer. In ACM MM 2021 (Best Paper Award).


(2021). Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. In AAAI 2021.



Conference Reviewer:

  • ICCV 2023
  • CVPR 2024
  • ECCV 2024
  • NeurIPS 2024

Teaching Assistant:

  • Fundamentals of Computers (Spring 2021)
  • Software Engineering (Spring 2022)