Kevin (Qi) Zhao 赵齐

picture

About Me

Hi there, I am Kevin. Currently, I am an AI researcher at TikTok focusing on vision language models(VLM), 3D, and unified visual understanding and generation. As part of Dr. Lu Jiang's team, I helped build Seaweed, a foundation model for video generation.

Previously, I obtained my Master of Science degree at Brown University, where I was fortunate to collaborate closely with professor Chen Sun and his group on video understanding. I have also worked with professor George Konidaris and his PhD student Haotian Fu on embodied agents. I obtained my Bachelor of Science degree at NYU.

I grew up in Shanghai until I went to high school in Scottsdale, AZ.



Selected Projects

Synthetic Video Enhances Physical Fidelity in Video Synthesis

Qi Zhao, Xingyu Ni, Ziyu Wang, Feng Cheng, Ziyan Yang, Lu Jiang*, Bohan Wang*

ICCV 2025

[arXiv] [website] [huggingface]

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Team Seaweed, Qi Zhao, et al.

Technical Report

[website] [paper]

EPO: Hierarchical LLM Agents with Environment Preference Optimization

Qi Zhao*, Haotian Fu*, Chen Sun, George Konidaris

EMNLP 2024 Main

[arXiv] [code]

Vamos: Versatile Action Models for Video Understanding

Shijie Wang, Qi Zhao, Minh Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun

ECCV 2024

[website] [arXiv] [code]

AntGPT: Can Large Language Model Help Long-term Action Anticipation from Videos?

Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun

ICLR 2024

[website] [arXiv] [video] [code]

Contact Me

Email: qi_zhao [at] alumni [dot] brown [dot] edu