Google Scholar / DBLP / GitHub / Twitter
I am a 2nd-year student in Computer Science at Xi’an Jiaotong University, under the supervision of Prof. Andrew C. Yao. I collaborate closely with Prof. Kaisheng Ma and Prof. Li Yi at IIIS, Tsinghua University. Prior to this, I got my bachelor’s degree in Computer Science from Xidian University in 2021. Additionally, I am currently a research intern of the Foundation Model Group at Megvii Research (Face++), where I work with Zheng Ge and Xiangyu Zhang.
My recent research focuses on several areas, including Generative Modeling, 2D/3D/4D Visual Geometry Understanding, Self-Supervised Learning, and Multi/Cross-Modal Representation Learning. My goal is to develop Vision and Language Foundation Models that can effectively interact with humans. I enjoy working on interesting and cool things that fascinate me. I am enthusiastic about contributing to the research community and I support Slow Science.