Jiahao Xie
I am currently a postdoctoral researcher at the Max Planck Institute for Informatics (MPI-INF), working with Prof. Bernt Schiele.
Previously, I obtained my Ph.D. from MMLab@NTU, Nanyang Technological University, where I was supervised by Prof. Chen Change Loy and Prof. Yew Soon Ong.
I also worked closely with Prof. Ziwei Liu.
My research interests include computer vision and machine learning, with a focus on foundation models and their downstream applications.
In particular, I am interested in advancing foundation models, from vision to multimodality, beyond human supervision.
This includes self-supervised learning, representation learning, multimodal learning, generative models, and other topics related to foundation models.
I am a core developer and maintainer of OpenMMLab projects, including MMSelfSup and MMPreTrain.
Email / Google Scholar / GitHub / Twitter
|
Chain-of-Region Verification Reduces Hallucinations in Large Vision-Language Models
Jiahao Xie, Alessio Tonioni, Nathalie Rauschmayr, Federico Tombari, Bernt Schiele
Under Review, 2025
|
|
Improved Vision-Language Alignment via Text-Conditioned Image Embeddings
Sweta Mahajan*, Sukrut Rao*, Jiahao Xie, Alexander Koller, Bernt Schiele
(*=equal contribution)
Under Review, 2025
|
|
CNS-Bench: Benchmarking Model Robustness Under Continuous Nuisance Shifts
Olaf Dünkel, Jiahao Xie*, Artur Jesslen*, Christian Theobalt, Christian Rupprecht, Adam Kortylewski
(*=equal contribution)
Under Review, 2025
|
|
Test-Time Visual In-Context Tuning
Jiahao Xie, Alessio Tonioni, Nathalie Rauschmayr, Federico Tombari, Bernt Schiele
CVPR, 2025
Paper / Code
|
|
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
IJCV, 2024
Paper / Code
|
|
Correlational Image Modeling for Self-Supervised Visual Pre-Training
Wei Li, Jiahao Xie, Chen Change Loy
CVPR, 2023
Paper / Code
|
|
Masked Frequency Modeling for Self-Supervised Visual Pre-Training
Jiahao Xie, Wei Li, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy
ICLR, 2023
Paper / Project Page / Code
|
|
Delving into Inter-Image Invariance for Unsupervised Visual Representations
Jiahao Xie, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy
IJCV, 2022
Paper / Code
|
|
UniVIP: A Unified Framework for Self-Supervised Visual Pre-training
Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang
CVPR, 2022
Paper
|
|
Unsupervised Object-Level Representation Learning from Scene Images
Jiahao Xie, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy
NeurIPS, 2021
Paper / Project Page / Code
|
|
Online Deep Clustering for Unsupervised Representation Learning
Jiahao Xie*, Xiaohang Zhan*, Ziwei Liu, Yew Soon Ong, Chen Change Loy
(*=equal contribution)
CVPR, 2020
Paper / Code
|
- Conference Reviewer: NeurIPS 2022-25, ICLR 2024-25, ICML 2024, CVPR 2021-25, ICCV 2021-25, ECCV 2024, AAAI 2021
- Journal Reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), International Journal of Computer Vision (IJCV)
|