Kai Han

🚩Openings:
We are always looking for strong candidates (PhD/Postdoc/RA/...) to work on exciting research problems in computer vision, machine learning, and artificial intelligence. PhD scholarship opportunities ☞ HKU-PS/HKPFS/PGS; HKU-BICI; HKU-ASTRI.
Please drop me an email with your resume if you are interested in working with me.

June 2025	Two papers (Inpaint4Drag, GRAB) are accepted to ICCV 2025.
June 2025	Talks @ CVPR 2025 in Nashville: Lightning talk at CVPR Area Chair Workshop; Keynote talks at CVPR Workshops on Fine-Grained Visual Categorization, Visual Anomaly and Novelty Detection, and Domain Generalization.
May 2025	Two papers (GAMEBot, PruneVid) are accepted to ACL 2025.
Mar 2025	Splat4D is accepted to SIGGRAPH 2025.
Feb 2025	Six papers (ICE, HypCD, Mr. DETR, v-CLR, GSPN, PASS) are accepted to CVPR 2025.
Jan 2025	Five papers (HiLo, DebGCD, BiGR, Needle Threading, AvatarGO) are accepted to ICLR 2025.
Sept 2024	SciFIBench is accepted to NeurIPS 2024.
Sept 2024	Invited to serve as: Area Chair for ICLR 2025, Area Chair for CVPR 2025.
Aug 2024	Our Dissecting OOD and OSR paper is accepted to IJCV.
Jul 2024	Three papers (RegionDrag, PromptCCD, and ConceptExpress) are accepted to ECCV 2024.
Feb 2024	Three papers (IBD-SLAM, DreamAvatar, and SD4Match) are accepted to CVPR 2024.
Feb 2024	CiPR is accepted to TMLR 2024.
Jan 2024	Two papers (SPTNet and FROSTER) are accepted to ICLR 2024.
Oct 2023	Invited to serve as an Area Chair for ECCV 2024.
Sept 2023	One paper on text-guided 3D head avatar generation and editing is accepted to NeurIPS 2023.
Aug 2023	One paper on visual correspondence is accepted to TPAMI.
July 2023	Two papers (on generalized category discovery/open-vocabulary semantic segmentation) are accepted to ICCV 2023.
June 2023	Invited to serve as an Area Chair for CVPR 2024.
Mar 2023	OOD-CV workshop @ ICCV 2023. Welcome participants from all over!
Feb 2023	Two papers (on compositional zero-shot learning/3D human digitization) are accepted to CVPR 2023.
Jul 2022	One paper on novel category discovery without forgetting is accepted to ECCV 2022.
Jun 2022	Best Paper Runner-Up Award at CVPR 2022 Workshop on Continual Learning in Computer Vision.
Mar 2022	Three papers (about generalized category discovery/3D human digitization/instance segmentation) are accepted to CVPR 2022.
Jan 2022	One paper about open-set recognition is accepted to ICLR 2022.
Oct 2021	One paper about visual correspondence is accepted to BMVC 2021.
Sept 2021	One paper about novel category discovery is accepted to NeurIPS 2021.
Sept 2021	Recognized as an Outstanding Reviewer for ICCV 2021, in the top 5% of experienced reviewers.
July 2021	One paper about single- and multi-modal novel category discovery is accepted to ICCV 2021.
June 2021	Our AutoNovel paper is accepted to TPAMI.
June 2021	Recognized as an Outstanding Reviewer for CVPR 2021.
May 2021	One paper about dynamic convolution for semantic scene completion is accepted to TPAMI.
Mar 2021	One paper about long-tailed recognition is accepted to CVPR 2021.
Dec 2020	One paper about mirror surface reconstruction is accepted to IEEE TIP.
Sept 2020	One paper about dense correspondence is accepted to NeurIPS 2020.
Jun 2020	One paper about deep photometric stereo is accepted to TPAMI.
June 2020	Recognized as an Outstanding Reviewer for CVPR 2020.
Feb 2020	Two papers (about semantic correspondence / 3D semantic scene completion) are accepted to CVPR 2020.
Dec 2019	One paper about novel category discovery is accepted to ICLR 2020.
Jul 2019	One paper about novel category discovery is accepted to ICCV 2019.
Jul 2019	One paper about transparent object matting is accepted to IJCV.
Mar 2019	Two papers (about unsupervised object discovery and matching / uncalibrated photometric stereo) are accepted to CVPR 2019.

	Jonathan Roberts, Kai Han, Samuel Albanie GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models International Conference on Computer Vision (ICCV), 2025. BibTeX \| PDF \| arXiv \| project page \| data \| code @inproceedings{Roberts2025GRAB, author = {Jonathan Roberts and Kai Han and Samuel Albanie}, title = {GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models}, booktitle = {International Conference on Computer Vision (ICCV)}, year = {2025}, }
	Wenye Lin, Jonathan Roberts, Yunhan Yang, Samuel Albanie, Zongqing Lu, Kai Han^† GAMEBoT: Transparent Assessment of LLM Reasoning in Games Annual Meeting of the Association for Computational Linguistics (ACL), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Lin2025GAMEBot, author = {Wenye Lin and Jonathan Roberts and Yunhan Yang and Samuel Albanie and Zongqing Lu and Kai Han}, title = {GAMEBoT: Transparent Assessment of LLM Reasoning in Games}, booktitle = {Annual Meeting of the Association for Computational Linguistics (ACL)}, year = {2025}, }
	Xiaohu Huang, Hao Zhou, Kai Han^† PruneVid: Visual Token Pruning for Efficient Video Large Language Models Annual Meeting of the Association for Computational Linguistics (ACL) Findings, 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Huang2025PruneVid, author = {Xiaohu Huang and Hao Zhou and Kai Han}, title = {PruneVid: Visual Token Pruning for Efficient Video Large Language Models}, booktitle = {Annual Meeting of the Association for Computational Linguistics (ACL) Findings}, year = {2025}, }
	Minghao Yin, Yukang Cao, Songyou Peng, Kai Han^† Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation SIGGRAPH, 2025. BibTeX \| PDF \| DOI @inproceedings{Yin2025Splat4D, author = {Minghao Yin and Yukang Cao and Songyou Peng and Kai Han}, title = {Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation}, booktitle = {SIGGRAPH}, year = {2025}, }
	Fernando Julio Cendra, Kai Han^† ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. Highlight presentation (2.8% of submissions) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Cendra2025ICE, author = {Fernando Julio Cendra and Kai Han}, title = {ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Yuanpei Liu, Zhenqi He, Kai Han^† Hyperbolic Category Discovery IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Liu2025HypCD, author = {Yuanpei Liu and Zhenqi He and Kai Han}, title = {Hyperbolic Category Discovery}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Chang-Bin Zhang, Yujie Zhong, Kai Han^† Mr. DETR: Instructive Multi-Route Training for Detection Transformers IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Zhang2025MrDETR, author = {Chang-Bin Zhang and Yujie Zhong and Kai Han}, title = {Mr. DETR: Instructive Multi-Route Training for Detection Transformers}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Chang-Bin Zhang, Jinhong Ni, Yujie Zhong, Kai Han^† v-CLR: View-Consistent Learning for Open-World Instance Segmentation IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. Highlight presentation (2.8% of submissions) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Zhang2025vCLR, author = {Chang-Bin Zhang and Jinhong Ni and Yujie Zhong and Kai Han}, title = {v-CLR: View-Consistent Learning for Open-World Instance Segmentation}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Hongjun Wang, Wonmin Byeon, Jiarui Xu, Jinwei Gu, Ka Chun Cheung, Xiaolong Wang, Kai Han^†, Jan Kautz, Sifei Liu Parallel Sequence Modeling via Generalized Spatial Propagation Network IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Wang2025GSPN, author = {Hongjun Wang and Wonmin Byeon and Jiarui Xu and Jinwei Gu and Ka Chun Cheung and Xiaolong Wang and Kai Han and Jan Kautz and Sifei Liu}, title = {Parallel Sequence Modeling via Generalized Spatial Propagation Network}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Muli Yang, Gabriel James Goenawan, Huaiyuan Qin, Kai Han, Xi Peng, Yanhua Yang, Hongyuan Zhu Detecting Open World Objects via Partial Attribute Assignment IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. BibTeX \| PDF \| code @inproceedings{Yang2025PASS, author = {Muli Yang and Gabriel James Goenawan and Huaiyuan Qin and Kai Han and Xi Peng and Yanhua Yang and Hongyuan Zhu}, title = {Detecting Open World Objects via Partial Attribute Assignment}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2025}, }
	Yuanpei Liu, Kai Han^† DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery International Conference on Learning Representations (ICLR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Liu2025DebGCD, author = {Yuanpei Liu and Kai Han}, title = {DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2025}, }
	Hongjun Wang, Sagar Vaze, Kai Han^† HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts International Conference on Learning Representations (ICLR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{wang2025hilo, author = {Hongjun Wang and Sagar Vaze and Kai Han}, title = {HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2025}, }
	Jonathan Roberts, Kai Han, Samuel Albanie Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks? International Conference on Learning Representations (ICLR), 2025. BibTeX \| PDF \| arXiv \| project page \| code \| dataset @inproceedings{Roberts2025Needle, author = {Jonathan Roberts and Kai Han and Samuel Albanie}, title = {Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks?}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2025}, }
	Shaozhe Hao, Xuantong Liu, Xianbiao Qi, Shihao Zhao, Bojia Zi, Rong Xiao, Kai Han^†, Kwan-Yee K. Wong BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities International Conference on Learning Representations (ICLR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Hao2025BiGR, author = {Shaozhe Hao and Xuantong Liu and Xianbiao Qi and Shihao Zhao and Bojia Zi and Rong Xiao and Kai Han and Kwan-Yee~K. Wong}, title = {Bi{GR}: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2025}, }
	Yukang Cao, Liang Pan, Kai Han, Kwan-Yee K. Wong, Ziwei Liu AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation International Conference on Learning Representations (ICLR), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Cao2025AvatarGO, author = {Yukang Cao and Liang Pan and Kai Han and Kwan-Yee K. Wong and Ziwei Liu}, title = {AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2025}, }
	Chaohao Xie, Kai Han^†, Kwan-Yee K. Wong VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{Xie2025VipDiff, author = {Chaohao Xie and Kai Han and Kwan-Yee K. Wong}, title = {VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models}, booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, year = {2025}, }
	Zhi Xu, Shaozhe Hao, Kai Han^† CusConcept: Customized Visual Concept Decomposition with Diffusion Models IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. BibTeX \| PDF \| arXiv \| code @inproceedings{Xu2025CusConcept, author = {Zhi Xu and Shaozhe Hao and Kai Han}, title = {CusConcept: Customized Visual Concept Decomposition with Diffusion Models}, booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, year = {2025}, }

	Jonathan Roberts, Kai Han, Neil Houlsby, Samuel Albanie SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation Conference on Neural Information Processing Systems (NeurIPS), 2024. BibTeX \| PDF \| arXiv \| data & code @inproceedings{Roberts2024SciFIBench, author = {Jonathan Roberts and Kai Han and Neil Houlsby and Samuel Albanie}, title = {SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation}, booktitle = {Conference on Neural Information Processing Systems}, year = {2024}, }
	Hongjun Wang, Sagar Vaze, Kai Han^† Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks International Journal of Computer Vision (IJCV), 2024. BibTeX \| PDF \| arXiv \| project page \| code @article{wang2024dissect, author = {Wang, Hongjun and Vaze, Sagar and Han, Kai}, title = {Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks}, journal = {International Journal of Computer Vision (IJCV)}, year = {2024} }
	Jingyi Lu, Xinghui Li, Kai Han^† RegionDrag: Fast Region-Based Image Editing with Diffusion Models European Conference on Computer Vision (ECCV), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{lu2024regiondrag, author = {Jingyi Lu and Xinghui Li and Kai Han}, title = {RegionDrag: Fast Region-Based Image Editing with Diffusion Models}, booktitle = {European Conference on Computer Vision (ECCV)}, year = {2024}, }
	Fernando Julio Cendra, Bingchen Zhao, Kai Han^† PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery European Conference on Computer Vision (ECCV), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{cendra2024promptccd, author = {Fernando Julio Cendra and Bingchen Zhao and Kai Han}, title = {PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery}, booktitle = {European Conference on Computer Vision}, year = {2024} }
	Shaozhe Hao, Kai Han^†, Zhengyao Lv, Shihao Zhao, Kwan-Yee K. Wong ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction European Conference on Computer Vision (ECCV), 2024. Oral presentation (2.3% of submissions) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{hao2024conceptexpress, author = {Shaozhe Hao and Kai Han and Zhengyao Lv and Shihao Zhao and Kwan-Yee~K. Wong}, title = {Concept{E}xpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction}, booktitle = {European Conference on Computer Vision}, year = {2024} }
	Minghao Yin, Shangzhe Wu, Kai Han^† IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024. BibTeX \| PDF \| project page @inproceedings{yin2024ibdslam, author = {Minghao Yin and Shangzhe Wu and Kai Han}, title = {IBD-SLAM: Learning Image-Based Depth Fusion for Generalizable SLAM}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2024}, }
	Yukang Cao, Yan-Pei Cao, Kai Han^†, Ying Shan, Kwan-Yee K. Wong DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{cao24dream, author = {Yukang Cao and Yan-Pei Cao and Kai Han and Ying Shan and Kwan-Yee K. Wong}, title = {DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2024}, }
	Xinghui Li, Jingyi Lu, Kai Han^†, Victor Prisacariu SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{li2024sd4match, author = {Xinghui Li and Jingyi Lu and Kai Han and Victor Prisacariu}, title = {SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2024}, }
	Kai Han, Xiaohu Huang, Yandong Li, Sagar Vaze, Jie Li, Xuhui Jia What’s in a Name? Beyond Class Indices for Image Recognition CVPR Workshop on Computer Vision in the Wild, 2024. BibTeX \| PDF \| arXiv \| code @inproceedings{han23scd, author = {Kai Han and Xiaohu Huang and Yandong Li and Sagar Vaze and Jie Li and Xuhui Jia}, title = {What's in a Name? Beyond Class Indices for Image Recognition}, booktitle = {CVPR Workshop on Computer Vision in the Wild}, year = {2024}, }
	Jonathan Roberts, Timo Lüddecke, Rehan Sheikh, Kai Han, Samuel Albanie Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs CVPR Workshop on EarthVision, 2024. BibTeX \| PDF \| arXiv \| dataset @inproceedings{Roberts2024NewTerri, author = {Roberts, Jonathan and L\"uddecke, Timo and Sheikh, Rehan and Han, Kai and Albanie, Samuel}, title = {Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs}, booktitle = {CVPR Workshop on EarthVision}, year = {2024}, }
	Shaozhe Hao, Kai Han^†, Kwan-Yee K. Wong CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery Transactions on Machine Learning Research (TMLR), 2024. BibTeX \| PDF \| arXiv \| code @article{hao24cipr, title = {CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery}, author = {Shaozhe Hao and Kai Han and Kwan-Yee K. Wong}, journal = {Transactions on Machine Learning Research}, year = {2024}, }
	Hongjun Wang, Sagar Vaze, Kai Han^† SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning International Conference on Learning Representations (ICLR), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{wang2024sptnet, author = {Hongjun Wang and Sagar Vaze and Kai Han}, title = {SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2024}, }
	Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han^† FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition International Conference on Learning Representations (ICLR), 2024. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{huang2024froster, author = {Xiaohu Huang and Hao Zhou and Kun Yao and Kai Han}, title = {FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2024}, }

	Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, Kwan-Yee K. Wong HeadSculpt: Crafting 3D Head Avatars with Text Conference on Neural Information Processing Systems (NeurIPS), 2023. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{han2023headsculpt, author = {Xiao Han and Yukang Cao and Kai Han and Xiatian Zhu and Jiankang Deng and Yi-Zhe Song and Tao Xiang and Kwan-Yee K. Wong}, title = {HeadSculpt: Crafting 3D Head Avatars with Text}, booktitle = {Conference on Neural Information Processing Systems (NeurIPS)}, year = {2023}, }
	Jonathan Roberts, Timo Lüddecke, Sowmen Das, Kai Han, Samuel Albanie GPT4GEO: How a Language Model Sees the World's Geography NeurIPS Workshop on Foundation Models for Decision Making, 2023. BibTeX \| PDF \| arXiv \| code @inproceedings{roberts2023GPT4GEO, author = {Roberts, Jonathan and L{\"u}ddecke, Timo and Das, Sowmen and Han, Kai and Albanie, Samuel}, title = {GPT4GEO: How a Language Model Sees the World's Geography}, booktitle = {NeurIPS Workshop on Foundation Models for Decision Making}, year = {2023}, }
	Xinghui Li, Kai Han^†, Shuda Li, Victor Prisacariu DualRC: A Dual-Resolution Learning Framework with Neighbourhood Consensus for Visual Correspondences IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. BibTeX \| DOI \| project page \| code @article{li23dualrc, author = {Xinghui Li and Kai Han and Shuda Li and Victor Prisacariu}, title = {DualRC: A Dual-Resolution Learning Framework with Neighbourhood Consensus for Visual Correspondences}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2023}, }
	Bingchen Zhao, Xin Wen, Kai Han^† Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery International Conference on Computer Vision (ICCV), 2023. BibTeX \| PDF \| arXiv \| code @inproceedings{zhao23learning, author = {Bingchen Zhao and Xin Wen and Kai Han}, title = {Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery}, booktitle = {International Conference on Computer Vision (ICCV)}, year = {2023}, }
	Cong Han, Yujie Zhong, Dengjie Li, Kai Han^†, Lin Ma^† Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network International Conference on Computer Vision (ICCV), 2023. BibTeX \| PDF \| arXiv \| code @inproceedings{han23open, author = {Cong Han and Yujie Zhong and Dengjie Li and Kai Han and Lin Ma}, title = {Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network}, booktitle = {International Conference on Computer Vision (ICCV)}, year = {2023}, }
	Jonathan Roberts, Kai Han, Samuel Albanie SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models ICCV TNGCV Workshop, 2023. BibTeX \| PDF \| arXiv \| project page @inproceedings{roberts2023satin, title = {SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models}, author = {Jonathan Roberts and Kai Han and Samuel Albanie}, booktitle = {ICCV TNGCV Workshop}, year = {2023}, }
	Shaozhe Hao, Kai Han^†, Kwan-Yee K. Wong Learning Attention as Disentangler for Compositional Zero-shot Learning IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{hao2023ade, author = {Shaozhe Hao and Kai Han and Kwan-Yee K. Wong}, title = {Learning Attention as Disentangler for Compositional Zero-shot Learning}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2023}, }
	Yukang Cao, Kai Han, Kwan-Yee K. Wong SeSDF: Self-evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{cao2023sesdf, author = {Yukang Cao and Kai Han and Kwan-Yee K. Wong}, title = {SeSDF: Self-evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2023}, }

	K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian Novel Class Discovery without Forgetting European Conference on Computer Vision (ECCV), 2022. BibTeX \| PDF \| arXiv @inproceedings{joseph22ncdwf, author = {K J Joseph and Sujoy Paul and Gaurav Aggarwal and Soma Biswas and Piyush Rai and Kai Han and Vineeth N Balasubramanian}, title = {Novel Class Discovery without Forgetting}, booktitle = {European Conference on Computer Vision (ECCV)}, year = {2022}, }
	Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman Generalized Category Discovery IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{vaze22generalized, author = {Sagar Vaze and Kai Han and Andrea Vedaldi and Andrew Zisserman}, title = {Generalized Category Discovery}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2022}, }
	Yukang Cao, Guanying Chen, Kai Han, Wenqi Yang, Kwan-Yee K. Wong JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022. Oral presentation (4.2% of submissions) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{cao22jiff, author = {Yukang Cao and Guanying Chen and Kai Han and Wenqi Yang and Kwan-Yee K. Wong}, title = {JIFF: Jointly-aligned Implicit Face Function for High Quality Single View Clothed Human Reconstruction}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2022}, }
	Chenming Zhu, Xuanye Zhang, Yanran Li, Liangdong Qiu, Kai Han, Xiaoguang Han SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022. BibTeX \| PDF \| arXiv \| project page @inproceedings{zhu22sharpcontour, author = {Chenming Zhu and Xuanye Zhang and Yanran Li and Liangdong Qiu and Kai Han and Xiaoguang Han}, title = {SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2022}, }
	K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian Spacing Loss for Discovering Novel Categories CVPR Workshop on Continual Learning in Computer Vision, 2022. Best Paper Runner-Up Award BibTeX \| PDF \| arXiv @inproceedings{joseph22spacing, author = {K J Joseph and Sujoy Paul and Gaurav Aggarwal and Soma Biswas and Piyush Rai and Kai Han and Vineeth N Balasubramanian}, title = {Spacing Loss for Discovering Novel Categories}, booktitle = {CVPR Workshop on Continual Learning in Computer Vision}, year = {2022}, }
	Sagar Vaze, Kai Han, Andrea Vedaldi, Andrew Zisserman Open-Set Recognition: A Good Closed-Set Classifier is All You Need? International Conference on Learning Representations (ICLR), 2022. Oral presentation (1.6% of submissions) BibTeX \| PDF \| arXiv \| OpenReview \| project page \| code @inproceedings{vaze22openset, author = {Sagar Vaze and Kai Han and Andrea Vedaldi and Andrew Zisserman}, title = {Open-Set Recognition: A Good Closed-Set Classifier is All You Need?}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2022}, }

	Bingchen Zhao, Kai Han^† Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation Conference on Neural Information Processing Systems (NeurIPS), 2021. BibTeX \| PDF \| arXiv \| OpenReview \| supplementary \| project page \| code @inproceedings{zhao21novel, author = {Bingchen Zhao and Kai Han}, title = {Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation}, booktitle = {Conference on Neural Information Processing Systems (NeurIPS)}, year = {2021} }
	Xuhui Jia, Kai Han^†, Yukun Zhu, Bradley Green Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data International Conference on Computer Vision (ICCV), 2021. BibTeX \| PDF \| arXiv @inproceedings{jia21joint, author = {Xuhui Jia and Kai Han and Yukun Zhu and Bradley Green}, title = {Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data}, booktitle = {International Conference on Computer Vision (ICCV)}, year = {2021} }
	Georgi Tinchev, Shuda Li, Kai Han, David Mitchell, Rigas Kouskouridas 𝕏Resolution Correspondence Networks British Machine Vision Conference (BMVC), 2021. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{tinchev20xresolution, author = {Georgi Tinchev and Shuda Li and Kai Han and David Mitchell and Rigas Kouskouridas}, title = {{$\mathbb{X}$}Resolution Correspondence Networks}, booktitle = {British Machine Vision Conference (BMVC)}, year = {2021} }
	Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Kai Han, Andrea Vedaldi, Andrew Zisserman LSD-C: Linearly Separable Deep Clusters* ICCV Workshop on Visual Inductive Priors for Data-Efficient Deep Learning, 2021. (* indicates equal contribution.) BibTeX \| PDF \| arXiv \| code @inproceedings{rebuffi21lsdc, author = {Sylvestre-Alvise Rebuffi and Sebastien Ehrhardt and Kai Han and Andrea Vedaldi and Andrew Zisserman}, title = {LSD-C: Linearly Separable Deep Clusters}, booktitle = {ICCV Workshop on Visual Inductive Priors for Data-Efficient Deep Learning}, year = {2021} }
	Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021. BibTeX \| PDF \| arXiv @inproceedings{wang21contrastive, author = {Peng Wang and Kai Han and Xiu-Shen Wei and Lei Zhang and Lei Wang}, title = {Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2021}, }
	Kai Han, Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman AutoNovel: Automatically Discovering and Learning Novel Visual Categories IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021. BibTeX \| PDF \| arXiv \| DOI \| project page \| code @article{han21autonovel, author = {Kai Han and Sylvestre-Alvise Rebuffi and Sebastien Ehrhardt and Andrea Vedaldi and Andrew Zisserman}, title = {AutoNovel: Automatically Discovering and Learning Novel Visual Categories}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2021}, }
	Jie Li, Peng Wang, Kai Han, Yu Liu Anisotropic Convolutional Neural Networks for RGB-D based Semantic Scene Completion IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021. BibTeX \| PDF \| arXiv \| DOI \| project page \| code @article{li21anisotropic, author = {Jie Li and Peng Wang and Kai Han and Yu Liu}, title = {Anisotropic Convolutional Neural Networks for RGB-D based Semantic Scene Completion}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2021}, }
	Kai Han, Miaomiao Liu, Dirk Schnieders, Kwan-Yee K. Wong Fixed Viewpoint Mirror Surface Reconstruction under an Uncalibrated Camera IEEE Transactions on Image Processing (TIP), 2021. BibTeX \| PDF \| arXiv \| DOI \| supplementary \| project page \| code @article{han21fixed, title = {Fixed Viewpoint Mirror Surface Reconstruction under an Uncalibrated Camera}, author = {Kai Han and Miaomiao Liu and Dirk Schnieders and Kwan-Yee K. Wong}, journal = {IEEE Transactions on Image Processing (TIP)}, year = {2021} }

	Xinghui Li, Kai Han, Shuda Li, Victor Prisacariu Dual-Resolution Correspondence Networks Conference on Neural Information Processing Systems (NeurIPS), 2020. BibTeX \| PDF \| arXiv \| supplementary \| project page \| code @inproceedings{li20dualrc, author = {Xinghui Li and Kai Han and Shuda Li and Victor Prisacariu}, title = {Dual-Resolution Correspondence Networks}, booktitle = {Conference on Neural Information Processing Systems (NeurIPS)}, year = {2020}, }
	Kai Han, Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman Automatically Discovering and Learning New Visual Categories with Ranking Statistics* International Conference on Learning Representations (ICLR), 2020. (* indicates equal contribution.) BibTeX \| PDF \| arXiv \| OpenReview \| video \| project page \| code @inproceedings{han20automatically, author = {Kai Han and Sylvestre-Alvise Rebuffi and Sebastien Ehrhardt and Andrea Vedaldi and Andrew Zisserman}, title = {Automatically Discovering and Learning New Visual Categories with Ranking Statistics}, booktitle = {International Conference on Learning Representations (ICLR)}, year = {2020}, }
	Shuda Li, Kai Han*, Theo W. Costain, Henry Howard-Jenkins, Victor Prisacariu Correspondence Networks with Adaptive Neighbourhood Consensus* IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. (* indicates equal contribution.) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{li20correspondence, author = {Shuda Li and Kai Han and Theo W. Costain and Henry Howard-Jenkins and Victor Prisacariu}, title = {Correspondence Networks with Adaptive Neighbourhood Consensus}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2020}, }
	Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan Anisotropic Convolutional Networks for 3D Semantic Scene Completion IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020. BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{li20anisotropic, author = {Jie Li and Kai Han and Peng Wang and Yu Liu and Xia Yuan}, title = {Anisotropic Convolutional Networks for 3D Semantic Scene Completion}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2020}, }
	Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Kai Han, Andrea Vedaldi, Andrew Zisserman Semi-Supervised Learning with Scarce Annotations* CVPR Deep Vision Workshop, 2020. (* indicates equal contribution.) BibTeX \| PDF \| arXiv \| project page \| code @inproceedings{rebuffi20SSL, title={Semi-Supervised Learning with Scarce Annotations}, author={Sylvestre-Alvise Rebuffi and Sebastien Ehrhardt and Kai Han and Andrea Vedaldi and Andrew Zisserman}, booktitle={CVPR Deep Vision Workshop}, year={2020} }
	Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, Kwan-Yee K. Wong Deep Photometric Stereo for Non-Lambertian Surfaces IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020. BibTeX \| PDF \| arXiv \| DOI \| supplementary \| project page \| code @article{chen20deepps, title = {Deep Photometric Stereo for Non-Lambertian Surfaces}, author = {Guanying Chen and Kai Han and Boxin Shi and Yasuyuki Matsushita and Kwan-Yee K. Wong}, journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2020} }

	Kai Han, Andrea Vedaldi, Andrew Zisserman Learning to Discover Novel Visual Categories via Deep Transfer Clustering International Conference on Computer Vision (ICCV), 2019. BibTeX \| PDF \| arXiv \| supplementary \| project page \| code @inproceedings{han19DTC, author = {Kai Han and Andrea Vedaldi and Andrew Zisserman}, title = {Learning to Discover Novel Visual Categories via Deep Transfer Clustering}, booktitle = {International Conference on Computer Vision (ICCV)}, year = {2019} }
	Huy V. Vo, Francis Bach, Minsu Cho, Kai Han, Yann LeCun, Patrick Pérez, Jean Ponce Unsupervised Image Matching and Object Discovery as Optimization IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. BibTeX \| PDF \| arXiv \| code @inproceedings{vo19unsup, title = {Unsupervised Image Matching and Object Discovery as Optimization}, author = {Huy V. Vo and Francis Bach and Minsu Cho and Kai Han and Yann LeCun and Patrick P\'{e}rez and Jean Ponce}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2019} }
	Guanying Chen, Kai Han, Boxin Shi, Yasuyuki Matsushita, Kwan-Yee K. Wong Self-calibrating Deep Photometric Stereo Networks IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. Oral presentation (5.6% of submissions) BibTeX \| PDF \| arXiv \| supplementary \| video \| poster \| project page \| code @inproceedings{chen19SDPS_Net, title = {Self-calibrating Deep Photometric Stereo Networks}, author = {Guanying Chen and Kai Han and Boxin Shi and Yasuyuki Matsushita and Kwan-Yee K. Wong}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2019} }
	Guanying Chen, Kai Han*, Kwan-Yee K. Wong Learning Transparent Object Matting* International Journal of Computer Vision (IJCV), 2019. (* indicates equal contribution.) BibTeX \| PDF \| arXiv \| DOI \| supplementary \| project page \| code @article{chen19LTOM, title = {Learning Transparent Object Matting}, author = {Guanying Chen and Kai Han and Kwan-Yee K. Wong}, journal = {International Journal of Computer Vision (IJCV)}, year = {2019} }

	Guanying Chen, Kai Han, Kwan-Yee K. Wong PS-FCN: A Flexible Learning Framework for Photometric Stereo European Conference on Computer Vision (ECCV), 2018. BibTeX \| PDF \| arXiv \| video \| poster \| project page \| code @inproceedings{chen18ps_fcn, title = {PS-FCN: A Flexible Learning Framework for Photometric Stereo}, author = {Guanying Chen and Kai Han and Kwan-Yee K. Wong}, booktitle = {European Conference on Computer Vision (ECCV)}, year = {2018} }
	Guanying Chen, Kai Han*, Kwan-Yee K. Wong TOM-Net: Learning Transparent Object Matting from a Single Image* IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. (* indicates equal contribution.) Spotlight presentation (6.7% of submissions) BibTeX \| PDF \| arXiv \| video \| poster \| project page \| code @inproceedings{chen18tom_net, title = {TOM-Net: Learning Transparent Object Matting from a Single Image}, author = {Guanying Chen and Kai Han and Kwan-Yee K. Wong}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, year = {2018} }
	Kai Han, Kwan-Yee K. Wong, Miaomiao Liu Dense Reconstruction of Transparent Objects by Altering Incident Light Paths through Refraction International Journal of Computer Vision (IJCV), 2018. BibTeX \| PDF \| arXiv \| DOI @article{han18dense, title = {Dense Reconstruction of Transparent Objects by Altering Incident Light Paths through Refraction}, author = {Kai Han and Kwan-Yee K. Wong and Miaomiao Liu}, journal = {International Journal of Computer Vision (IJCV)}, year = {2018} }