Hao Fei

Senior Postdoctoral Researcher

Department of Computer Science, University of Oxford
Wolfson Building, Parks Road, Oxford OX1 3QD, UK

Profile

I am a senior postdoctoral researcher at the CS department and the Big Data Institute in the University of Oxford, working with Prof. Yarin Gal and other CeBAM PIs. Previously, I was a senior research fellow at National University of Singapore, where I worked with Prof. Mong-Li Lee, Prof. Wynne Hsu, Prof. Tat-Seng Chua and Prof. Shuicheng Yan. I also worked as a visiting researcher at Microsoft Research Asia, an associate researcher at Skywork AI Singapore, and SEA AI lab, respectively.

My research has been published in top-tier ML/NLP/CV/MM venues, e.g., ICML, NeurIPS, ICLR, ACL, CVPR, ACM-MM, AAAI, WWW, SIGIR, EMNLP, TPAMI, IJCV, TMM, TKDE, TOIS, TNNLS, TASLP. My papers were selected as Most Influential Papers by Paper Digest, and ESI Highly Influential Papers, 2024 WAIC Outstanding Paper Award and also several Best Papers (Nominations as well) on some venues. I was awarded the World AI Conference Rising Star in 2023. I was also the recipient of the 2023 WAIC Rising Star award, and ranked as Top 2% Scientists Worldwide 2024&2025 by Stanford University. I’ve regularly served as (Senior) Area Chair or Senior Program Committee of top-tier conferences. I was the organization committee of conferences, WSDM, EMNLP, ACL, ACM MM, etc. I serve as the Associate Editor of some journals, e.g., IEEE TAFFC, IEEE TASLP, ACM TALLIP, Neurocomputing. My Ph.D thesis was awarded the Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS).

Research

My research interests lie in NLP, CV, and the intersection of both (i.e., Multimodal/Vision-Language Learning). My long-term goal is to achieve human-level AI centered around multimodal LLMs & generalists. I pay the main focus on building large foundation multimodal models and bridging physical and mental worlds. Know me via some latest series of representative works (see research statement for more):

Recently, I also extensively explore the AI for science, including 1) psychology & social norm studies, 2) bio-/medicine & healthcare & clinics, and 3) material science, by integrating the advanced LLM/agent methodologies.

Advertising

I am constantly looking for collaborations on the above topics. Remote manner is also supported. For promising students I will provide sufficient GPUs. Hit me up, if you are a Ph.D/master/bachelor student and interested in what I am doing now (with potential vacancies for research interns/RAs/visiting). For students from University of Oxford, I’m particularly looking for collaborations on world modeling and AI scientist. Please describe your research status and attach your resume & statement.

News

• 7 May 2026

We are releasing the survey of Audio-Visual Intelligence, check it now at Github!

• 30 April 2026

Three papers are accepted by ICML 2026, 1) One-to-Many Video Grounding, 2) AVI-Bench and 3) Multi-Agent Communication. Congrats to all my co-authors!

• 7 April 2026

Two papers are accepted by ACL 2026, 1) Multi-agent Reasoning and 2) Multimodal Deception Detection. Congrats to all my co-authors!

• 27 Mar 2026

One paper about Video Hallucination (Dr.V) is accepted by International Journal of Computer Vision (IJCV)!

• 21 Feb 2026