Academic Outputs
Workshop Organization
Open Challenge Organization
Invited Talks
- 2025.9: Toward Unified and Advanced Multimodal Generalist, Australian National University
- 2025.6: On Path to Multimodal Generalist: General-Level and General-Bench, AI Time 2025, (slides)
- 2025.6: Toward Unified and Advanced Large Multimodal Foundation Model, MSRA
- 2025.6: On Path to Multimodal Generalist: General-Level and General-Bench, NICE 2025, (slides)
- 2025.5: Toward Next-generation Large Multimodal Foundation Model, Central South University
- 2024.8: Towards AGI: from Unified MLLM to Multimodal Generalist, MLNLP 2024, (slides)
- 2024.1: From Multimodal LLM to AGI, Harbin Institute of Technology, Shenzhen
- 2023.12: From Multimodal LLM to AGI, CIPS Youth Working Committee at A*STAR, Singapore
- 2023.9: LLM-Empowered Text-to-Vision Diffusion Models, MLNLP 2023, (slides)
- 2023.8: LLM-Empowered Text-to-Vision Diffusion Models, WING lab @ NUS, (slides)
- 2023.8: Scene Graph-driven Structured Vision-Language Learning, Chinese Academy of Sciences, (slides)
- 2023.4: Towards Human-level AI, Huawei Cloud, (slides)
- 2023.2: Towards Human-level AI, NExT++ @ NUS, (slides)
- 2022.10: On the Structure-aware NLP and Beyond, Jianghan University, (slides)
- 2022.9: Neural Models for End-to-end Complex Information Extraction, SEA AI Lab, (slides)
- 2022.6: On the Structure-aware NLP and Beyond, NExT++ @ NUS, (slides)
- 2021.10: Language Semantic Vs. Syntactic Structure Parsing, Wuhan University, (slides)
- 2020.8: Deep Learning in NLP, Wuhan University, (slides)
- 2019.11: Implicit Objective Network for Emotion Detection, NLPCC 2019, (slides)
Tutorials
- 2025.9: A Beginner’s Guide to AI Research: Insights from a Multimodal AI Perspective, National University of Singapore, (slides)
- 2025.6: Evaluations and Benchmarks in the Context of Multimodal LLM, CVPR 2025, Nashville TN, USA
- 2024.10: From Multimodal LLM to Human-level AI: Architecture, Modality, Function, Instruction, Hallucination, Evaluation, Reasoning and Beyond, ACM MM 2024, Melbourne, Australia
- 2024.6: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond, CVPR 2024, Seattle WA, USA
- 2024.5: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond, LREC-COLING 2024, Torino, Italia
- 2022.10: Python Introduction and Preliminary for Research, Wuhan University
- 2020.4: How to Do AI Research and Paper Writing for AI Beginners, Wuhan University, (slides)
Teaching Assistant
- Hands-on Learning Large Language Models, Shanghai Jiao Tong University, May 2024
- C Programming and Algorithm, Wuhan University, Autumn 2019
- Natural Language Understanding, Wuhan University, Autumn 2020
- Deep Learning, Wuhan University, Spring 2021
Mentoring
These are the students I am currently collaborating with or have mentored.
- Ph.D. Students
  - Jundong Xu, NUS
  - Meng Luo, NUS
  - Pengxin Xu, HITSZ
  - Kai Liu, Zhejiang University, Visiting @ NUS
  - Tianjie Ju, SJTU, Visiting @ NUS
  - Li Zheng, Wuhan University, Remote
  - Yuhan Cui, CUHK, Remote
  - Hao Li, Wuhan University, Visiting @ NUS
  - Zongru Wu, SJTU, Visiting @ NUS
  - You Qin, NUS
  - Daoan Zhang, University of Rochester, Remote
  - Zhengyang Liang, SMU
  - Wangcheng Tao, NUS, Remote
  - Jiachen Tu, UIUC, Remote
  - Yiwen Jiang, Monash University, Remote
  - …
- Master Students (incl. RA)
  - Yanlin Li, NUS
  - Yanguang Zhao, NUS
  - Lanhu Wu, NUS
  - Minghui Guo, NUS
  - Wenhao Xu, NUS
  - Pengcheng Zhou, NUS
  - Kaiming Jin, NUS
  - Shize Zhang, NUS
  - Kaiwen Zhang, NUS
  - Mingyang Bao, NUS
  - …
- Former Students
  - Shengqiong Wu, NUS
  - Bobo Li, Wuhan University
  - Yu Zhao, Tianjin University
  - Yaoting Yang, NUS
  - Han Zhang, Xidian University, Remote
  - Ji Qi, Tsinghua University
  - Yichong Huang, HIT
  - Jiang Liu, Wuhan University
  - Haidong Xu, HITSZ
  - Bin Wang, HITSZ
  - Minghui Xu, Wuhan University
  - Ling Zhuang, Central China Normal University
  - Jingye Li, Wuhan University
  - Peng Tao, Wuhan University
  - Jun Gao, Wuhan University