Academic Outputs
Workshop Organization
Open Challenge Organization
Invited Talks
- 2025.9: Toward Unified and Advanced Multimodal Generalist, Australian National University
- 2025.6: On Path to Multimodal Generalist General-Level and General-Bench, AI Time 2025, (slides)
- 2025.6: Toward Unified and Advanced Large Multimodal Foundation Model, MSRA
- 2025.6: On Path to Multimodal Generalist General-Level and General-Bench, NICE 2025, (slides)
- 2025.5: Toward Next-generation Large Multimodal Foundation Model, Central South University
- 2024.8: Towards AGI: from Unified MLLM to Multimodal Generalist, MLNLP 2024, (slides)
- 2024.1: From Multimodal LLM to AGI, Harbin Institute of Technology, Shenzhen
- 2023.12: From Multimodal LLM to AGI, CIPS Youth Working Committee in A Star, Singapore
- 2023.9: LLM-Empowered Text-to-Vision Diffusion Models, MLNLP 2023, (slides)
- 2023.8: LLM-Empowered Text-to-Vision Diffusion Models, WING lab @ NUS, (slides)
- 2023.8: Scene Graph-driven Structured Vision-Language Learning, Chinese Academy of Sciences, (slides)
- 2023.4: Towards Human-level AI, Huawei Cloud, (slides)
- 2023.2: Towards Human-level AI, NExT++ @ NUS, (slides)
- 2022.10: On the Structure-aware NLP and Beyond, Jianghan University, (slides)
- 2022.9: Neural Models for End-to-end Complex Information Extraction, SEA AI Lab, (slides)
- 2022.6: On the Structure-aware NLP and Beyond, NExT++ @ NUS, (slides)
- 2021.10: Language Semantic Vs. Syntactic Structure Parsing, Wuhan University, (slides)
- 2020.8: Deep Learning in NLP, Wuhan University, (slides)
- 2019.11: Implicit Objective Network for Emotion Detection, NLPCC 2019, (slides)
Tutorials
- 2025.9: A Beginner’s Guide to AI Research–Insights from a Multimodal AI Perspective, National University of Singapore, (slides)
- 2025.6: Evaluations and Benchmarks in the Context of Multimodal LLM, CVPR 2025, Nashville TN, USA
- 2024.10: From Multimodal LLM to Human-level AI: Architecture, Modality, Function, Instruction, Hallucination, Evaluation, Reasoning and Beyond, ACM MM 2024, Melbourne, Australia
- 2024.6: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond, CVPR 2024, Seattle WA, USA
- 2024.5: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond, LREC-COLING 2024, Torino, Italia
- 2022.10: Python Introduction and Preliminary for Research, Wuhan University
- 2020.4: How to AI Research and Paper Writing for AI Beginner, Wuhan University, (slides)
Teaching Assistant
- Hands-on Learning Large Language Models, Shanghai Jiao Tong University, May, 2024
- C programming and Algorithm, Wuhan University, Autumn 2019
- Natural Language Understanding, Wuhan University, Autumn 2020
- Deep Learning, Wuhan University, Spring 2021
Mentoring
Those are the students I am currently collaborating with.