Haodong Duan 段浩东

Haodong Duan is a postdoctoral researcher at Shanghai AI Lab, working on the evaluation of large language models and multi-modality models. He received his Ph.D. degree from the Multimedia Laboratory @ CUHK in 2023, supervised by Professor Dahua Lin. During his Ph.D., he worked on human-centric action understanding, including video classification, skeleton-based action recognition, and pose estimation. Before joining MMLAB, he received his B.S. degree in data science from Peking University in 2019. His research interests include video recognition, human-centric action understanding, and multi-modality learning. You can find his CV here.

News

(2024.04) We released MMStar. In this work, we studied the vision-dispensable and data leakage problems in multi-modal evaluation
(2024.03) Two papers (BotChat, Ada-LEval) are accepted by NAACL 2024
(2023.12) We released VLMEvalKit, an all-in-one toolkit for evaluating LVLMs
(2023.10) SkeleTR is accepted by ICCV 2023
(2023.07) We released MMBench, a pioneering benchmark for evaluating large vision-language models (LVLMs)
(2023.06) I received my Ph.D. degree from the Multimedia Laboratory @ CUHK
(2023.04) I will join the Shanghai AI Lab as a postdoctoral researcher
(2022.07) I started my internship at AWS AI, advised by Dr. Mingze Xu
(2022.06) Gave a talk at the CVPR 2022 OpenMMLab Tutorial on human-centric action understanding [Slides]
(2022.05) Released PYSKL, a codebase for skeleton action recognition [Report], accepted by MM 2022
(2022.03) Three papers (PoseC3D, TransRank, OCSampler) are accepted by CVPR 2022
(2020.08) Joined OpenMMLab to serve as a maintainer of MMAction2
(2020.07) OmniSource is accepted by ECCV 2020
(2019.07) TRB is accepted by ICCV 2019

Bold indicates acceptance as an oral presentation.

Professional Activities

Conference Reviewer: ICCV[21-23], AAAI[22-24], CVPR[22-24], ECCV[22-24], NeurIPS[22-23], WACV[23], ICLR[23], Eurographics, etc.
Journal Reviewer: TPAMI, IJCV, TIP, PR, TMM, etc.