Welcome!

I am a PhD student at the School of Computing, National University of Singapore. I am fortunate to be supervised by Prof. Mong Li Lee and Prof. Wynne Hsu at the Center for Trusted Internet and Community (CTIC). Prior to that, I received my master's degree from NUS and my bachelor's degree from Wuhan University.

My research interests include Video Understanding, Video Generation, and Multimodal Large Language Models.

I am currently exploring new collaboration opportunities. If you are interested in any of the topics mentioned above, please feel free to reach out via mluo@u.nus.edu.

🔥 News

  • 2025.05:  🎉 Accepted at ACL 2025
    Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework.

  • 2025.05:  🎉 Accepted at ICML 2025 (Spotlight)
    On Path to Multimodal Generalist: General-Level and General-Bench.

  • 2025.05:  🎉 Accepted at ICML 2025
    SWIFTCODE: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning.

  • 2025.05:  🎉 Accepted at ICML 2025
    VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models.

  • 2025.04:  Co-organizing a Workshop at ACM MM 2025
    The 1st Cognition-oriented Multimodal Affective and Empathetic Computing (CogMAEC 2025) Workshop.

  • 2025.04:  Co-organizing a Grand Challenge at ACM MM 2025
    Multimodal Conversational Aspect-based Sentiment Analysis (MCABSA 2025).

  • 2025.01:  🎉 Accepted at ICLR 2025
    PAD: Personalized Alignment at Decoding-Time.

  • 2025.01:  🎉 Accepted at WWW 2025
    Towards Multimodal Empathetic Response Generation: A Rich Text-Speech-Vision Avatar-based Benchmark.

  • 2024.09:  New Paper Published on arXiv
    A Survey on Benchmarks of Multimodal Large Language Models.

  • 2024.08:  🎉 Accepted at ACM MM Workshop (MIS24) (Best Paper Award)
    Fine-grained Structural Hallucination Detection for Unified Visual Comprehension and Generation in Multimodal LLM.

  • 2024.07:  🎉 Accepted at ACM MM 2024 (Oral)
    PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis.

  • 2024.03:  🎉 2nd Place at SemEval-2024
    NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations.

  • 2022.06:  Accepted at TDSC
    Towards Class-Balanced Privacy Preserving Heterogeneous Model Aggregation.

📝 Publications

  • 🎓 During My PhD Research Program
ICML

On Path to Multimodal Generalist: General-Level and General-Bench

Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, …, Meng Luo, Jiebo Luo, Tat‑Seng Chua, Hanwang Zhang, Shuicheng Yan

Project | ICML (Spotlight)

ICML

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Haojian Huang, Haodong Chen, Shengqiong Wu, Meng Luo, Jinlan Fu, Xinya Du, Hanwang Zhang, Hao Fei

Project | ICML

ACL

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

Jundong Xu, Hao Fei, Meng Luo, Qian Liu, Liangming Pan, William Yang Wang, Preslav Nakov, Mong-Li Lee, Wynne Hsu

Project | ACL

  • 🎓 During My Master’s Research Program
ACM MM

PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis

Meng Luo, Hao Fei, Bobo Li, Shengqiong Wu, Qian Liu, Soujanya Poria, Erik Cambria, Mong-Li Lee, Wynne Hsu

Project | ACM MM (Oral)

ICML

SWIFTCODE: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

Dong Huang, Guangtao Zeng, Jianbo Dai, Meng Luo, Han Weng, Yuhao Qing, Heming Cui, Zhijiang Guo, Jie M. Zhang

Project | ICML

ICLR

PAD: Personalized Alignment at Decoding-Time

Ruizhe Chen, Xiaotian Zhang, Meng Luo, Wenhao Chai, Zuozhu Liu

Project | ICLR

WWW

Towards Multimodal Empathetic Response Generation: A Rich Text-Speech-Vision Avatar-based Benchmark

Han Zhang, Zixiang Meng, Meng Luo, Hong Han, Lizi Liao, Erik Cambria, Hao Fei

Project | WWW

SemEval

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations

Meng Luo, Han Zhang, Shengqiong Wu, Bobo Li, Hong Han, Hao Fei

Project | SemEval@ACL (Challenge, 2nd Place) 2024

MM Workshop

Fine-grained Structural Hallucination Detection for Unified Visual Comprehension and Generation in Multimodal LLM

Hao Fei, Meng Luo, Jundong Xu, Shengqiong Wu, Wei Ji, Mong-Li Lee, Wynne Hsu

Project | Workshop@MM

TDSC

Towards Class-Balanced Privacy Preserving Heterogeneous Model Aggregation

Xiaoyi Pang, Zhibo Wang, Zeqing He, Peng Sun, Meng Luo, Ju Ren

Project | TDSC

arXiv

A Survey on Benchmarks of Multimodal Large Language Models

Jian Li, Weiheng Lu, Hao Fei, Meng Luo, Ming Dai, Min Xia, Yizhang Jin, Zhenye Gan, Ding Qi, Chaoyou Fu, Ying Tai, Wankou Yang, Yabiao Wang, Chengjie Wang

Project | arXiv

💻 Professional Activity

Reviewer for NeurIPS, ICLR, ICML, ICCV, ACL, ACM MM, WWW, Neurocomputing, TOMM, TALLIP.

🎖 Honors and Awards

During Undergraduate

Academic Achievements

  • 2022: Huawei Scholarship, Wuhan University (Top 5%)
  • 2021: First-class Excellence Scholarship, Wuhan University (Ranked 2nd)
  • 2021: Merit Student, Wuhan University (Top 10%)
  • 2021: Outstanding Student, Wuhan University

Competitions and Recognitions

  • 2022: Silver Award, Hubei Challenge Cup, Wuhan University
  • 2022: Gold Award, Ziqiang Cup College, Wuhan University
  • 2021: National First Prize, Citi Cup Financial Innovation Application Contest
  • 2021: Bole Award, ByteTop Summit Project, ByteDance
  • 2019: Top 10 Book Ambassador, Wuhan University Library
  • 2018: First Prize, HP Dream Factory Innovation Hackathon Wuhan Station, HP

Leadership and Social Activities

  • 2021: Chairman, Wuhan University Campus Ambassador, ByteDance
  • 2021: Excellent Campus Ambassador, WePie Team
  • 2021: Online Course on Interdisciplinary Communication, University of Cambridge

Wisdom begins in wonder. Reach out across the world and build broad alliances.