About me

I am an Associate Researcher at East China Normal University (School of Computer Science and Technology). I received my Ph.D. in Computer Application Technology from Fudan University (2019.09–2024.06), advised by Prof. Yu-Gang Jiang and Prof. Jingjing Chen.

My research interest is focused on Vision-Language Learning, especially Visual Question Answering. In addition, i also delve into some autonoumous driving related topics, such as 3D object detection, lane detection, stereo depth estimation.

News

  • Oct. 2025

    Look Before You Decide (MARS-Bench) was accepted to ACM MM 2025.

  • Aug. 2025

    EgoCross was accepted to AAAI 2026.

  • Jun. 2025

    Domain-RAG was accepted to NeurIPS 2025.

  • Jun. 2025

    NeighborRetr was accepted to CVPR 2025.

  • Apr. 2025

    HSACNet was accepted to ICME 2025.

  • Jan. 2024

    NuScenes-QA was accepted to AAAI 2024; preprint and code are now online.

  • Jun. 2023

    Our "Locate before Answering" work appeared in IEEE Transactions on Multimedia.

  • Jul. 2022

    ViGA for video moment retrieval was presented at ACM SIGIR 2022.

  • Jun. 2022

    Wrapped up my multimodal research internship at Bilibili AI-Lab.

  • Mar. 2022

    Scene Graph Refinement Network for VQA was published in IEEE Transactions on Multimedia.

Publications

Egocross: Benchmarking multimodal large language models for cross-domain egocentric video question answering

Yanjun Li, Yuqian Fu, Tianwen Qian, Qi'ao Xu, Silong Dai, Danda Pani Paudel, Luc Van Gool, Xiaoling Wang

AAAI, 2026

Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning

Yian Li, Wentao Tian, Yang Jiao, Tianwen Qian, Na Zhao, Bin Zhu, Jingjing Chen, Yu-Gang Jiang

ACM MM, 2025

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection

Yu Li, Xingyu Qiu, Yuqian Fu, Jie Chen, Tianwen Qian, Xu Zheng, Danda Pani Paudel, Yanwei Fu, Xuanjing Huang, Luc Van Gool, Yu-Gang Jiang

NeurIPS, 2025

NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval

Zengrong Lin, Zheng Wang, Tianwen Qian, Pan Mu, Sixian Chan, Cong Bai

CVPR, 2025

HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection

Qi'ao Xu, Pengfei Wang, Yanjun Li, Tianwen Qian, Xiaoling Wang

ICME, 2025

NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario (264 cites • 217 GitHub stars)

Tianwen Qian, Jingjing Chen, Linhai Zhuo, Yang Jiao, Yu-Gang Jiang

AAAI, 2024

Locate before Answering: Answer Guided Question Localization for Video Question Answering

Tianwen Qian, Ran Cui, Jingjing Chen, Pai Peng, Xiaowei Guo, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), 2023

Video Moment Retrieval from Text Queries via Single Frame Annotation

Ran Cui*, Tianwen Qian*, Pai Peng, Elena Daskalaki, Jingjing Chen, Xiaowei Guo, Huyang Sun, Yu-Gang Jiang

ACM SIGIR, 2022 (* indicates equal contribution)

Scene Graph Refinement Network for Visual Question Answering

Tianwen Qian, Jingjing Chen, Shaoxiang Chen, Bo Wu, Yu-Gang Jiang

IEEE Transactions on Multimedia (TMM), 2022

Experiences

East China Normal University, School of Computer Science and Technology

Associate Researcher

Mar. 2025 – Present

Bosch (China) Investment Ltd., Central Research

AI Algorithm Researcher

Jul. 2024 – Nov. 2024

Bilibili AI-Lab

Research intern with the topic of Multimodal Learning, Large-scale Video Pre-training, Video Localization.

Jun. 2021 – Jun. 2022

Dalian University of Technology

Research assistant of the Smart Ocean Lab with the topic of visual obstacle avoidance for unmanned ships.

Sep. 2018 – Jun. 2019

Education

Fudan University

Ph.D., Computer Application Technology

Sep. 2019 – Jun. 2024

Dalian University of Technology

Bachelor of Engineering

Sep. 2015 – Jun. 2019

Academic Services

  • Conference Reviewer for ACM MM 2023 / ECCV 2022 / CVPR 2022.
  • Journal Reviewer for TMM / ToMM / PR / Neurocomputing.