I am currently a Ph.D. student majoring in computer application technology at FVL Lab, Fudan University, where I am very lucky to be co-supervised by Prof. Yu-Gang Jiang and Prof. Jingjing Chen. Before this, I received my bachelor's degree from Dalian University of Technology in 2019. My research focuses on Vision-Language Learning, especially Visual Question Answering. In addition, I work on autonomous driving topics such as 3D object detection, lane detection, and stereo depth estimation.
Locate before Answering: Answer Guided Question Localization for Video Question Answering
IEEE Transactions on Multimedia (TMM), 2023.
Video Moment Retrieval from Text Queries via Single Frame Annotation
* indicates equal contribution.
ACM SIGIR, 2022.
Scene Graph Refinement Network for Visual Question Answering
IEEE Transactions on Multimedia (TMM), 2022.