I am currently an assistant researcher/professor at the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences. I graduated from Wuhan Institute of Technology in 2012 and obtained the M.E. degree from Capital Normal University, Beijing, China, in 2015, advised by Prof. Zhiping Shi. In 2020, I received the Ph.D. degree from the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, advised by Prof. Shuqiang Jiang.
My research interests include large-scale image classification, vision and language understanding, and embodied AI. I have served as a reviewer for IEEE TPAMI, IEEE TIP, IEEE TMM, IEEE TNNLS, IEEE TBD, and ACM TOMM. I have also served as a PC member for leading conferences in computer vision, multimedia, and AI, such as CVPR, ICCV, ECCV, ACM MM, IJCAI, and AAAI.
🔥🔥🔥 News
- 2024.02.27: 🎉🎉🎉 Our paper Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation has been accepted by CVPR 2024!
📝 Publications
📚 Journal
- MemBridge: Video-Language Pre-training with Memory-Augmented Inter-Modality Bridge, Jiahao Yang, Xiangyang Li, Mao Zheng, Zihan Wang, Yongqing Zhu, Xiaoqian Guo, Yuchen Yuan, Zifeng Chai, Shuqiang Jiang. IEEE Transactions on Image Processing (TIP), 2023.
- TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection, Xiaoqian Guo, Xiangyang Li, Yaowei Wang, Shuqiang Jiang. IEEE Transactions on Image Processing (TIP), 2023.
- Focus and Align: Learning Tube Tokens for Video-Language Pre-training, Yongqing Zhu, Xiangyang Li, Mao Zheng, Jiahao Yang, Zihan Wang, Xiaoqian Guo, Zifeng Chai, Yuchen Yuan, Shuqiang Jiang. IEEE Transactions on Multimedia (TMM), 2023.
- Dataset Bias in Few-shot Image Recognition, Shuqiang Jiang, Yaohui Zhu, Chenlong Liu, Xinhang Song, Xiangyang Li, Weiqing Min. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. [Project]
- Multifaceted Analysis of Fine-Tuning in a Deep Model for Visual Recognition, Xiangyang Li, Luis Herranz, Shuqiang Jiang. ACM Transactions on Data Science (ACM TDS), 2020.
- Know More Say Less: Image Captioning Based on Scene Graphs, Xiangyang Li, Shuqiang Jiang. IEEE Transactions on Multimedia (TMM), 2019.
- Class Agnostic Image Common Object Detection, Shuqiang Jiang, Sisi Liang, Chengpeng Chen, Yaohui Zhu, Xiangyang Li. IEEE Transactions on Image Processing (TIP), 2019.
- Bundled Object Context for Referring Expressions, Xiangyang Li, Shuqiang Jiang. IEEE Transactions on Multimedia (TMM), 2018.
📙 Conference
- Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation, Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
- GridMM: Grid Memory Map for Vision-and-Language Navigation, Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2023.
- KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation, Xiangyang Li, Zihan Wang, Jiahao Yang, Yaowei Wang, Shuqiang Jiang. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
- Expressional Region Retrieval, Xiaoqian Guo, Xiangyang Li, Shuqiang Jiang. In ACM International Conference on Multimedia (MM), 2020.
- Learning Object Context for Dense Captioning, Xiangyang Li, Shuqiang Jiang, Jungong Han. In AAAI Conference on Artificial Intelligence (AAAI), 2019.
- Image Captioning with Both Object and Scene Information, Xiangyang Li, Xinhang Song, Luis Herranz, Yaohui Zhu, Shuqiang Jiang. In ACM International Conference on Multimedia (MM), 2016.
- Scene Recognition with CNNs: Objects, Scales and Dataset Bias, Luis Herranz, Shuqiang Jiang, Xiangyang Li. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- Heterogeneous Convolutional Neural Networks for Visual Recognition, Xiangyang Li, Luis Herranz, Shuqiang Jiang. In Pacific Rim Conference on Multimedia (PCM), 2016. [Oral]