I am currently an assistant researcher/professor at the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences. In 2020, I obtained my Ph.D. degree from the Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, where I worked with Prof. Shuqiang Jiang.
My research interests include large-scale image classification, vision and language understanding, and Embodied AI. I have been serving/served as a reviewer of IEEE TPAMI, IEEE TIP, IEEE TMM, IEEE TNNLS, IEEE TBD, and ACM TOMM. I also have been serving/served as a PC member of leading conferences in computer vision, multimedia, and AI, such as CVPR, ICCV, ECCV, ACM MM, IJCAI, and AAAI.
🔥🔥🔥 News
-
2024.09.05: 🎉🎉🎉 Our paper Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation has been accepted by CoRL 2024!
-
2024.02.27: 🎉🎉🎉 Our paper Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation has been accepted by CVPR 2024!
📝 Publications
📚 Journal
-
MemBridge: Video-Language Pre-training with Memory-Augmented Inter-Modality Bridge, Jiahao Yang, Xiangyang Li, Mao Zheng, Zihan Wang, Yongqing Zhu, Xiaoqian Guo, Yuchen Yuan, Zifeng Chai, Shuqiang Jiang. IEEE Transactions on Image Processing (TIP), 2023.
-
TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection, Xiaoqian Guo, Xiangyang Li, Yaowei Wang, Shuqiang Jiang. IEEE Transactions on Image Processing (TIP), 2023.
-
Focus and Align: Learning Tube Tokens for Video-Language Pre-training, Yongqing Zhu, Xiangyang Li, Mao Zheng, Jiahao Yang, Zihan Wang, Xiaoqian Guo, Zifeng Chai, Yuchen Yuan, Shuqiang Jiang. IEEE Transactions on Multimedia (TMM), 2023.
-
Dataset Bias in Few-shot Image Recognition, Shuqiang Jiang, Yaohui Zhu, Chenlong Liu, Xinhang Song, Xiangyang Li, Weiqing Min. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. [Project]
-
Multifaceted Analysis of Fine-Tuning in a Deep Model for Visual Recognition, Xiangyang Li, Luis Herranz, Shuqiang Jiang. ACM Transactions on Data Science (ACM TDS), 2020.
-
Know More Say Less: Image Captioning Based on Scene Graphs, Xiangyang Li, Shuqiang Jiang. IEEE Transactions on Multimedia (TMM), 2019.
-
Class Agnostic Image Common Object Detection, Shuqiang Jiang, Sisi Liang, Chengpeng Chen, Yaohui Zhu, Xiangyang Li. IEEE Transactions on Image Processing (TIP), 2019.
-
Bundled Object Context for Referring Expressions, Xiangyang Li, Shuqiang Jiang, IEEE Transactions on Multimedia (TMM), 2018.
📙 Conference
-
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation, Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang. In Conference on Robot Learning (CoRL), 2024.
-
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation, Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Spotlight), 2024.
-
GridMM: Grid Memory Map for Vision-and-Language Navigation, Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Shuqiang Jiang. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2023.
-
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation, Xiangyang Li, Zihan Wang, Jiahao Yang, Yaowei Wang, Shuqiang Jiang. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
-
Expressional Region Retrieval, Xiaoqian Guo, Xiangyang Li, Shuqiang Jiang. In ACM International Conference on Multimedia (MM), 2020.
-
Learning Object Context for Dense Captioning, Xiangyang Li, Shuqiang Jiang, Jungong Han. In AAAI Conference on Artificial Intelligence (AAAI), 2019.
-
Image Captioning with Both Object and Scene Information, Xiangyang Li, Xinhang Song, Luis Herranz, Yaohui Zhu, Shuqiang Jiang. In ACM International Conference on Multimedia (MM), 2016.
-
Scene Recognition with CNNs: Objects, Scales and Dataset Bias, Luis Herranz, Shuqiang Jiang, Xiangyang Li. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
Heterogeneous Convolutional Neural Networks for Visual Recognition, Xiangyang Li, Luis Herranz, Shuqiang Jiang. In Pacific Rim Conference on Multimedia (PCM) (Oral), 2016.