
Books

2022
Qi Wu, Peng Wang, Xin Wang, Xiaodong He, Wenwu Zhu. Visual Question Answering - From Theory to Application. Springer, 2022.

Journal Papers

2023
Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu. Room-Object Entity Prompting and Reasoning for Embodied Referring Expression. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
2023
Ning Ding, Chaorui Deng, Mingkui Tan, Qi Wu. Image Captioning with Controllable and Adaptive Length Levels. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
2023
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu. HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
2023
Zhiquan Wen, Shuaicheng Niu, Ge Li, Qingyao Wu, Mingkui Tan, Qi Wu. Test-time model adaptation for visual question answering with debiased self-supervisions. IEEE Transactions on Multimedia (TMM), 2023.
2023
Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen, IEEE Transactions on Image Processing, (TIP) 2023.
2023
Wei Suo, Mengyang Sun, Peng Wang, Yanning Zhang, Qi Wu. Transformer-based Relational Inference Network for Complex Visual Relational Reasoning. ACM Transactions on Multimedia Computing, Communications and Applications, 2023.
2022
Mengyang Sun, Wei Suo, Peng Wang, Yanning Zhang, Qi Wu. A proposal-free one-stage framework for referring expression comprehension and generation via dense cross-attention. IEEE Transactions on Multimedia (TMM), 2022.
2022
Wei Suo, Mengyang Sun, Peng Wang, Yanning Zhang, Qi Wu. Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension. IEEE Transactions on Image Processing (TIP), 2022.
2022
Mengge He, Wenjing Du, Zhiquan Wen, Qing Du, Yutong Xie, Qi Wu. Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022.
2021
Chenyu Gao, Qi Zhu, Peng Wang, Hui Li, Yuliang Liu, Anton van den Hengel, Qi Wu*. Structured Multimodal Attentions for TextVQA. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021.
2021
Sourav Garg, Niko Sunderhauf, Feras Dayoub, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian Reid, Stephen Gould, Peter Corke, Michael Milford. Semantics for Robotic Mapping, Perception and Interaction: A Survey. Foundations and Trends in Robotics, 2021.
2021
Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Javen Qinfeng Shi and Anton van den Hengel. Show, Price and Negotiate: A Negotiator with Online Value Look-Ahead. IEEE Transactions on Multimedia (TMM), 2021.
2020
Chaorui Deng*, Qi Wu* (equal contribution), Qingyao Wu, Fuyuan Hu, Fan Lyu, Mingkui Tan. Visual Grounding via Accumulated Attention. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020.
2020
Jing Yu, Xiaoze Jiang, Zengchang Qin, Weifeng Zhang, Yue Hu and Qi Wu. Learning Dual Encoding Model for Adaptive Visual Understanding in Visual Dialogue. IEEE Transactions on Image Processing (TIP), 2020.
2020
Qi Chen, Qi Wu, Jian Chen, Qingyao Wu, Anton van den Hengel, and Mingkui Tan. Scripted Video Generation with a Bottom-up Generative Adversarial Network. IEEE Transactions on Image Processing (TIP), 2020.
2020
Yanyuan Qiao, Chaorui Deng, Qi Wu. Referring Expression Comprehension: A Survey of Methods and Datasets. IEEE Transactions on Multimedia (TMM), 2020.
2020
Jing Yu, Weifeng Zhang, Yuhang Lu, Zengchang Qin, Yue Hu, Jianlong Tan, Qi Wu. Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-modal Retrieval. IEEE Transactions on Multimedia (TMM), 2020.
2020
Weixia Zhang, Chao Ma, Qi Wu, Xiaokang Yang. Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2020.
2019
Junjie Zhang, Qi Wu, Jian Zhang, Chunhua Shen. Heritage Image Annotation via Collective Knowledge. Pattern Recognition (PR), 2019.
2019
Fan Lyu, Qi Wu, Fuyuan Hu, Qingyao Wu, Mingkui Tan. Attend and Imagine: Multi-label Image Classification with Visual Attention and Recurrent Neural Networks. IEEE Transactions on Multimedia (TMM), 2019.
2019
Jianpeng Zhang, Yutong Xie, Qi Wu, Yong Xia. Medical image classification using synergic deep learning. Medical Image Analysis (MIA), 2019.
2018
Yan Huang, Qi Wu, Wei Wang, Liang Wang. Image and Sentence Matching via Semantic Concepts and Order Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2018. (IF: 17.730)
2018
Junjie Zhang*, Qi Wu* (equal contribution), Chunhua Shen, Jian Zhang. Multi-Label Image Classification with Regional Latent Semantic Dependencies. IEEE Transactions on Multimedia (TMM), 2018.
2017
Qi Wu, Chunhua Shen, Anton van den Hengel, Peng Wang, Anthony Dick. Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017. (IF: 17.730)
2017
Peng Wang*, Qi Wu* (equal contribution), Chunhua Shen, Anthony Dick, Anton van den Hengel. FVQA: Fact-based Visual Question Answering. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017. (IF: 17.730)
2017
Damien Teney, Qi Wu* (corresponding author), Anton van den Hengel. Visual Question Answering: A Tutorial. IEEE Signal Processing Magazine (SPM), Volume 34 Issue 6, Pages 63-75, 2017. (IF: 9.654)
2017
Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anton van den Hengel, Anthony Dick. Visual Question Answering: A Survey of Models and Datasets. Computer Vision and Image Understanding (CVIU), 2017.

Conference Papers

2023
Yutong Xie, Lin Gu, Tatsuya Harada, Jianpeng Zhang, Yong Xia, Qi Wu. MedIM: Boost Medical Image Representation via Radiology Report-Guided Masking. International Conference on Medical Image Computing and Computer-Assisted Intervention. (MICCAI 2023), 2023.
2023
Qingbiao Guan, Yutong Xie, Bing Yang, Jianpeng Zhang, Zhibin Liao, Qi Wu, Yong Xia. Unpaired Cross-Modal Interaction Learning for COVID-19 Segmentation on Limited CT Images. International Conference on Medical Image Computing and Computer-Assisted Intervention. (MICCAI 2023), 2023.
2023
Chongyang Zhao, Yuankai Qi, Qi Wu. Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes. ACM International Conference on Multimedia. (ACM MM 2023), 2023.
2023
Cristian Rodriguez, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura, Qi Wu. Digging out discrimination information from generated samples for robust visual question answering. In Findings of the Association for Computational Linguistics (ACL 2023), 2023.
2023
Cristian Rodriguez, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura, Qi Wu. Memory-efficient Temporal Moment Localization in Long Videos. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023.
2023
Xi Tian, Yong-Liang Yang, Qi Wu. ShapeScaffolder: Structure-Aware 3D Shape Generation from Text. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Yanyuan Qiao, Zheng Yu, Qi Wu. VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Yanyuan Qiao, Yuankai Qi, Zheng Yu, Jing Liu, Qi Wu. March in Chat: Interactive Prompting for Remote Embodied Referring Expression. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Chaorui Deng, Da Chen, Qi Wu. Identity-Consistent Aggregation for Video Object Detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Chaorui Deng, Qi Chen, Pengda Qin, Da Chen, Qi Wu. Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Shubo Liu, Hongsheng Zhang, Yuankai Qi, Peng Wang, Yanning Zhang, Qi Wu. AerialVLN: Vision-and-Language Navigation for UAVs. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao. Scaling data generation in vision-and-language navigation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023.
2023
Gaoxiang Cong, Liang Li, Yuankai Qi, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang. Learning to Dub Movies via Hierarchical Prosody Models. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2023), 2023.
2023
Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu. S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2023), 2023.
2022
Qi Chen, Chaorui Deng, Qi Wu. Learning Distinct and Representative Modes for Image Captioning. Neural Information Processing Systems. (NeurIPS 2022), 2022.
2022
Jing Gu, Eliana Stefani, Qi Wu, Jesse Thomason, and Xin Wang. Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), 2022.
2022
Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Wang, Qi Wu, Miguel Eckstein, and William Yang Wang. Diagnosing vision-and-language navigation: What really matters. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022), 2022.
2022
Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu. UniMiSS: Universal Medical Self-supervised Learning via Breaking Dimensionality Barrier. In Proceedings of European Conference on Computer Vision (ECCV 2022), 2022.
2022
Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu. A Simple and Robust Correlation Filtering Method for Text-Based Person Search. In Proceedings of European Conference on Computer Vision (ECCV 2022), 2022.
2022
Qi Chen, Mingkui Tan, Yuankai Qi, Jiaqiu Zhou, Yuanqing Li, Qi Wu. V2C: Visual Voice Cloning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022.
2022
Chenchen Jing, Yunde Jia, Yuwei Wu, Xinyu Liu, Qi Wu. Maintaining Reasoning Consistency in Compositional Visual Question Answering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022.
2022
Yicong Hong, Zun Wang, Qi Wu, Stephen Gould. Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022.
2022
Yang Ding, Jing Yu, Bang Liu, Yue Hu, Mingxin Cui, Qi Wu. MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022.
2022
Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu. HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022.
2022
Chenchen Jing, Yunde Jia, Yuwei Wu, Chuanhao Li, Qi Wu. Learning the dynamics of visual relational reasoning via reinforced path routing. Association for the Advancement of Artificial Intelligence (AAAI 2022), 2022.
2021
Keji He, Yan Huang, Qi Wu, Jianhua Yang, Dong An, Shuanglin Sima, Liang Wang. Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision. Neural Information Processing Systems. (NeurIPS 2021), 2021.
2021
Zhiquan Wen, Guanghui Xu, Mingkui Tan, Qingyao Wu, Qi Wu. Debiased Visual Question Answering from Feature and Sample Perspectives. Neural Information Processing Systems. (NeurIPS 2021), 2021.
2021
Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu. The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. In Proceedings of International Conference on Computer Vision (ICCV 2021), 2021.
2021
Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan. Neighbor-view Enhanced Model for Vision and Language Navigation. ACM International Conference on Multimedia. (ACM MM 2021), 2021.
2021
Yanyuan Qiao, Qi Chen, Chaorui Deng, Ning Ding, Yuankai Qi, Mingkui Tan, Xincheng Ren, Qi Wu. R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks. ACM International Conference on Multimedia. (ACM MM 2021), 2021.
2021
Chenyu Gao, Qi Zhu, Peng Wang, Qi Wu. Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads. International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021.
2021
Jing Yu, Yuan Chai, Yujing Wang, Yue Hu, Qi Wu. CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation. International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021.
2021
Wei Suo, Mengyang Sun, Peng Wang, Qi Wu. Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention. International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021.
2021
Chaorui Deng, Shizhe Chen, Da Chen, Qi Wu. Sketch, Ground, and Refine: Top-Down Dense Video Captioning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould. A Recurrent Vision-and-Language BERT for Navigation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Guanghui Xu, Mingkui Tan, Shuaicheng Niu, Yucheng Luo, Qing Du, Qi Wu. Towards Accurate Text-based Image Captioning with Content Diversity Exploration. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu. Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Zeren Sun, Yazhou Yao, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang. Jo-SRC: A Contrastive Approach for Combating Noisy Labels. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Tao Chen, Guo-Sen Xie, Yazhou Yao, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang. Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021.
2021
Li Liu, Mengge He, Guanghui Xu, Mingkui Tan, Qi Wu. How to Train Your Agent to Read and Write?. Association for the Advancement of Artificial Intelligence (AAAI 2021), 2021.
2021
Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu. Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps. Association for the Advancement of Artificial Intelligence (AAAI 2021), 2021.
2021
Zhaokai Wang, Renda Bao, Qi Wu, Si Liu. Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. Association for the Advancement of Artificial Intelligence (AAAI 2021), 2021.
2021
Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Qinfeng Shi. Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation. Winter Conference on Applications of Computer Vision (WACV 2021), 2021.
2020
Yicong Hong, Cristian Rodriguez-Opazo, Yuankai Qi, Qi Wu, Stephen Gould. Language and Visual Entity Relationship Graph for Agent Navigation. Neural Information Processing Systems. (NeurIPS 2020), 2020.
2020
Yicong Hong, Cristian Rodriguez-Opazo, Qi Wu, Stephen Gould. Sub-Instruction Aware Vision-and-Language Navigation. 2020 Conference on Empirical Methods in Natural Language Processing. (EMNLP 2020), 2020.
2020
Fen Liu, Guanghui Xu, Qi Wu, Qing Du, Wei Jia, Mingkui Tan. Cascade Reasoning Network for Text-based Visual Question Answering. ACM International Conference on Multimedia. (ACM MM 2020), 2020.
2020
Chenchen Jing, Yuwei Wu, Mingtao Pei, Yao Hu, Yunde Jia, Qi Wu. Visual-Semantic Graph Matching for Visual Grounding. ACM International Conference on Multimedia. (ACM MM 2020), 2020.
2020
Peng Wang, Dongyang Liu, Hui Li, Qi Wu. Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge. ACM International Conference on Multimedia. (ACM MM 2020), 2020.
2020
Chuanyi Zhang, Yazhou Yao, Xiangbo Shu, Zechao Li, Zhenmin Tang, Qi Wu. Data-driven Meta-set Based Fine-Grained Visual Classification. ACM International Conference on Multimedia. (ACM MM 2020), 2020.
2020
Hu Wang, Qi Wu, Chunhua Shen. Soft Expert Reward Learning for Vision-and-Language Navigation. In Proceedings of European Conference on Computer Vision (ECCV 2020), 2020.
2020
Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu. Object-and-Action Aware Model for Visual Language Navigation. In Proceedings of European Conference on Computer Vision (ECCV 2020), 2020.
2020
Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu. Length Controllable Image Captioning. In Proceedings of European Conference on Computer Vision (ECCV 2020), 2020.
2020
Ruixue Tang, Chao Ma, Wei Emma Zhang, Qi Wu, Xiaokang Yang. Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering. In Proceedings of European Conference on Computer Vision (ECCV 2020), 2020.
2020
Xiaoze Jiang, Jing Yu, Yajing Sun, Zengchang Qin, Zihao Zhu, Yue Hu, Qi Wu. DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue. International Joint Conference on Artificial Intelligence (IJCAI 2020), 2020.
2020
Zihao Zhu, Jing Yu, Yujing Wang, Yajing Sun, Yue Hu, Qi Wu. Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering. International Joint Conference on Artificial Intelligence (IJCAI 2020), 2020.
2020
Zhenfang Chen, Peng Wang, Lin Ma, Kwan-Yee K. Wong, Qi Wu. Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Qi Chen*, Qi Wu* (equal contribution), Rui Tang, Yuhan Wang, Shuai Wang, Mingkui Tan. Intelligent Home 3D: Automatic 3D-House Design from Linguistic Descriptions Only. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Shizhe Chen, Yida Zhao, Qin Jin, Qi Wu. Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Shizhe Chen, Qin Jin, Peng Wang, Qi Wu. Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Ehsan Abbasnejad, Qi Wu, Iman Abbasnejad, Javen Shi, Anton van den Hengel. Gold Seeker: Information Gain from Policy Distributions for Goal-oriented Vision-and-Language Reasoning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Yuankai Qi, Qi Wu, Peter Anderson, Xin Wang, William Yang Wang, Chunhua Shen, Anton van den Hengel. REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020), 2020.
2020
Chenchen Jing, Yuwei Wu, Xiaoxun Zhang, Yunde Jia, Qi Wu. Overcoming Language Priors in VQA via Decomposed Linguistic Representations. Association for the Advancement of Artificial Intelligence (AAAI 2020), 2020.
2020
Xiaoze Jiang, Jing Yu, Zengchang Qin, Yingying Zhuang, Xingxing Zhang, Yue Hu, Qi Wu. DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue. Association for the Advancement of Artificial Intelligence (AAAI 2020), 2020.
2020
Yihan Zheng, Zhiquan Wen, Mingkui Tan, Runhao Zeng, Qi Chen, Yaowei Wang, Qi Wu. Modular Graph Attention Network for Complex Visual Relational Reasoning. Asian Conference on Computer Vision. (ACCV 2020), 2020.
2020
Zhibin Liao*, Lingqiao Liu, Qi Wu, Damien Teney, Chunhua Shen, Johan Verjans, Anton van den Hengel. Medical Data Inquiry Using a Question Answering Model. In Proceedings of IEEE International Symposium on Biomedical Imaging. (ISBI 2020), 2020.
2019
Xuguang Duan, Qi Wu, Chuang Gan, Yiwei Zhang, Wenbing Huang, Anton van den Hengel and Wenwu Zhu. Watch, Reason and Code: Learning to Represent Videos Using Program. ACM International Conference on Multimedia. (ACM MM 2019), 2019.
2019
Ehsan Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel. What's to know? Uncertainty as a Guide to Asking Goal-oriented Questions. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019.
2019
Peng Wang, Qi Wu, Jiewei Cao, Chunhua Shen, Lianli Gao, Anton van den Hengel. Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019.
2019
Junjie Zhang, Qi Wu, Jian Zhang, Chunhua Shen. Mind Your Neighbours: Image Annotation with Metadata Neighbourhood Graph Co-Attention Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019.
2018
Junjie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu. Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards. In Proceedings of European Conference on Computer Vision (ECCV 2018), 2018.
2018
Qi Wu, Peng Wang, Chunhua Shen, Ian Reid, Anton van den Hengel. Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018. [Oral]
2018
Bohan Zhuang*, Qi Wu* (equal contribution), Chunhua Shen, Ian Reid, Anton van den Hengel. Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018.
2018
Yan Huang, Qi Wu, Liang Wang. Learning Semantic Concepts and Order for Image and Sentence Matching. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018.
2018
Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sunderhauf, Ian Reid, Stephen Gould, Anton van den Hengel. Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018.
2018
Chaorui Deng*, Qi Wu* (equal contribution), Fuyuan Hu, Fan Lv, Mingkui Tan, Qingyao Wu. Visual Grounding via Accumulated Attention. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018.
2018
Chao Ma, Chunhua Shen, Anthony Dick, Qi Wu, Peng Wang, Anton van den Hengel, Ian Reid. Visual Question Answering with Memory-Augmented Networks. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), 2018.
2018
Jianpeng Zhang, Yutong Xie, Qi Wu, Yong Xia. Skin Lesion Classification in Dermoscopy Images Using Synergic Deep Learning. In Proceedings of International Conference on Medical Image Computing & Computer Assisted Intervention (MICCAI 2018), 2018.
2018
Bohan Zhuang*, Qi Wu* (equal contribution), Ian Reid, Chunhua Shen, Anton van den Hengel. HCVRD: a benchmark for large-scale Human-Centered Visual Relationship Detection. Association for the Advancement of Artificial Intelligence (AAAI 2018), 2018.
2018
Junjie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu. Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement. Association for the Advancement of Artificial Intelligence (AAAI 2018), 2018.
2017
Peng Wang*, Qi Wu* (equal contribution), Chunhua Shen, Anton van den Hengel, Anthony Dick. Explicit Knowledge-based Reasoning for Visual Question Answering. International Joint Conference on Artificial Intelligence (IJCAI 2017), 2017.
2017
Peng Wang*, Qi Wu* (equal contribution), Chunhua Shen, Anton van den Hengel. The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), 2017.
2016
Qi Wu, Peng Wang, Chunhua Shen, Anton van den Hengel, Anthony Dick. Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), 2016.
2016
Qi Wu, Chunhua Shen, Anton van den Hengel, Lingqiao Liu, Anthony Dick. What Value Do Explicit High Level Concepts Have in Vision to Language Problems? In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), 2016.
2015
Hongping Cai, Qi Wu and Peter Hall. Beyond Photo-Domain Object Recognition: Benchmarks for the Cross-Depiction Problem. In Proceedings of International Conference on Computer Vision (ICCV 2015) Workshop, 2015 (second best paper award).
2014
Qi Wu, Hongping Cai and Peter Hall. Learning Graphs to Model Visual Objects across Different Depictive Styles. In Proceedings of European Conference on Computer Vision (ECCV 2014), 2014.
2013
Qi Wu and Peter Hall. Modelling Visual Objects Invariant to Depictive Style. In Proceedings of the British Machine Vision Conference (BMVC 2013), 2013.
2012
Qi Wu and Peter Hall. Prime shapes in natural images. In Proceedings of the British Machine Vision Conference (BMVC 2012), pages 45.1-45.12. BMVA Press, 2012.