Poster
Visual Question Answering with Question Representation Update (QRU)
Ruiyu Li · Jiaya Jia
Our method reasons jointly over natural language questions and images. Given a natural language question about an image, our model iteratively updates the question representation by selecting image regions relevant to the query, and learns to give the correct answer. The model contains several reasoning layers, which exploit complex visual relations in the visual question answering (VQA) task. The proposed network is end-to-end trainable through back-propagation, with its weights initialized from a pre-trained convolutional neural network (CNN) and gated recurrent unit (GRU). Our method is evaluated on the challenging COCO-QA and VQA datasets and yields state-of-the-art performance.
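To make the idea of iteratively updating a question representation more concrete, the following is a minimal sketch and not the authors' network. It assumes the question and each image region are already encoded as vectors of the same length, scores regions by dot product, and uses a plain additive update. The function name `update_question`, the `num_layers` parameter, and the update rule itself are illustrative assumptions; the abstract does not specify them, and the CNN and GRU encoders it mentions are left out.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def update_question(query, regions, num_layers=2):
    """Refine a query vector over several attention steps.

    Assumed (hypothetical) rule per layer:
        weights  = softmax(q . r_i)        # relevance of each region
        attended = sum_i weights_i * r_i   # weighted region summary
        q        = q + attended            # updated question vector
    Returns the final query and the attention weights of every layer.
    """
    q = list(query)
    history = []
    for _ in range(num_layers):
        weights = softmax([dot(q, r) for r in regions])
        attended = [
            sum(w * r[d] for w, r in zip(weights, regions))
            for d in range(len(q))
        ]
        q = [qi + ai for qi, ai in zip(q, attended)]
        history.append(weights)
    return q, history
```

Calling `update_question([2.0, 0.0], [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])` returns the refined query together with one list of attention weights per layer. In this toy input the first region is the one most aligned with the query, so it receives the largest weight at each step.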
Author Information
Ruiyu Li (CUHK)
Jiaya Jia (CUHK)
More from the Same Authors
- 2022 Poster: Unifying Voxel-based Representation with Transformer for 3D Object Detection
  Yanwei Li · Yilun Chen · Xiaojuan Qi · Zeming Li · Jian Sun · Jiaya Jia
- 2023 Poster: Real-World Image Variation by Aligning Diffusion Inversion Chain
  Yuechen Zhang · Jinbo Xing · Eric Lo · Jiaya Jia
- 2023 Poster: DiffComplete: Diffusion-based Generative 3D Shape Completion
  Ruihang Chu · Enze Xie · Shentong Mo · Zhenguo Li · Matthias Niessner · Chi-Wing Fu · Jiaya Jia
- 2021 Poster: Blending Anti-Aliasing into Vision Transformer
  Shengju Qian · Hao Shao · Yi Zhu · Mu Li · Jiaya Jia
- 2020 Poster: LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond
  Wenbo Li · Kun Zhou · Lu Qi · Nianjuan Jiang · Jiangbo Lu · Jiaya Jia
- 2018 Poster: Image Inpainting via Generative Multi-column Convolutional Neural Networks
  Yi Wang · Xin Tao · Xiaojuan Qi · Xiaoyong Shen · Jiaya Jia
- 2018 Poster: Sequential Context Encoding for Duplicate Removal
  Lu Qi · Shu Liu · Jianping Shi · Jiaya Jia
- 2014 Poster: Deep Convolutional Neural Network for Image Deconvolution
  Li Xu · Jimmy S. Ren · Ce Liu · Jiaya Jia