

Researcher(s): Jinyeong Chae

The knowledge-based visual question answering (KVQA) task aims to answer questions that require external knowledge in addition to an understanding of the image and the question. Recent KVQA studies inject external knowledge in multi-modal form, but as more knowledge is used, irrelevant information may also be added, which can confuse the question answering.