Publications
International Publications
"Empirical study on using Adapters for debiased Visual Question Answering"
Jae Won Cho, Dawit Mureja Argaw, Yeongtaek Oh, Dong-Jin Kim, In So Kweon
Computer Vision and Image Understanding (CVIU), 2023. (Impact Factor 4.5)
[PDF]
"Counterfactual Mix-Up for Visual Question Answering"
{Jae Won Cho*, Dong-Jin Kim*}, Yunjae Jung, In So Kweon (* Co-first authors)
IEEE Access, 2023. (Impact Factor 3.9)
[PDF]
"Local Pseudo-Attributes for Long-Tailed Recognition"
Dong-Jin Kim, Tsung-Wei Ke, Stella X. Yu
Pattern Recognition Letters (PRL), 2023. (Impact Factor 5.1)
Also presented at the "Self-Supervised Learning: Theory and Practice" workshop in conjunction with NeurIPS 2022.
"Modeling Semantic Correlation and Hierarchy for Real-world Wildlife Recognition"
Dong-Jin Kim, Zhongqi Miao, Yunhui Guo, Stella X. Yu
IEEE Signal Processing Letters (SPL), 2023. (Impact Factor 3.201)
[PDF]
Also presented at "Workshop on Human in the Loop Learning" in conjunction with NeurIPS 2022.
"Generative Bias for Robust Visual Question Answering"
Jae Won Cho, Dong-Jin Kim, Hyeonggon Ryu, and In So Kweon,
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (25.78% accept rate)
Received Bronze Prize, 28th Samsung Humantech Paper Awards (Top 2.8%)
Received Excellent Paper Award, IW-FCV 2023
Also presented at "Workshop on Open-Domain Reasoning Under Multi-Modal Settings" in conjunction with CVPR 2023
"DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning"
YoungTaek Oh, Dong-Jin Kim, and In So Kweon.
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2022. (25.3% accept rate)
Also presented at "Workshop on Learning with Limited Labelled Data for Image and Video Understanding" in conjunction with CVPR 2022.
"MCDAL: Maximum Classifier Discrepancy for Active Learning"
{Jae Won Cho*, Dong-Jin Kim*}, Yunjae Jung, and In So Kweon (* Co-first authors)
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022. (Impact Factor 14.255)
Also presented at "The Workshop on Fine-Grained Visual Categorization" in conjunction with CVPR 2022.
"Dense Relational Image Captioning via Multi-task Triple-Stream Networks"
Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, and In So Kweon.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022. (Impact Factor 24.314)
[PDF][arXiv] [Project] [Dataset] [code]
Received Qualcomm Innovation Award 2019.
"Single-Modal Entropy based Active Learning for Visual Question Answering"
{Dong-Jin Kim*, Jae Won Cho*}, Jinsoo Choi, Yunjae Jung, and In So Kweon (* Co-first authors)
British Machine Vision Conference (BMVC), 2021.
[PDF]
"LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation"
Inkyu Shin, Dong-Jin Kim, Jae Won Cho, Sanghyun Woo, KwanYong Park, and In So Kweon
IEEE International Conference on Computer Vision (ICCV), 2021. (Oral) (3% accept rate)
[PDF]
Winner of Qualcomm Innovation Fellowship 2021.
"Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation"
Jae Won Cho, Dong-Jin Kim, Yunjae Jung, Jinsoo Choi, and In So Kweon
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) Multimodal Learning and Applications Workshop, 2021.
[PDF]
Also presented at "Visual Question Answering Workshop" and "VizWiz Grand Challenge Workshop" in conjunction with CVPR 2021.
"Detecting Human-Object Interactions with Action Co-occurrence Priors"
Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, and In So Kweon,
European Conference on Computer Vision (ECCV), 2020. (27% accept rate)
[PDF] [Project] [code] [Slides] [Video] [Poster]
Received Silver Prize, 26th Samsung Humantech Paper Awards (Top 1.6%)
Also presented at "The 2nd workshop on Video Turing Test: Toward Human-Level Video Story Understanding" in conjunction with ECCV 2020.
"Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach"
Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, and In So Kweon.
International Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019. (23.8% accept rate)
[PDF] [Project] [Slides] [Poster]
Also, presented at "Language&Vision " and "Visual Question Answering and Dialog " Workshops in conjunction with CVPR 2019, and "CLVL: 3rd Workshop on Closing the Loop Between Vision and Language" in conjunction with ICCV 2019.
"Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning"
Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, and In So Kweon.
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2019. (25.2% accept rate)
[PDF] [Project] [Dataset] [code] [Slides] [Poster]
Extension of this work received Qualcomm Innovation Award 2019.
Also presented at "Language&Vision" and "Visual Question Answering and Dialog" Workshops in conjunction with CVPR 2019.
"Disjoint Multi-task Learning between Heterogeneous Human-centric Tasks"
Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, Youngjin Yoon, and In So Kweon.
IEEE Winter Conference on Applications of Computer Vision (WACV), 2018. (Oral)
[PDF]
Domestic Publications
영상처리 및 이해에 관한 워크샵, 2024.
"확산 모델의 손실 함수 개선 및 멀티 모달 다 속성 조건을 이용한 오디오 기반 이미지 조작 개선" 이관영, 차승주, 최승희, 오현우, 김동진 (Oral) 우수논문상 장려상
"이미지 생성과 지도적 대조학습을 활용한 불균형 데이터 셋 분류 문제 해결 방안" 차승주, 최승희, 이관영, 김동진
"이미지 캡셔닝에서 어댑터를 활용한 효율적 이미지 검색 방법론" 전민주, 김시우, 이소은, 김동진