Remote Sensing Large Vision-Language Model: Semantic-augmented Multi-level Alignment and Semantic-aware Expert Modeling
Sungjune Park*, Yeongyun Kim*, Se Yeon Kim*, Seongho Kim, and Yong Man Ro
arXiv preprint arXiv:2506.21863 (2025, Under review)
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes
Sungjune Park*, Hyunjun Kim*, Junho Kim, Seongho Kim, and Yong Man Ro
arXiv preprint arXiv:2505.23179 (2025, Under review)
Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images
Sungjune Park, Hyunjun Kim, Beomchan Park, and Yong Man Ro
arXiv preprint arXiv:2505.23193 (2025, Under review)
Weather-aware Drone-view Object Detection via Environmental Context Understanding
Hyunjun Kim, Dahye Lee, Sungjune Park and Yong Man Ro
IEEE International Conference on Image Processing (ICIP), 2024 (Oral)
Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Sungjune Park*, Hyunjun Kim*, and Yong Man Ro
Pattern Recognition (PR), 2024
Integrating Language-Derived Appearance Elements with Visual Cues in Pedestrian Detection
Sungjune Park*, Hyunjun Kim*, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024
Robust Multispectral Pedestrian Detection via Spectral Position-Free Feature Mapping
Sungjune Park, Jung Uk Kim, Jin Mo Song, and Yong Man Ro
IEEE International Conference on Image Processing (ICIP), 2023
Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment
Sangmin Lee, Sungjune Park, and Yong Man Ro
European Conference on Computer Vision (ECCV), 2022
Robust Thermal Infrared Pedestrian Detection By Associating Visible Pedestrian Knowledge
Sungjune Park, Dae Hwi Choi, Jung Uk Kim, and Yong Man Ro
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory
Jung Uk Kim, Sungjune Park, and Yong Man Ro
AAAI Conference on Artificial Intelligence (AAAI), 2022
IVIST: Interactive VIdeo Search Tool in VBS 2022 (1st prize on AVS task)
Sangmin Lee*, Sungjune Park* and Yong Man Ro (* Equal Contribution)
International Conference on Multimedia Modeling (MMM), 2022
Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning
Sungjune Park, Jung Uk Kim, Yeon Gyun Kim, Sang-Keun Moon, and Yong Man Ro
International Conference on Multimedia Modeling (MMM), 2021
Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning
Jung Uk Kim*, Sungjune Park* and Yong Man Ro (* Equal Contribution)
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding
Sungjune Park, Hong Joo Lee, and Yong Man Ro
IEEE Access, 2021
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection
Jung Uk Kim, Sungjune Park, and Yong Man Ro
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021
Towards Human-like Interpretable Object Detection via Spatial Relation Encoding
Jung Uk Kim*, Sungjune Park*, and Yong Man Ro (* Equal Contribution)
IEEE International Conference on Image Processing (ICIP), 2020 (Oral)