U-AIM SW STARLab - 연구 성과

학술적 연구 성과

우수 학술대회 성과

[2025]

Luu, T. M., Lee, D., Lee, Y., & Yoo, C. D. (2025). Policy Learning from Large Vision-Language Model Feedback Without Reward Modeling. arXiv preprint arXiv:2507.23391. (IROS 2025)
Lee, D., Luu, T. M., Lee, Y., & Yoo, C. D. (2025, April). Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation. In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE. (ICASSP 2025)
Lee, Y., Luu, T.M., Lee, D., & Yoo, C.D. (2025). Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5. (ICASSP 2025)

[2024]

Yoon, E., Yoon, H. S., Harvill, J., Hasegawa-Johnson, M., & Yoo, C. D. (2024). Li-tta: Language informed test-time adaptation for automatic speech recognition. arXiv preprint arXiv:2408.05769. (INTERSPEECH 2024)
Yoon, H. S., Yoon, E., Tee, J. T. J., Zhang, K., Heo, Y. J., Chang, D. S., & Yoo, C. D. (2024, September). Bi-mdrg: Bridging image history in multimodal dialogue response generation. In European Conference on Computer Vision (pp. 378-396). Cham: Springer Nature Switzerland. (ECCV 2024)
Yoon, S., Koo, G., Hong, J. W., & Yoo, C. D. (2024, September). Dni: Dilutional noise initialization for diffusion video editing. In European Conference on Computer Vision (pp. 180-195). Cham: Springer Nature Switzerland. (ECCV 2024)
Koo, G., Yoon, S., Hong, J. W., & Yoo, C. D. (2024, September). Flexiedit: Frequency-aware latent refinement for enhanced non-rigid editing. In European Conference on Computer Vision (pp. 363-379). Cham: Springer Nature Switzerland. (ECCV 2024)
Song, S., Yang, S., Yoo, C. D., & Kim, J. (2024, September). Implicit Steganography Beyond the Constraints of Modality. In European Conference on Computer Vision (pp. 289-304). Cham: Springer Nature Switzerland. (ECCV 2024)
Yoon, S., Koo, G., Kim, G., & Yoo, C. D. (2024). Frag: Frequency adapting group for diffusion video editing. arXiv preprint arXiv:2406.06044. (ICML 2024)
Pham, T. X., Kang, Z., & Yoo, C. D. (2024). Cross-view masked diffusion transformers for person image synthesis. arXiv preprint arXiv:2402.01516. (ICML 2024)

[2023]

Yoon, S., Koo, G., Kim, D., & Yoo, C. D. (2023). Scanet: Scene complexity aware network for weakly-supervised video moment retrieval. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 13576-13586). (ICCV 2023)
Yoon, E., Yoon, H. S., Gowda, D., Eom, S. H., Kim, D., Harvill, J., ... & Yoo, C. D. (2023). Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2023, pp. 2028-2032). (INTERSPEECH 2023)

[2022]

Zhang, C., Zhang, K., Pham, T. X., Niu, A., Qiao, Z., Yoo, C. D., & Kweon, I. S. (2022). Dual temperature helps contrastive learning without many negative samples: Towards understanding and simplifying moco. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 14441-14450). (CVPR 2022)
Yoon, S., Hong, J. W., Yoon, E., Kim, D., Kim, J., Yoon, H. S., & Yoo, C. D. (2022, October). Selective Query-Guided Debiasing for Video Corpus Moment Retrieval. In European Conference on Computer Vision (pp. 185-200). Cham: Springer Nature Switzerland. (ECCV 2022)

최우수 학술대회 성과

[2025]

Hong, J. W., Ton, T., Pham, T. X., Koo, G., Yoon, S., & Yoo, C. D. (2025). ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On. In Proceedings of the Computer Vision and Pattern Recognition Conference (pp. 28284-28294). (CVPR 2025)
Yoon, E., Yoon, H. S., Hasegawa-Johnson, M. A., & Yoo, C. D. (2025). Can video llms refuse to answer? alignment for answerability in video large language models. In The Thirteenth International Conference on Learning Representations. (ICLR 2025)
Pham, T. X., Ton, T., & Yoo, C. D. (2024). Mdsgen: Fast and efficient masked diffusion temporal-aware transformers for open-domain sound generation. arXiv preprint arXiv:2410.02130. (ICLR 2025)
Koo, G., Yoon, S., Lee, Y., Hong, J. W., & Yoo, C. D. (2025). Flowdrag: 3d-aware drag-based image editing with mesh-guided deformation vector flow fields. arXiv preprint arXiv:2507.08285. (ICML 2025) (Spotlight)
Yoon, H. S., Yoon, E., Hasegawa-Johnson, M. A., Kim, S., & Yoo, C. D. ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization. In Forty-second International Conference on Machine Learning. (ICML 2025)
Luu, T. M., Lee, Y., Lee, D., Kim, S., Kim, M. J., & Yoo, C. D. (2025). Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models. arXiv preprint arXiv:2506.12822. (ICML 2025)
Yoon, S., Koo, G., Lee, Y., Hong, J. W., & Yoo, C. D. (2025). Occlusion-robust Stylization for Drawing-based 3D Animation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 12263-12273). (ICCV 2025)
Tee, J. T. J., Yoon, H. S., Syarubany, A. H. M., Yoon, E., & Yoo, C. D. A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models. In The Thirty-ninth Annual Conference on Neural Information Processing Systems. (ICCV 2025)

[2024]

Ryu, H., Yoon, S., Yoon, H. S., Yoon, E., & Yoo, C. D. (2024, March). Simpsi: A simple strategy to preserve spectral information in time series data augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 38, No. 13, pp. 14857-14865). (AAAI 2024)
Yoon, S., Koo, G., Lee, Y., & Yoo, C. (2024). Tpc: Test-time procrustes calibration for diffusion-based human image animation. Advances in Neural Information Processing Systems, 37, 118654-118677. (NeurIPS 2024)

[2023]

Yoon, S., Kim, D., Yoon, E., Yoon, H., Kim, J., & Yoo, C. (2023, December). HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue. In Findings of the Association for Computational Linguistics: EMNLP 2023 (pp. 11911-11924). (EMNLP 2023)

[2022]

Lee, Junghyun, et al. "Fast and efficient MMD-based fair PCA via optimization over Stiefel manifold." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 36. No. 7. 2022. (AAAI 2022)
Vu, T., Kim, K., Luu, T. M., Nguyen, T., & Yoo, C. D. (2022). Softgroup for 3d instance segmentation on point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 2708-2717). (CVPR 2022) (Oral)
Yoon, S., Yoon, E., Yoon, H. S., Kim, J., & Yoo, C. (2022, December). Information-Theoretic Text Hallucination Reduction for Video-grounded Dialogue. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (pp. 4182-4193). (EMNLP 2022)

국내 학술 대회 성과

[2024]

이영환, & 유창동. (2024). LLaVA 를 활용한 분포 외 데이터 탐지. 대한전자공학회 학술대회, 1528-1531.
이동훈, & 유창동. (2024). 로봇 매니퓰레이터의 다양한 작업 확장성을 위한 지속적 강화학습 기법. 대한전자공학회 학술대회, 1733-1736.
나해빈. (2024). Mamba Survey: 시퀀스 모델링을 위한 상태 공간 모델의 발전 및 응용. 대한전자공학회 학술대회, 2403-2406.

[2023]

두경빈, 유창동. (2023-05-02). Support Vector Machine을 이용한 자동항공관제시스템. 대한전자공학회 학술대회, 제주.

[2022]

김대혁, 유창동. (2022-06-29). 딥러닝 기반 컴퓨터 비전의 공정성을 위한 기법 조사 - 학습 데이터를 중심으로. 대한전자공학회 학술대회, 제주.
엄수환, 유창동. (2022-06-29). ImageNet Dataset 벤치마크에서 Transformer 기반 모델들의 비교와 분석. 대한전자공학회 학술대회, 제주.

[2021]

김다현, 유창동. (2021-06-30). 인공지능을 이용한 비디오 그룹 내 장면 검색. 대한전자공학회 학술대회, 제주.
Ji Woo Hong, Chang Dong Yoo. (2021-06-30). 대한전자공학회 학술대회, 제주.

국제 학술 대회 성과

[2024]

Koo, G., Yoon, S., & Yoo, C. D. (2024, April). Wavelet-guided acceleration of text inversion in diffusion-based image editing. In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4380-4384). IEEE. (ICASSP 2024)
Cho, S. J., Kim, G., Lee, J., Shin, J., & Yoo, C. D. (2024). Querying easily flip-flopped samples for deep active learning. arXiv preprint arXiv:2401.09787. (ICLR 2024)
Kang, H., Yoon, J., Kim, D., Hwang, S. J., & Yoo, C. D. (2024, January). Progressive fourier neural representation for sequential video compilation. In The Twelfth International Conference on Learning Representations. (ICLR 2024)
Lee, D., Shim, J. Y., Yoon, S. J., & Yoo, C. D. (2024, June). Character Identifying Video Language Alignment Network for Weakly-Supervised Video-Subtitle Moment Retrieval. In International Conference on Pattern Recognition and Artificial Intelligence (pp. 108-123). Singapore: Springer Nature Singapore. (PRAI 2024)
Nguyen, T., Luu, T. M., Ton, T., & Yoo, C. D. (2024, June). Towards robust policy: Enhancing offline reinforcement learning with adversarial attacks and defenses. In International Conference on Pattern Recognition and Artificial Intelligence (pp. 310-324). Singapore: Springer Nature Singapore. (PRAI 2024)
Luu, T. M., Nguyen, T., Jin, T. J. T., Kim, S., & Yoo, C. D. (2024, October). Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 595-602). IEEE. (IROS 2024)
Luu, T. M., Lee, D., & Yoo, C. D. (2024, October). Predictive coding for decision transformer. In 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 7469-7476). IEEE. (IROS 2024)
Tee, J. T. J., Zhang, K., Yoon, H. S., Gowda, D. N., Kim, C., & Yoo, C. D. (2024). Physics informed distillation for diffusion models. arXiv preprint arXiv:2411.08378. (TMLR 2024)

[2023]

Yoon, S., Hong, J. W., Eom, S., Yoon, H. S., Yoon, E., Kim, D., ... & Yoo, C. D. (2023, June). Counterfactual Two-Stage Debiasing For Video Corpus Moment Retrieval. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE. (ICASSP 2023) (Oral)
Kang, H., Yoon, J., Madjid, S. R. H., Hwang, S. J., & Yoo, C. D. (2022, September). On the Soft-Subnetwork for Few-Shot Class Incremental Learning. In The Eleventh International Conference on Learning Representations. (ICLR 2023)
Yoon, H. S., Tee, J. T. J., Yoon, E., Yoon, S., Kim, G., Li, Y., & Yoo, C. D. (2022, September). ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure. In The Eleventh International Conference on Learning Representations. (ICLR 2023)

[2022]

Kang, H., Mina, R. J. L., Madjid, S. R. H., Yoon, J., Hasegawa-Johnson, M., Hwang, S. J., & Yoo, C. D. (2022, June). Forget-free continual learning with winning subnetworks. In International Conference on Machine Learning (pp. 10734-10750). PMLR. (ICML 2022)
Kim, D., Yoon, S., Hong, J. W., & Yoo, C. D. (2022, May). Semantic association network for video corpus moment retrieval. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1720-1724). IEEE. (ICASSP 2022)
Zhang, C., Zhang, K., Zhang, C., Pham, T. X., Yoo, C. D., & Kweon, I. S. (2021, October). How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning. In International Conference on Learning Representations. (ICLR 2022)

[2021]

Yoon, S., Kim, D., Hong, J. W., Kim, J., Kim, K., & Yoo, C. D. (2021, September). Weakly-supervised moment retrieval network for video corpus moment retrieval. In 2021 IEEE International Conference on Image Processing (ICIP) (pp. 534-538). IEEE. (ICIP 2021)
Vu, T., Kim, K., Kang, H., Nguyen, X. T., Luu, T. M., & Yoo, C. D. (2021, September). Sphererpn: Learning spheres for high-quality region proposals on 3D point clouds object detection. In 2021 IEEE International Conference on Image Processing (ICIP) (pp. 3173-3177). IEEE. (ICIP 2021)
Nguyen, T., Luu, T. M., Vu, T., & Yoo, C. D. (2021, September). SampleA-efficient reinforcement learning representation learning with curiosity contrastive forward dynamics model. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (pp. 3471-3477). IEEE. (IROS 2021)

SCI 학술지 성과

[2025]

Kang, H., Yoon, J., Hwang, S. J., & Yoo, C. D. (2024). Continual learning: Forget-free winning subnetworks for video representations. IEEE Transactions on Pattern Analysis and Machine Intelligence. (IF=18.6)

[2024]

Yoon, S., Koo, G., Shim, J. Y., Eom, S., Hong, J. W., & Yoo, C. D. (2024). Causal Localization Network for Radar Human Localization With Micro-Doppler Signature. IEEE Access, 12, 38275-38286.
Nguyen, T., Luu, T. M., Ton, T., Kim, S., & Yoo, C. D. (2024). Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning. IEEE Access, 12, 100972-100982.
Cho, S. J., Kim, G., & Yoo, C. D. (2024). Hypothesis perturbation for active learning. IEEE Journal of Selected Topics in Signal Processing.

[2023]

Vu, T., Kim, K., Nguyen, T., Luu, T. M., Kim, J., & Yoo, C. D. (2023). Scalable SoftGroup for 3D Instance Segmentation on Point Clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence. (IF=23.6)
Nguyen, T., Pham, T. X., Zhang, C., Luu, T. M., Vu, T., & Yoo, C. D. (2023). DimCL: Dimensional Contrastive Learning for Improving Self-Supervised Learning. IEEE Access, 11, 21534-21545.

[2022]

Pham, T. X., Choi, J. W., Mina, R. J. L., Nguyen, T. X., Madjid, S. R., & Yoo, C. D. (2022). Lad: A hybrid deep learning system for benign paroxysmal positional vertigo disorders diagnostic. IEEE Access, 10, 113995-114007.
Kim, G., Yoo, C. D., & Yang, S. J. (2022). Survival Analysis of COVID-19 Patients With Symptoms Information by Machine Learning Algorithms. IEEE Access, 10, 62282-62291.
Luu, T. M., Nguyen, T., Vu, T., & Yoo, C. D. (2022). Utilizing skipped frames in action repeats for improving sample efficiency in reinforcement learning. IEEE Access, 10, 64965-64975.
Yoon, S., Kim, D., Kim, J., & Yoo, C. D. (2022). Cascaded MPN: Cascaded Moment Proposal Network for Video Corpus Moment Retrieval. IEEE Access, 10, 64560-64568.
Yoon, S., Kim, D., Hong, J. W., Kim, J., & Yoo, C. D. (2022). Dual-scale doppler attention for human identification. Sensors, 22(17), 6363.

Page updated

Google Sites

Report abuse