I received my master's degree from Hallym University under the supervision of Prof. Hou Jong-Uk. My major interests are computer vision, multimodal learning, and digital forensic.
Contact: chlalsrl98@naver.com
Information: Google Scholar, GitHub
(2022.03 ~ 2024.02) M.S. in computer engineering, Hallym University, Chuncheon, Korea
(2016.03 ~ 2022.02) B.S. in convergence software, Hallym University, Chuncheon, Korea
(2022.03 ~ 2024.02) Master of Multimedia Computing Laboratory
(2021.06 ~ 2022.02) Undergraduate Research Student in Multimedia Computing Laboratory
(2022.11 ~ preparing) Geometric Transformation Estimation Research Journal Extension
(2022.05 ~ 2023.11) Autonomous Vehicle Identification Project
(2021.08 ~ 2024.03) Analysis of CPR at Dongtan Sacred Heart Hospital
(2023.02 ~ 2023.10) Development of OCR & AI Technology for Cyber Threat Documents
Computer Vision
Multimodal Learning
Digital Forensic
(2021.09 - 2022.10)
We propose an end-to-end transformer-based estimator that can predict the geometric transformation parameters of an image. Deviating from the existing classification-based formulation, we provided a more generalized method by directly estimating the transformation matrix. We note that the frequency peak position of the inherent resampling artifacts leaves explicit clues for the geometric transformation.
Role: Main model and comparison paper implementation (CNN-based models and hand-crafted methods), Dataset collection automation, Paper writing
ISSN: 1947-4598
(2022.05 - 2023.11)
Developed a novel methodology to understand driver habits using multimodal technology that integrates various data obtained from autonomous vehicles. Analyzed the relationship between Advanced Driver Assistance Systems (ADAS) activation and driver identification and profiling. Aimed to understand and analyze driver reactions in various situations.
Role: Driver profiling model structure design, Main architecture implementation contribution, Validation dataset collection and preprocessing
Development of a multimodal learning model for driver and autopilot identification
(2021.08 - 2024.03)
The global rise in out-of-hospital cardiac arrests underscores the importance of cardiopulmonary resuscitation (CPR) training. To address this, we propose a deep learning solution that transforms smartphone-captured chest compression videos into images for feedback. This model assesses four key CPR quality indicators: compression count, depth, complete release, and hand positioning.
Role: Video classification comparison paper implementation, Main model implementation contribution, Key image processing technology development (average image, critical image), Paper writing
ELSEVIER 2024, Published [Paper]
ISSN: 0957-4174
(2023.02 - 2023.10)
Analyzed vulnerabilities that reduce the accuracy of Optical Character Recognition (OCR) technology to enhance the security of confidential documents and improve the efficiency of cyber threat response through automated classification and recognition technologies. Developed an AI model for recognizing text within images to automate the classification of cyber threat documents. Built a pipeline integrating layout recognition and OCR technologies.
Role: Layout parser development, Text detection, Text recognition, Report writing and review
Development of an end-to-end pipeline for extracting text from images within documents
(2022.08 - 2022.11)
The purpose of learning a neural network is to facilitate comprehension of the network structure and enhance its generalization performance. We experimented to explore the relationship between loss landscapes and generalization performance. Utilizing the transformer architecture's success based on the multi-head self-attention (MSA) layer in vision tasks, we aim to establish a flatter loss landscape and observe an enhanced generalization performance.
Role: Main architecture, Sliding Window MSA, SAM optimizer, Loss landscape visualization implementation and application, Paper writing
ACK 2022.11.02. Published
ISSN: 2671-7298
(2022.11) Outstanding Paper Award of the Korea Information Processing Society, ACK, Korea
(2021.06) (Kaggle) Hallym AI-X R&D Challenge - AI Competition to Remove Print Stains (2nd, Silver Award), Hallym University, Korea
Seongji Ko, Yoongeol Lee, Mingi Choi, Jong-Uk Hou* “Harnessing Optical Flow in Deep Learning Framework for Cardiopulmonary Resuscitation Training”, ELSEVIER, 2024.
Mingi Choi, Sangyeong Lee, Heesun Jung, Jong-Uk Hou* "Transformers in Spectral Domain for Estimating Image Geometric Transformation", ACM MM, 2022.
Mingi Choi, Soeun Lee, Jong-Uk Hou* "A Study on Loss Landscape Affecting the Performance Generalization of Transformer", KIPs, 2022.
[Patent] Transformer-based electronic apparatus for estimating image transformation and control method thereof. Republic of Korea. 10-2022-0109941