Mingi Choi

I received my master's degree from Hallym University under the supervision of Prof. Hou Jong-Uk. My major interests are computer vision, multimodal learning, and digital forensic.

Contact: chlalsrl98@naver.com

Information: Google Scholar, GitHub

Education

(2022.03 ~ 2024.02) M.S. in computer engineering, Hallym University, Chuncheon, Korea
(2016.03 ~ 2022.02) B.S. in convergence software, Hallym University, Chuncheon, Korea

Working Experience

(2022.03 ~ 2024.02) Master of Multimedia Computing Laboratory
(2021.06 ~ 2022.02) Undergraduate Research Student in Multimedia Computing Laboratory

Study & Projects

(2022.11 ~ preparing) Geometric Transformation Estimation Research Journal Extension
(2022.05 ~ 2023.11) Autonomous Vehicle Identification Project
(2021.08 ~ 2024.03) Analysis of CPR at Dongtan Sacred Heart Hospital
(2023.02 ~ 2023.10) Development of OCR & AI Technology for Cyber Threat Documents

Research Interest

Computer Vision
Multimodal Learning
Digital Forensic

Project and Publication

(2021.09 - 2022.10)

Transformers in Spectral Domain for Estimating Image Geometric Transformation

We propose an end-to-end transformer-based estimator that can predict the geometric transformation parameters of an image. Deviating from the existing classification-based formulation, we provided a more generalized method by directly estimating the transformation matrix. We note that the frequency peak position of the inherent resampling artifacts leaves explicit clues for the geometric transformation.
Role: Main model and comparison paper implementation (CNN-based models and hand-crafted methods), Dataset collection automation, Paper writing
ACM MM 2022, Published [Paper] [Code]
ISSN: 1947-4598

(2022.05 - 2023.11)

Towards Driver Identification through Multimodal Data Fusion

Developed a novel methodology to understand driver habits using multimodal technology that integrates various data obtained from autonomous vehicles. Analyzed the relationship between Advanced Driver Assistance Systems (ADAS) activation and driver identification and profiling. Aimed to understand and analyze driver reactions in various situations.
Role: Driver profiling model structure design, Main architecture implementation contribution, Validation dataset collection and preprocessing
Development of a multimodal learning model for driver and autopilot identification

(2021.08 - 2024.03)

Harnessing Optical Flow in Deep Learning Framework for Cardiopulmonary Resuscitation Training

The global rise in out-of-hospital cardiac arrests underscores the importance of cardiopulmonary resuscitation (CPR) training. To address this, we propose a deep learning solution that transforms smartphone-captured chest compression videos into images for feedback. This model assesses four key CPR quality indicators: compression count, depth, complete release, and hand positioning.
Role: Video classification comparison paper implementation, Main model implementation contribution, Key image processing technology development (average image, critical image), Paper writing
ELSEVIER 2024, Published [Paper]
ISSN: 0957-4174

(2023.02 - 2023.10)

Development of an OCR Pipeline for Understanding Cyber Threat Documents

Analyzed vulnerabilities that reduce the accuracy of Optical Character Recognition (OCR) technology to enhance the security of confidential documents and improve the efficiency of cyber threat response through automated classification and recognition technologies. Developed an AI model for recognizing text within images to automate the classification of cyber threat documents. Built a pipeline integrating layout recognition and OCR technologies.
Role: Layout parser development, Text detection, Text recognition, Report writing and review
Development of an end-to-end pipeline for extracting text from images within documents

(2022.08 - 2022.11)

A Survey of the Impact of Loss Landscape on Generalization

The purpose of learning a neural network is to facilitate comprehension of the network structure and enhance its generalization performance. We experimented to explore the relationship between loss landscapes and generalization performance. Utilizing the transformer architecture's success based on the multi-head self-attention (MSA) layer in vision tasks, we aim to establish a flatter loss landscape and observe an enhanced generalization performance.
Role: Main architecture, Sliding Window MSA, SAM optimizer, Loss landscape visualization implementation and application, Paper writing
ACK 2022.11.02. Published
ISSN: 2671-7298

Honor and Awards

(2022.11) Outstanding Paper Award of the Korea Information Processing Society, ACK, Korea
(2021.06) (Kaggle) Hallym AI-X R&D Challenge - AI Competition to Remove Print Stains (2nd, Silver Award), Hallym University, Korea

Publication

Seongji Ko, Yoongeol Lee, Mingi Choi, Jong-Uk Hou* “Harnessing Optical Flow in Deep Learning Framework for Cardiopulmonary Resuscitation Training”, ELSEVIER, 2024.
Mingi Choi, Sangyeong Lee, Heesun Jung, Jong-Uk Hou* "Transformers in Spectral Domain for Estimating Image Geometric Transformation", ACM MM, 2022.
Mingi Choi, Soeun Lee, Jong-Uk Hou* "A Study on Loss Landscape Affecting the Performance Generalization of Transformer", KIPs, 2022.

Patent

[Patent] Transformer-based electronic apparatus for estimating image transformation and control method thereof. Republic of Korea. 10-2022-0109941

Page updated

Google Sites

Report abuse