20. SHREC 2025: Retrieval of Optimal Objects for Multi-modal Enhanced Language and Spatial Assistance (ROOMELSA)
Trong-Thuan Nguyen, Viet-Tham Huynh, Quang-Thuc Nguyen, Hoang-Phuc Nguyen, Long Le Bao, Thai Hoang Minh, Minh Nguyen Anh, Thang Nguyen Tien, Phat Nguyen Thuan, Huy Nguyen Phong, Bao Huynh Thai, Vinh-Tiep Nguyen, Duc-Vu Nguyen, Phu-Hoa Pham, Minh-Huy Le-Hoang, Nguyen-Khang Le, Minh-Chinh Nguyen, Minh-Quan Ho, Ngoc-Long Tran, Hien-Long Le-Hoang, Man-Khoi Tran, Anh-Duong Tran, Kim Nguyen, Quan Nguyen Hung, Dat Phan Thanh, Hoang Tran Van, Tien Huynh Viet, Nhan Nguyen Viet Thien, Dinh-Khoi Vo, Van-Loc Nguyen, Trung-Nghia Le, Tam V. Nguyen, Minh-Triet Tran
Computers & Graphics (Special Section on 3DOR 2025) (Q2, IF = 2.8 in 2024)
[PDF]
19. DYNAFormer: Enhancing transformer segmentation with dynamic anchor mask for medical imaging
Tan-Cong Nguyen, Kim Anh Phung, Thao Thi Phuong Dao, Trong-Hieu Nguyen-Mau, Thuc Nguyen-Quang, Cong Nhan Pham, Trung-Nghia Le, Ju Shen, Tam V. Nguyen, Minh-Triet Tran
Computers in Biology and Medicine (Q1, IF = 6.3 in 2024)
[PDF]
18. LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification
Shuhan Cui, Huy H. Nguyen, Trung-Nghia Le, Chun-Shien Lu, Isao Echizen
IEEE Access (Q1, IF = 3.9 in 2022)
[PDF]
17. GUNNEL: Guided Mixup Augmentation and Multi-Model Fusion for Aquatic Animal Segmentation
Minh-Quan Le*, Trung-Nghia Le*, Tam V. Nguyen, Isao Echizen, Minh-Triet Tran
Neural Computing & Applications (Q1, IF = 4.5 in 2023)
[PDF] [Project Page] [Dataset]
16. Artificial Intelligence for Laryngoscopy in Vocal Fold Diseases: A Review of Dataset, Technology, and Ethics
Thao Thi Phuong Dao, Tan-Cong Nguyen, Viet-Tham Huynh, Xuan-Hai Bui, Trung-Nghia Le, Minh-Triet Tran
Machine Learning (Q1, IF = 4.3 in 2023) (ACML 2024, Journal track)
[PDF]
15. Improving Laryngoscopy Image Analysis through Integration of Global Information and Local Features in VoFoCD Dataset
Thao Thi Phuong Dao, Tuan-Luc Huynh, Minh-Khoi Pham, Trung-Nghia Le, Tan-Cong Nguyen, Quang-Thuc Nguyen, Bich Anh Tran, Boi Ngoc Van, Chanh Cong Ha, Minh-Triet Tran
Imaging Informatics in Medicine (Q1, IF = 4.4 in 2022)
[PDF]
14. eKYC-DF: A Large-Scale Deepfake Dataset for Developing and Evaluating eKYC Systems
Hichem Felouat, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen
IEEE Access (Q1, IF = 3.9 in 2022)
[PDF]
13. Analysis of Fine-grained Counting Methods for Masked Face Counting: A Comparative Study
Khanh-Duy Nguyen, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen
IEEE Access (Q1, IF = 3.9 in 2022)
[PDF]
12. SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Nhat-Quynh Le-Pham, Huu-Phuc Pham, Trong-Vu Hoang, Quang-Binh Nguyen, Trong-Hieu Nguyen-Mau, Tuan-Luc Huynh, Thanh-Danh Le, Ngoc-Linh Nguyen-Ha, Tuong-Vy Truong-Thuy, Truong Hoai Phong, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran, Tuan-Anh Yang, Kim-Phat Tran, Nhu-Vinh Hoang, Minh-Quang Nguyen, Hoai-Danh Vo, Minh-Hoa Doan, Hai-Dang Nguyen, Akihiro Sugimoto, Minh-Triet Tran
Computers & Graphics (Special Section on 3DOR 2023) (Q2, IF = 2.62 in 2022)
[PDF]
11. TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
Trung-Nghia Le, Tam V. Nguyen, Minh-Quan Le, Trong-Thuan Nguyen, Viet-Tham Huynh, Trong-Le Do, Khanh-Duy Le, Mai-Khiem Tran, Nhat Hoang-Xuan, Thang-Long Nguyen-Ho, Vinh-Tiep Nguyen, Tuong-Nghiem Diep, Khanh-Duy Ho, Xuan-Hieu Nguyen, Thien-Phuc Tran, Tuan-Anh Yang, Kim-Phat Tran, Nhu-Vinh Hoang, Minh-Quang Nguyen, E-Ro Nguyen, Minh-Khoi Nguyen-Nhat, Tuan-An To, Trung-Truc Huynh-Le, Nham-Tan Nguyen, Hoang-Chau Luong, Truong Hoai Phong, Nhat-Quynh Le-Pham, Huu-Phuc Pham, Trong-Vu Hoang, Quang-Binh Nguyen, Hai-Dang Nguyen, Akihiro Sugimoto, Minh-Triet Tran
Computers & Graphics (Special Section on 3DOR 2023) (Q2, IF = 2.62 in 2022)
[PDF]
10. Purifying Adversarial Images using Adversarial Autoencoders with Conditional Normalizing Flows
Yi Ji, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen
IEEE Open Journal of Signal Processing (ICIP, Journal Track) (Q2, IF = 2.89 in 2022)
[PDF]
9. Image Synthesis: A Review of Methods, Datasets, Evaluation Metrics, and Future Outlook
Samah Saeed Baraheem, Trung-Nghia Le, Tam V. Nguyen
Artificial Intelligence Review (Q1, IF = 12.0 in 2022)
[PDF]
8. Current Status of Deepfake Generation and Detection (Deepfakeの生成と検出の現状)
Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
The Journal of The Institute of Image Information and Television Engineers (ITE), 07/2022 (Vol.76 No.4), Special Feature: AI and Cyber Security in the Infodemic Era (In Japanese, ISSN 1342-6907)
[PDF]
7. Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation
Trung-Nghia Le, Tam V. Nguyen, Minh-Triet Tran
Machine Vision and Applications (MVA) (Q2, IF = 3.3 in 2022)
[PDF] [Project Page]
6. Robust Deepfake On Unrestricted Media: Generation And Detection
Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
Frontiers in Fake Media Generation and Detection
5. Camouflaged Instance Segmentation In-The-Wild: Dataset, Method, and Benchmark Suite
Trung-Nghia Le, Yubo Cao, Tan-Cong Nguyen, Minh-Quan Le, Khanh-Duy Nguyen, Thanh-Toan Do, Minh-Triet Tran, Tam V. Nguyen
IEEE Transactions on Image Processing (T-IP) (Q1, IF = 10.6 in 2022)
[PDF] [Project Page]
4. Masked Face Analysis via Multi-task Deep Learning
Vatsa S Patel, Zhongliang Nie, Trung-Nghia Le, Tam V. Nguyen
Journal of Imaging (Q2, IF = 3.2 in 2022)
[PDF]
3. MirrorNet: Bio-Inspired Camouflaged Object Segmentation
Jinnan Yan, Trung-Nghia Le, Khanh-Duy Nguyen, Minh-Triet Tran, Thanh-Toan Do, Tam V. Nguyen
IEEE Access (Q1, IF = 3.9 in 2022)
[PDF] [Project Page]
2. Anabranch Network for Camouflaged Object Segmentation
Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, Akihiro Sugimoto
Computer Vision and Image Understanding (CVIU) (Q1, IF = 4.5 in 2024)
[PDF] [Project Page]
92. Vortex: Multi-Modal Fusion System for Intelligent Video Retrieval
Duc-Tho Nguyen, Hieu-Hoc Tran-Minh, Khanh-Hoa Lam, Hoang-Nhut Ly, Huu-Phuc Huynh, Thanh-Tien Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
91. Hierarchical Multi-Modal Retrieval for News Image Captioning
Minh-Loi Nguyen*, Xuan-Vu Le*, Long-Bao Nguyen, Hoang-Bach Ngo, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
90. Forged Calamity: Benchmark for Cross-Domain Synthetic Disaster Detection in the Age of Diffusion
Duc-Manh Phan*, Quoc-Duy Tran*, Duy-Khang Do*, Anh-Tuan Vo, Hai-Dang Nguyen, Trong Le Do, Mai-Khiem Tran, Vinh-Tiep Nguyen, Tam V. Nguyen, Isao Echizen, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
89. CIAN: Multi-Stage Framework for Event-Enriched Image Captioning via Retrieval-Augmented Generation
Thi Thu Hien Trinh, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
88. VisionGuard: Synergistic Framework for Helmet Violation Detection
Thanh-Hai Nguyen*, Thinh-Phuc Nguyen*, Gia-Huy Dinh*, Lam-Huy Nguyen*, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
87. Edit3DGS: Unified Framework for Dynamic Head Editing via 2D Instruction-Guided Diffusion and 3D Gaussian Splatting
Duy-Dat Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
86. Visual Retrieval-Augmented Generation for Silhouette-Guided Animal Art
Quoc-Duy Tran, Anh-Tuan Vo, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
85. Exploring Multi-Modal Large Language Models and Two-Stage Fine-Tuning for Fashion Image Retrieval
Nguyen Hoang Cao*, Hoang Bui Le*, Nam Vo Hoang*, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
84. DTD-Mamba: Dual Teacher Distillation for Mamba in Head and Neck Abscess Segmentation
Thao Thi Phuong Dao, Tan-Cong Nguyen, Trong-Le Do, Mai-Khiem Tran, Minh-Khoi Pham, Trung-Nghia Le, Minh-Triet Tran, Thanh Dinh Le
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
83. MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration
Thao Thi Phuong Dao, Tan-Cong Nguyen, Nguyen Chi Thanh, Truong Hoang Viet, Trong-Le Do, Mai-Khiem Tran, Minh-Khoi Pham, Trung-Nghia Le, Minh-Triet Tran, Thanh Dinh Le
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
82. AEye: Avian Monitoring from Streaming Videos
Kasturi Jamale*, Kunal Agrawal*, Ba-Thinh Tran-Le, Jayanth Merakanapalli, Soham Chousalkar, Vatsa Patel, Trung-Nghia Le, Tam V. Nguyen
International Symposium on Information and Communication Technology (SoICT), 2025 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
81. Research Paper Quality Recognition Through Textual Feature Analysis
Saikiran Korla*, Sadwik Gummadavelli*, Trung-Nghia Le, Minh-Triet Tran, Tam V. Nguyen
International Symposium on Information and Communication Technology (SoICT), 2025
[PDF] [Poster] [Presentation] [Demo] [Project Page]
80. MultiPointing: Supporting Multiple Users' Pointing in Hybrid Meetings
Dinh-Thuan Duong-Le, Duy-Nam Ly, Trung-Nghia Le, Vinh-Tiep Nguyen, Khanh-Duy Le
Australian Conference on Human-Computer Interaction (OzCHI), 2025 (B Rank) (Late Breaking Work)
[PDF]
79. OpenEvents V1: Large-Scale Benchmark Dataset for Multimodal Event Grounding
Hieu Nguyen, Phuc-Tan Nguyen, Thien-Phuc Tran, Minh-Quang Nguyen, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Dataset)
[PDF]
78. Event-Enriched Image Analysis Grand Challenge at ACM Multimedia 2025
Thien-Phuc Tran*, Minh-Quang Nguyen*, Minh-Triet Tran, Tam V. Nguyen, Trong-Le Do, Duy-Nam Ly, Viet-Tham Huynh, Khanh-Duy Le, Mai-Khiem Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Challenge)
[PDF]
77. Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification
Y Hop Nguyen, Doan Anh Phan Huu, Trung Thai Tran, Nhat Nam Mai, Van Toi Giap, Thao Thi Phuong Dao, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Challenge)
[PDF]
76. ReCap: Event-Aware Image Captioning with Article Retrieval and Semantic Gaussian Normalization
Thinh-Phuc Nguyen, Thanh-Hai Nguyen, Gia-Huy Dinh, Lam-Huy Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Challenge)
[PDF]
75. EVENT-Retriever: Event-Aware Multimodal Image Retrieval for Realistic Captions
Dinh-Khoi Vo, Van-Loc Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Challenge)
[PDF]
74. Streamlining Virtual KOL Generation Through Modular Generative AI Architecture
Tan-Hiep To, Duy-Khang Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Demo)
[PDF]
73. Advancing Fashion Design Through Intelligent Sketchpad Studio
Nhu-Binh Nguyen-Truc*, Nhu-Vinh Hoang*, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM International Conference on Multimedia (ACM MM), 2025 (A* Rank) (Demo)
[PDF]
72. Learning Disentangled Stain and Structural Representations for Semi-Supervised Histopathology Segmentation
Ha-Hieu Pham, Nguyen Lan Vi Vu, Thanh-Huy Nguyen, Ulas Bagci, Min Xu, Trung-Nghia Le, Huy-Hieu Pham
MICCAI Workshop on Computational Pathology with Multimodal Data (COMPAYL), 2025
[PDF] [Project Page]
71. SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification
Dinh-Khoi Vo*, Van-Loc Nguyen*, Minh-Triet Tran, Trung-Nghia Le
International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2025
[PDF]
70. GenFlow: Interactive Modular System for Image Generation
Duc-Hung Nguyen*, Huu-Phuc Huynh*, Minh-Triet Tran, Trung-Nghia Le
International Conference on Content-Based Multimedia Indexing (CBMI), 2025
[PDF]
69. Automated Image Recognition Framework
Quang-Binh Nguyen*, Trong-Vu Hoang*, Do Tran Ngoc, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
International Conference on Computational Collective Intelligence (ICCCI), 2025 (B Rank)
[PDF] [Presentation] [Demo] [Project Page]
68. Chat2Edit: A Prompt-based Image Editor with Live Feedback and Parameter Recommendation
Tin-Nghia Le, Phuong-Dao Duong Dinh, Quang Huy Che, Duc-Vu Nguyen, Vinh-Tiep Nguyen, Tam V. Nguyen, Trung-Nghia Le, Minh-Triet Tran
International Conference on Computational Collective Intelligence (ICCCI), 2025 (B Rank)
[PDF] [Presentation] [Demo] [Project Page]
67. FaR: Enhancing Multi-Concept Text-to-Image Diffusion via Concept Fusion and Localized Refinement
Gia-Nghia Tran, Quang-Huy Che, Trong-Tai Dam Vu, Bich-Nga Pham, Vinh-Tiep Nguyen, Trung-Nghia Le, Minh-Triet Tran
International Conference on Computational Collective Intelligence (ICCCI), 2025 (B Rank)
[PDF] [Presentation] [Demo] [Project Page]
66. CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation
Minh-Quan Le, Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Thanh-Toan Do
Winter Conference on Applications of Computer Vision (WACV), 2025 (A Rank)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
65. Language-Guided Video Object Segmentation
Minh Duy Phan, Minh Huan Le, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
64. VisChronos: Revolutionizing Image Captioning Through Real-Life Events
Phuc-Tan Nguyen*, Hieu Nguyen*, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page] [Dataset]
63. EPEdit: Redefining Image Editing with Generative AI and User-Centric Design
Hoang-Phuc Nguyen*, Dinh-Khoi Vo*, Trong-Le Do, Hai-Dang Nguyen, Tan-Cong Nguyen, Vinh-Tiep Nguyen, Tam V. Nguyen, Khanh-Duy Le, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
62. MythraGen: Two-Stage Retrieval Augmented Art Generation Framework
Quang-Khai Le*, Cong-Long Nguyen*, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
61. KidRisk: Benchmark Dataset for Children Dangerous Action Recognition
Minh-Kha Nguyen*, Trung-Hieu Do*, Kim Anh Phung, Thao Thi Phuong Dao, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
60. DanceDuo: Bridging Human Movement and AI Choreography
Gia-Cat Bui-Le, Tuong-Vy Truong-Thuy, Hai-Dang Nguyen, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
59. Budget-Aware Keyboardless Interaction
Quang-Thang Nguyen*, Gia-Phuc Song-Dong*, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024 (Oral)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
58. Decoding Deepfakes: Caption Guided Learning for Robust Deepfake Detection
Y-Hop Nguyen, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024
[PDF] [Poster] [Presentation] [Demo] [Project Page]
57. Minimalist Preprocessing Approach for Image Synthesis Detection
Hoai-Danh Vo, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024
[PDF] [Poster] [Presentation] [Demo] [Project Page]
56. Hybrid Compression: Integrating Pruning and Quantization for Optimized Neural Networks
Minh-Loi Nguyen*, Long-Bao Nguyen*, Van-Hieu Huynh*, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024
[PDF] [Poster] [Presentation] [Demo] [Project Page]
55. Motion Analysis in Static Images
Kunal Agrawal, Vatsa Patel, Reema Tharra, Trung-Nghia Le, Minh-Triet Tran, Tam V. Nguyen
International Symposium on Information and Communication Technology (SoICT), 2024
[PDF] [Poster] [Presentation] [Demo] [Project Page]
54. AI-Generated Image Recognition via Fusion of CNNs and Vision Transformers
Xuan-Bach Mai, Hoang-Tung Vu, Hoang-Minh Nguyen-Huu, Quoc-Nghia Nguyen, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2024
[PDF] [Poster] [Presentation] [Demo] [Project Page]
53. Rethinking Sampling for Music-Driven Long-Term Dance Generation
Tuong-Vy Truong-Thuy, Gia-Cat Bui-Le, Hai-Dang Nguyen, Trung-Nghia Le
Asian Conference on Computer Vision (ACCV), 2024 (B Rank)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
52. CrossPAR: Enhancing Pedestrian Attribute Recognition with Vision-Language Fusion and Human-Centric Pre-training
Bach-Hoang Ngo, Si-Tri Ngo, Phu-Duc Le, Quang-Minh Phan, Minh-Triet Tran, Trung-Nghia Le
Asian Conference on Computer Vision (ACCV), 2024 (B Rank)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
51. Immersive Spatiotemporal Travel in Virtual Reality
Thanh Ngoc-Dat Tran, Viet-Tham Huynh, Poojitha Moganti, Trung-Nghia Le, Minh-Triet Tran, Tam V. Nguyen
International Symposium on Mixed and Augmented Reality (ISMAR), 2024 (A* Rank) (Poster)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
50. Urban Traffic Planning Simulation with Time and Weather Dynamics
Tam V. Nguyen, Thanh Ngoc-Dat Tran, Viet-Tham Huynh, Vatsa S Patel, Umang Jai, Mai-Khiem Tran, Trung-Nghia Le, Minh-Triet Tran
International Symposium on Mixed and Augmented Reality (ISMAR), 2024 (A* Rank) (Poster)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
49. Synthetic Is All You Need For Semantic Segmentation
Minh-Tuan Huynh*, Ngoc-Do Tran*, Minh-Triet Tran, Trung-Nghia Le
SyntaGen Workshop, CVPR, 2024 (First Prize)
[Invited Paper] [Project Page]
48. Rethinking Text-to-Image as Semantic-Aware Data Augmentation for Indoor Scene Recognition
Trong-Vu Hoang, Quang-Binh Nguyen, Dinh-Khoi Vo, Hoai-Danh Vo, Minh-Triet Tran, Trung-Nghia Le
International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
[PDF]
47. Evaluation of Image Matching for Art Skills Assessment
Asaad Alghamdi, Michael Poor, Trung-Nghia Le, Tam V. Nguyen
International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
[PDF]
46. Masked Face Recognition on Limited Training Data
Phuoc-Sang Pham, Minh-Kha Nguyen, Minh-Hien Le, Minh-Triet Tran, Trung-Nghia Le
International Conference on Multimedia Analysis and Pattern Recognition (MAPR), 2024
[PDF]
45. iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer
Dinh-Khoi Vo*, Duy-Nam Ly*, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM Conference on Human Factors in Computing Systems (CHI), 2024 (A* Rank) (Late Breaking Work)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
44. ARtVista: Gateway To Empower Anyone Into Artist
Trong-Vu Hoang*, Quang-Binh Nguyen*, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
ACM Conference on Human Factors in Computing Systems (CHI), 2024 (A* Rank) (Late Breaking Work)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
43. PISeg: Polyp Instance Segmentation with Texture Denoising and Adaptive Region
Tan-Cong Nguyen, Kim Anh Phung, Tien-Phat Nguyen, Thao Dao, Cong Nhan Pham, Quang-Thuc Nguyen, Trung-Nghia Le, Ju Shen, Tam V. Nguyen, Minh-Triet Tran
IEEE International Symposium on Biomedical Imaging, 2024 (A Rank)
[PDF] [Project Page]
42. MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran
AAAI Conference on Artificial Intelligence, 2024 (A* Rank, Oral )
[PDF] [Project Page]
41. Medico Multimedia Task at MediaEval 2023: Transparent Tracking of Spermatozoa
Vajira Thambawita, Andrea Storås, Tuan-Luc Huynh, Hai-Dang Nguyen, Minh-Triet Tran, Trung-Nghia Le, Pål Halvorsen, Michael Riegler, Steven Hicks, Thien-Phuc Tran
Multimedia Evaluation Workshop (MediaEval), 2024
[PDF]
40. NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images
Gia-Bao Le*, Van-Tien Nguyen*, Trung-Nghia Le, Minh-Triet Tran
International Conference on Multimedia Modeling (MMM), 2024 (B Rank, Oral)
[PDF] [Project Page]
39. Multi-Branch Network for Imagery Emotion Prediction
Quoc-Bao Ninh, Hai-Chan Nguyen, Triet Huynh, Trung-Nghia Le
International Symposium on Information and Communication Technology (SoICT), 2023
[PDF] [Project Page]
38. Budget-Aware Road Semantic Segmentation in Unseen Foggy Scenes
Tan-Hiep To, Thanh-Nghi Do, Duc-Nghia Ngo, Minh-Triet Tran, Trung-Nghia Le
International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2023
[PDF]
37. Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments
Hieu Nguyen*, Cong-Hoang Ta*, Phuong-Thuy Le-Nguyen*, Minh-Triet Tran, Trung-Nghia Le
International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2023
[PDF]
36. Efficient 3D Brain Tumor Segmentation with Axial-Coronal-Sagittal Embedding
Tuan-Luc Huynh, Thanh-Danh Le, Tam V. Nguyen, Trung-Nghia Le, Minh-Triet Tran
Pacific-Rim Symposium on Image and Video Technology (PSIVT), 2023 (C Rank - Best Paper Award)
[PDF] [Project Page]
35. Cluster-based Video Summarization with Temporal Context Awareness
Hai-Dang Huynh-Lam*, Ngoc-Phuong Ho-Thi*, Minh-Triet Tran, Trung-Nghia Le
Pacific-Rim Symposium on Image and Video Technology (PSIVT), 2023 (C Rank)
[PDF] [Project Page]
34. DM-VTON: Distilled Mobile Real-time Virtual Try-On
Khoi-Nguyen Nguyen-Ngoc, Thanh-Tung Phan-Nguyen, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Mixed and Augmented Reality (ISMAR), 2023 (A* Rank) (Nominated for Best Poster)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
33. VIDES: Virtual Interior Design via Natural Language and Visual Guidance
Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
International Symposium on Mixed and Augmented Reality (ISMAR), 2023 (A* Rank) (Poster)
[PDF] [Poster] [Presentation] [Demo] [Project Page]
32. Analysis of Master Vein Attacks on Finger Vein Recognition Systems
Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen
Winter Conference on Applications of Computer Vision (WACV), 2023 (A Rank)
[PDF]
31. Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently
Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen
Winter Conference on Applications of Computer Vision (WACV), 2023 (A Rank)
[PDF]
30. Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa
Tuan-Luc Huynh, Huu-Hung Nguyen, Xuan-Nhat Hoang, Thao Thi Phuong Dao, Tien-Phat Nguyen, Viet-Tham Huynh, Hai-Dang Nguyen, Trung-Nghia Le, Minh-Triet Tran
Multimedia Evaluation Workshop (MediaEval), 2022
[PDF]
29. Multilingual Communication System with Deaf Individuals Utilizing Natural and Visual Languages
Tuan-Luc Huynh*, Khoi-Nguyen Nguyen-Ngoc*, Chi-Bien Chu*, Minh-Triet Tran, Trung-Nghia Le
International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2022
[PDF]
28. Rethinking Adversarial Examples for Location Privacy Protection
Trung-Nghia Le*, Ta Gu*, Huy H. Nguyen, Isao Echizen
IEEE International Workshop on Information Forensics and Security (WIFS), 2022
[PDF]
27. Public Speaking Simulator with Speech and Audience Feedback
Bao Truong, Trung-Nghia Le, Khanh-Duy Le, Minh-Triet Tran, Tam V. Nguyen
IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2022 (A* Rank) (Poster)
[PDF]
26. GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation
Minh-Quan Le*, Trung-Nghia Le*, Tam V. Nguyen, Isao Echizen, Minh-Triet Tran
CV4Animal Workshop, CVPR 2022 [Invited poster]
[PDF]
25. Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing Ratio
Khanh-Duy Nguyen, Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen
FG4COVID19, 2021
[PDF]
24. OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild
Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
International Conference on Computer Vision (ICCV), 2021 (A* Rank) (Acceptance rate 25.9%)
[PDF] [Presentation] [Project Page]
23. Fashion-Guided Adversarial Attack on Person Segmentation
Marc Treu*, Trung-Nghia Le*, Huy H. Nguyen*, Junichi Yamagishi, Isao Echizen
CVPR Workshop on Media Forensics, 2021 (*Equal Contributions)
[PDF] [Presentation] [Project Page]
22. Interactive Video Object Mask Annotation
Trung-Nghia Le, Tam V. Nguyen, Quoc-Cuong Tran, Lam Nguyen, Trung-Hieu Hoang, Minh-Quan Le, Minh-Triet Tran
AAAI Conference on Artificial Intelligence, 2021 (A* Rank) (Demo)
[PDF] [Presentation] [Poster] [Project Page]
21. CamouFinder: Finding Camouflaged Instances in Images
Trung-Nghia Le, Vuong Nguyen, Cong Le, Tan-Cong Nguyen, Minh-Triet Tran, Tam V. Nguyen
AAAI Conference on Artificial Intelligence, 2021 (A* Rank ) (Demo)
[PDF] [Presentation] [Poster] [Project Page]
20. Text-to-Image Synthesis via Aesthetic Layout
Samah Saeed Baraheem, Trung-Nghia Le, Tam V. Nguyen
International Conference on Multimedia, 2020 (A* Rank) (Demo)
[PDF] [Presentation] [Project Page]
19. Multi-Referenced Guided Instance Segmentation Framework for Semi-supervised Video Instance Segmentation
Minh-Triet Tran, Trung-Hieu Hoang, Tam V. Nguyen, Trung-Nghia Le, E-Ro Nguyen, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Xuan-Nhat Hoang, Minh N. Do
CVPR Workshop on DAVIS Challenge on Video Object Segmentation, 2020 (4th place )
[PDF] [Presentation] [Leader Board] [Project Page]
18. iTASK - Intelligent Traffic Analysis Software Kit
Minh-Triet Tran, Tam V. Nguyen, Trung-Hieu Hoang, Trung-Nghia Le, Khac-Tuan Nguyen, Dat-Thanh Dinh, Thanh-An Nguyen, Hai-Dang Nguyen, Trong-Tung Nguyen, Xuan-Nhat Hoang, Viet-Khoa Vo-Ho, Trong-Le Do, Lam Nguyen, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Trong-Thang Pham, Xuan-Vy Nguyen, E-Ro Nguyen, Quoc-Cuong Tran, Hung Tran, Hieu Dao, Mai-Khiem Tran, Quang-Thuc Nguyen, The-Anh Vu-Le, Tien-Phat Nguyen, Gia-Han Diep, Minh N. Do
CVPR Workshop on AI City Challenge, 2020 (10th place on Track 1 and 26th place on Track 2 and 5th place on Track 4)
[PDF] [Track 1] [Track 2] [Track 4]
17. Attention R-CNN for Accident Detection
Trung-Nghia Le, Akihiro Sugimoto, Shintaro Ono, Hiroshi Kawasaki
Intelligent Vehicles Symposium (IV), 2020 (B Rank)
[PDF] [Presentation] [Project Page]
16. Toward Interactive Self-Annotation For Video Object Bounding Box: Recurrent Self-Learning And Hierarchical Annotation Based Framework
Trung-Nghia Le, Akihiro Sugimoto, Shintaro Ono, Hiroshi Kawasaki
Winter Conference on Applications of Computer Vision (WACV), 2020 (A Rank)
[PDF] [Poster] [Presentation] [Project Page]
15. Guided Instance Segmentation Framework for Semi-Supervised Video Instance Segmentation
Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Vinh Ton-That, Trung-Hieu Hoang, Ngoc-Minh Bui, Trong-Le Do, Quoc-An Luong, Vinh-Tiep Nguyen, Duc Anh Duong, Minh N. Do
CVPR Workshop on DAVIS Challenge on Video Object Segmentation, 2019 (3rd place )
[PDF] [Poster] [Leader Board] [Project Page]
14. Vehicle Re-identification with Learned Representation and Spatial Verification and Abnormality Detection with Multi-Adaptive Vehicle Detectors for Traffic Video Analysis
Khac-Tuan Nguyen, Trung-Hieu Hoang, Minh-Triet Tran, Trung-Nghia Le, Ngoc-Minh Bui, Trong-Le Do, Viet-Khoa Vo-Ho, Quoc-An Luong, Mai-Khiem Tran, Thanh-An Nguyen, Thanh-Dat Truong, Vinh-Tiep Nguyen, Minh N. Do
CVPR Workshop on AI City Challenge, 2019 (8th place on Track 3 and 25th place on Track 2)
[PDF] [Poster] [Track 2] [Track 3]
13. Semantic Instance Meets Salient Object: Study on Video Semantic Salient Instance Segmentation
Trung-Nghia Le, Akihiro Sugimoto
Winter Conference on Applications of Computer Vision (WACV), 2019 (A Rank)
[PDF] [Poster] [Project Page]
12. Context-based Instance Segmentation in Video Sequence
Minh-Triet Tran, Vinh Ton-That, Trung-Nghia Le, Khac-Tuan Nguyen, Tu V. Ninh, Tu-Khiem Le, Vinh-Tiep Nguyen, Tam V. Nguyen, Minh N. Do
CVPR Workshop on DAVIS Challenge on Video Object Segmentation, 2018 (6th place )
[PDF] [Poster] [Leader Board] [Project Page]
11. Balancing Content and Style with Two-Stream FCNs for Style Transfer
Duc Minh Vo, Trung-Nghia Le, Akihiro Sugimoto
Winter Conference on Applications of Computer Vision (WACV), 2018 (A Rank)
[PDF] [Poster] [Project Page]
10. Deeply Supervised 3D Recurrent FCN for Salient Object Detection in Videos
Trung-Nghia Le, Akihiro Sugimoto
British Machine Vision Conference (BMVC), 2017 (A Rank)
[PDF] [Poster] [Results] [Project Page]
9. Instance Re-Identification Flow for Video Object Segmentation
Trung-Nghia Le, Khac-Tuan Nguyen, Manh-Hung Nguyen-Phan, That-Vinh Ton, Toan-Anh Nguyen, Xuan-Son Trinh, Quang-Hieu Dinh, Vinh-Tiep Nguyen, Anh-Duc Duong, Akihiro Sugimoto, Tam V. Nguyen, Minh-Triet Tran
CVPR Workshop on DAVIS Challenge on Video Object Segmentation, 2017 (3rd place )
[PDF] [Leader Board] [Project Page]
8. Spatiotemporal Utilization of Deep Features for Video Saliency Detection
Trung-Nghia Le, Akihiro Sugimoto
ICME Workshop on Deep Learning for Intelligent Multimedia Analytics (DeLIMMA), 2017 (Oral presentation)
6. Essential Keypoints to Enhance Visual Object Recognition with Saliency-based Metrics
Trung-Nghia Le, Yen-Thanh Le, Minh-Triet Tran, Anh-Duc Duong
International Conference on Control, Automation, Robotics and Vision (ICARCV), 2014 (A Rank) (Oral presentation)
[PDF] [Project Page]
5. Applying Saliency-based Region of Interest Detection in Developing a Collaborative Active Learning System with Augmented Reality
Trung-Nghia Le, Yen-Thanh Le, Minh-Triet Tran
International Conference on Human-Computer Interaction (HCII), 2014
[PDF] [Project Page]
4. Applying Fast Planar Object Detection in Multimedia Augmentation for Products with Mobile Devices
Quoc-Minh Bui, Trung-Nghia Le, Vinh-Tiep Nguyen, Minh-Triet Tran, Anh-Duc Duong
International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2012
[PDF]
3. Augmented Media for Traditional Magazines
Vinh-Tiep Nguyen, Trung-Nghia Le, Quoc-Minh Bui, Minh-Triet Tran, Anh-Duc Duong
International Symposium on Information and Communication Technology (SoICT), 2012
[PDF] [Project Page]
2. Smart Shopping Assistant: A Multimedia and Social Media Augmented System With Mobile Devices to Enhance Customers' Experience and Interaction
Vinh-Tiep Nguyen, Trung-Nghia Le, Quoc-Minh Bui, Minh-Triet Tran, Anh-Duc Duong
Pacific Asia Conference on Information Systems (PACIS), 2012 (A Rank) (Oral presentation)
[PDF] [Project Page]
1. Applying Virtual Reality for In-Door Jogging
Trung-Nghia Le, Quoc-Minh Bui, Vinh-Tiep Nguyen, Minh-Triet Tran, Anh-Duc Duong
International Conference on Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2012
[PDF]
3. Comprehensive Analysis of AI-Synthetic Image Detection Architectures
Thien-Hoa Hoang-Don, Tien-Dat Nguyen, Nam-Anh Nguyen, Trung-Nghia Le
National Conference on Fundamental and Applied IT Research (FAIR), 2025
[PDF]
2. Learning-Based Semi-Automatic Annotation and Accident Detection from Driving Video (in Japanese)
Trung-Nghia Le, Shintaro Ono, Akihiro Sugimoto, Hiroshi Kawasaki
18th ITS symposium, Japan, 2020
1. Instance Segmentation in Video with Human-Pose Guidance and Data Augmentation (in Vietnamese)
Minh-Triet Tran, Tu V. Ninh, Tu-Khiem Le, Vinh Ton-That, Khac-Tuan Nguyen, Trung-Nghia Le, Tam V. Nguyen
Scientific Conference of University of Science, VNU-HCM, Vietnam, 2018
[Abstract] [Project Page]
GenKOL: Modular Generative AI Framework For Scalable Virtual KOL Generation
Tan-Hiep To, Duy-Khang Nguyen, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2509.14927
[PDF]
KiseKloset: Comprehensive System For Outfit Retrieval, Recommendation, And Try-On
Thanh-Tung Phan-Nguyen, Khoi-Nguyen Nguyen-Ngoc, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.234
[PDF]
Interactive Interface For Semantic Segmentation Dataset Synthesis
Ngoc-Do Tran, Minh-Tuan Huynh, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.23470
[PDF]
PrefPaint: Enhancing Image Inpainting through Expert Human Feedback
Duy-Bao Bui, Hoang-Khang Nguyen, Trung-Nghia Le
Arxiv Pre-print: 2506.21834
[PDF]
TaleForge: Interactive Multimodal System for Personalized Story Creation
Minh-Loi Nguyen, Quang-Khai Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.21832
[PDF]
VisionGuard: Synergistic Framework for Helmet Violation Detection
Thinh-Phuc Nguyen*, Thanh-Hai Nguyen*, Gia-Huy Dinh*, Lam-Huy Nguyen*, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.21005
[PDF]
Shape2Animal: Creative Animal Generation from Natural Silhouettes
Quoc-Duy Tran, Anh-Tuan Vo, Dinh-Khoi Vo, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.20616
[PDF] [Project Page]
ShowFlow: From Robust Single Concept to Condition-Free Multi-Concept Generation
Trong-Vu Hoang, Quang-Binh Nguyen, Thanh-Toan Do, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.18493
[PDF]
CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing
Dinh-Khoi Vo, Thanh-Toan Do, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Arxiv Pre-print: 2506.18438
[PDF]
Region-Based Multiscale Spatiotemporal Saliency for Video
Trung-Nghia Le, Akihiro Sugimoto
Arxiv Pre-print: 1708.01589
[PDF]