Multi-omics Data Analysis and Interpretation
Data Science/Machine Learning in Biomedicine
High Performance Computing in Genomics
Genetic Testing for Disease Risk Prediction, Drug-Drug-Gene Interaction
Platform for Data Science and AI in Biomedicine
DNA profiling for Human Identification
Chief Scientific & Technology Officer & Co-Founder, GeneStory JSC (2022–present)
Director of Precision Medicine Division, VinUni Big Data Research Institute (2025-present)
Director of Biomedical Informatics Center, Vingroup Big Data Institute (2019–present)
Senior Bioinformatics Scientist, University of Chicago (2017–2019)
Postdoctoral Fellow, MD Anderson Cancer Center (2016–2017)
VN1K: 1000 Vietnamese Genomes Project
MASH: Management, Analysis, Sharing, and Harmonization of Large-scale Biomedical Data
VGR: Construction of Vietnamese-specific Pangenome Reference
VGC: Design of Vietnamese-specific Genotyping Chip
VGP: Vietnamese Genome-based Prediction of Disease Risks
ADR: Preventing Adverse Drug Reactions using Pharmacogenomics
AMR: Fighting Antimicrobial Resistance using Sequencing Data and Big Data Analytics
➡ Detail on VinGen
VN1K: a genome graph-based and function-driven multi-omics and phenomics resource for the Vietnamese population. bioRxiv, 2025
Integrating polygenic and transcriptional risk scores improves risk prediction of nine common diseases in the underrepresented Vietnamese population. medRxiv, 2025
A study of genetic variants associated with skin traits in the Vietnamese population. BMC Genomics, 2024
Pasa: leveraging population pangenome graph to scaffold prokaryote genome assemblies, Nucleic Acids Research, 2024
Self-designed single-nucleotide polymorphism chip and method of computing polygenicrisk score for given populations using self-designed single-nucleotide polymorphism chip. US Patent, 2023
A rapid and reference-free imputation method for low-cost genotyping platforms. Scientific Reports, 2023
Assessing polygenic risk score models for applications in populations with under-represented genomics data: an example of Vietnam, Briefings in Bioinformatics, 2022
A comprehensive evaluation of polygenic score and genotype imputation performances of human SNP arrays in diverse populations. Scientific Reports, 2022
➡ Full list on Google Scholar
• Reviews:
• Journal reviews: Nature Methods, Nature Communications, Bioinformatics, Briefings in Bioinformatics, Scientific Reports, Frontiers in Oncology, BMC Bioinformatics, BMC Medical Genomics.
• Conference reviews: RECOMB, RECOMB-CBB, AICoB, BIBM, KSE.
• Panel reviews: Vingroup Innovation Foundation (VINIF), National Foundation for Science and Technology Development (NAFOSTED).
• Program committee: IEEE International Conference on Knowledge and Systems Engineering (KSE), Genomic Medicine Conference (GMC).
• Invited talks:
• Integrated Multi-omics Risk Scores for Common Diseases in the Vietnamese Population. Thermo Fisher SwiftArrayStudio VVIP Launch Event, Singapore, 02/2026.
• Integrating polygenic and transcriptional risk scores improves risk prediction of nine common diseases in the Vietnamese population. KIT-ASEAN Joint Seminar, Hanoi, Vietnam, 12/2025.
• AI & Big Data for Vietnam Genome Program. AVSE Global-Summit, Hanoi, Vietnam, 07/2025.
• Vietnam Genome Program: Current Results & Challenges. PacBio PRISM, Danang, Vietnam, 04/2025.
• Mathematical Methods for Big Data Analysis in Vietnam Genome Program. VIASM Spring School on recent advances in bioinformatics and statistical genetics, Hanoi, Vietnam, 03/2025.
• Research, Development, and Applications of AI in Medicine in Vietnam. Vietnam AI Forum, Hanoi, Vietnam, 12/2024.
• Functional-based analysis of multi-omics data in Vietnam Genome Program. Vietnam School of Biology, Quy Nhon, Vietnam, 12/2024.
• New Technologies in Medicine. MOST KC Conference, Ho Chi Minh City, Vietnam, 09/2024.
• Research and Development Activities at GeneStory: Current Status and Vision. UK-VN multi-disciplinary network in healthcare innovation, Hanoi, Vietnam, 03/2024.
• Decoding Human Genomes at Scale: Data Science and Machine Learning Approaches. International Conference on Health Science and Technology, Hanoi, Vietnam, 12/2023.
• Vietnamese Genomes at Population Scale: Vision and Solutions. National Conference of Biotechnology, Hanoi, Vietnam, 10/2023.
• Data Science in Life Sciences: The 1000 Vietnamese Genomes Project. Japan-Vietnam Bilateral Symposium on Science and Engineering for Space and the Earth, Hanoi, Vietnam, 10/2023.
• Polygenic Risk Score Estimation using SNP Arrays in Diverse Populations Lessons from Vietnam Genome Program. ThermoFisher Scientific Predictive Genomics Symposium, Bangkok, Thailand, 09/2023.
• Machine Learning for the Understanding of the Human Genome. IMH-VAST Spring Research School, Hanoi, Vietnam, 02/2023.
• Sequencing Vietnamese Genomes at Scale: Opportunities & Challenges. IBSG Seminar, Hanoi, Vietnam, 07/2022.
• Decoding DNA how graph theory can help? Intl. Day of Math Mathematics Unites, Hanoi, Vietnam, 03/2022.
• Data Science Approaches to Decoding Genomes at Scale. HUST-SoICT & VinBigdata Workshop on DS&AI, Hanoi, Vietnam, 03/ 2022.
• Population-specific Risk Prediction Models: Experiences from the 1000 Vietnamese Genomes Project. AIxImpact Southeast Asia Conference, 02/2022.
• Genomic Sequencing in Clinical Practice. Vinmec Conference in Genetics and Cell Technology, Hanoi, Vietnam, 01/2022.
• Sequencing Vietnamese Genomes at Scale: Towards Precision Medicine in Viet Nam. Human Cell Atlas Asia Meeting, 11/2021.
• Vietnamese Genome Sequencing in the Era of Data Science and Artificial Intelligence. Vietnam Science & Technology Day, Hanoi, Vietnam, 06/2021.
• Big Data for Precision Medicine. FAIR Conference, Nha Trang University, Nha Trang, Vietnam, 08/2020.
• Genomic Big Data: What can we do? National University of Civil Engineering, Hanoi, Vietnam, 11/2019.
• Mathematical Methods for the Understanding of the Human Genome. International Graduate Summer School in Mathematics, Hanoi, Vietnam, 08/2019.
• Mathematical Methods for the Understanding of the Human Genome. VN-USA Joint Mathematical Meeting, Quy Nhon, Vietnam, 06/2019.
• Predicting Response to Cancer Immunotherapy: Big Data Approaches. Genomic Medicine Conference, Hanoi, Vietnam, 06/2019.
• Towards a Software Platform for Big Data in Biomedical Research. Vingroup Institute of Big Data, Hanoi, Vietnam, 12/2018.
• NCI’s Genomic Data Commons: Research and Development, Vinmec Research Institute of Stem Cell and Gene Technology, Hanoi, Vietnam, 09/2018.
• Genomic Variant Analysis for Computational Immunogenomics, Vinmec Research Institute of Stem Cell and Gene Technology, Hanoi, Vietnam, 07/2017.
• Neoantigen Predictions from Splice-creating Mutations. TCGA PanCanAtlas Fusion/Splicing Working Group, USA, 10/2017.
• From Genomic Variant Analysis to Computational Immunogenomics. The University of Chicago, Chicago, USA, 09/2017.
• Genomic Variant Analysis for Cancer Immunogenomics. The New York Genome Center, New York, USA, 09/2017.
• Neoantigen Predictions from InDels, TCGA PanCanAtlas Immune Response Working Group, USA, 04/2017.
• Computational Methods for Genomic Variant and Gene Expression Analysis, The University of Texas MD Anderson Cancer Center, Houston, USA, 06/2016.
• Oral/poster presentations:
• Leveraging Known Genomic Variants to Improve Variant Detection. ISMB, poster presentation, 07/2019.
• Somatic Variant Detection from Tumor-only Samples. ISMB, poster presentation, 07/2018.
• Improving Variant Calling by Incorporating Known Genetic Variants into Read Alignment, MCBIOS, poster presentation, 03/2015.
• Predicting True Patterns of Gene Response to Treatments in Expression Analysis using Pairwise Comparisons. MCBIOS, selected oral presentation, 03/2014.
• Using Partially Ordered Sets to Represent and Predict True Patterns of Gene Response to Treatments, UT-ORNL-KBRIN Summit, selected oral presentation, 03/2013.
• Predicting Possible Directed-graph Patterns of Gene Expressions in Studies Involving Multiple Treatments, ACM-BCB, poster presentation, 10/2012.
• Pattern Analysis: A Web-based Tool for Analyzing Response Patterns in Low-replication, Many-treatment Gene Expression Data, ACM-BCB, poster presentation, 10/2012.