Publications

COPYRIGHT NOTICE: All the documents on this server have been submitted by their authors to scholarly journals or conferences as indicated. The manuscripts are put on-line to facilitate the purpose of non-commercial dissemination of scientific work. These manuscripts are copyrighted by the authors or the journals in which they were published. You may copy a manuscript for scholarly, non-commercial purposes, such as research or instruction, provided that you agree on these copyrights.

Conference Publications

2021

Anh Nguyen, Quang D. Tran, "Autonomous Navigation with Mobile Robots using Deep Learning and the Robot Operating System", ROSBook 2021. [project site]

2020

Anh Nguyen, Erman Tjiputra, Quang D. Tran. "BeetleBot: A Multi-Purpose AI-Driven Mobile Robot for Realistic Environments". UKRAS 2020 Conference: “Robots into the real world” Proceedings, 2020. [pdf]
Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran, "Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network", IROS 2020. [paper] [code]
Binh X. Nguyen, Binh D. Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do. "Deep Metric Learning Meets Deep Clustering: A Novel Unsupervised Approach for Feature Embedding". BMVC 2020 [pdf] [code] [video]
Tuong Do, Binh X. Nguyen, Huy Tran, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do. "Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering". ECCVW 2020 [code] [pdf]

2019

Binh D. Nguyen, Thanh-Toan Do, Binh X. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran. "Overcoming Data Limitation in Medical Visual Question Answering" . The 22nd International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2019. [code]
Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran. "Compact Trilinear Interaction for Visual Question Answering". International Conf. on Computer Vision (ICCV) 2019. [code]
Anh Nguyen, Quang D. Tran, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis. "Object Captioning and Retrieval with Natural Language". International Conf. on Computer Vision Workshop (ICCVW) 2019.

2015

Thanh-Toan Do, Quang D. Tran, Ngai-Man Cheung. "FAemb: a function approximation-based embedding method for image retrieval". IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, USA. [pdf]

2013

Quang D. Tran, Ngoc Q. Ly, “Sparse Spatio-Temporal Representation of Joint Shape-Motion Cues for Human Action Recognition in Depth Sequences”. IEEE International Conference on Computing & Communication Technologies - Research, Innovation, and Vision for Future, RIVF 2013. [pdf] (Best Runner-up Paper Award)
Quang D. Tran, Ngoc Q. Ly, “An Effective Fusion Scheme of Spatio-Temporal Fea- tures for Human Action Recognition in RGB-D Video”. IEEE ICCAIS 2013. [pdf]

Technical talks & Posters

CVPR 2019

(June 2019)

Tuong Do, Huy Tran, Thanh-Toan Do, Erman Tjiputra, Quang D. Tran. "Interaction Learning with Question-type Awareness for Visual Question Answering". CVPR VQA Challenge 2019. [poster] [website]
A. Nguyen, T.-T. Do, D. G. Caldwell, N. G. Tsagarakis. "Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks". CVPRW 2019. (presented for Anh Nguyen) [video]

NVIDIA GTC 2019

(Mar 2019)

Quang D. Tran, Erman Tjiputra. "Surpassing State-of-the-Art VQA with Deep Learning Optimization Techniques and Limited GPU Resources". NVIDIA GTC 2019, Sillicon Valley. (50-mins talk) [video] [pdf]
Tuong Do, Vuong Pham, Binh Nguyen, Huy Tran, Vy Pham, Erman Tjiputra, Quang D. Tran. "Question-type Awareness for Visual Question Answering under Limited GPU Resources". NVIDIA GTC 2019 - Poster Session [poster]

Google Sites

Report abuse