Anh Nguyen, Quang D. Tran, "Autonomous Navigation with Mobile Robots using Deep Learning and the Robot Operating System", ROSBook 2021. [project site]
Anh Nguyen, Erman Tjiputra, Quang D. Tran. "BeetleBot: A Multi-Purpose AI-Driven Mobile Robot for Realistic Environments". UKRAS 2020 Conference: “Robots into the real world” Proceedings, 2020. [pdf]
Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran, "Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network", IROS 2020. [paper] [code]
Binh X. Nguyen, Binh D. Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do. "Deep Metric Learning Meets Deep Clustering: A Novel Unsupervised Approach for Feature Embedding". BMVC 2020 [pdf] [code] [video]
Tuong Do, Binh X. Nguyen, Huy Tran, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do. "Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering". ECCVW 2020 [code] [pdf]
Binh D. Nguyen, Thanh-Toan Do, Binh X. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran. "Overcoming Data Limitation in Medical Visual Question Answering" . The 22nd International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2019. [code]
Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran. "Compact Trilinear Interaction for Visual Question Answering". International Conf. on Computer Vision (ICCV) 2019. [code]
Anh Nguyen, Quang D. Tran, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis. "Object Captioning and Retrieval with Natural Language". International Conf. on Computer Vision Workshop (ICCVW) 2019.
Thanh-Toan Do, Quang D. Tran, Ngai-Man Cheung. "FAemb: a function approximation-based embedding method for image retrieval". IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, USA. [pdf]
Quang D. Tran, Ngoc Q. Ly, “Sparse Spatio-Temporal Representation of Joint Shape-Motion Cues for Human Action Recognition in Depth Sequences”. IEEE International Conference on Computing & Communication Technologies - Research, Innovation, and Vision for Future, RIVF 2013. [pdf] (Best Runner-up Paper Award)
Quang D. Tran, Ngoc Q. Ly, “An Effective Fusion Scheme of Spatio-Temporal Fea- tures for Human Action Recognition in RGB-D Video”. IEEE ICCAIS 2013. [pdf]
(June 2019)
Tuong Do, Huy Tran, Thanh-Toan Do, Erman Tjiputra, Quang D. Tran. "Interaction Learning with Question-type Awareness for Visual Question Answering". CVPR VQA Challenge 2019. [poster] [website]
A. Nguyen, T.-T. Do, D. G. Caldwell, N. G. Tsagarakis. "Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks". CVPRW 2019. (presented for Anh Nguyen) [video]
(Mar 2019)
Quang D. Tran, Erman Tjiputra. "Surpassing State-of-the-Art VQA with Deep Learning Optimization Techniques and Limited GPU Resources". NVIDIA GTC 2019, Sillicon Valley. (50-mins talk) [video] [pdf]
Tuong Do, Vuong Pham, Binh Nguyen, Huy Tran, Vy Pham, Erman Tjiputra, Quang D. Tran. "Question-type Awareness for Visual Question Answering under Limited GPU Resources". NVIDIA GTC 2019 - Poster Session [poster]