Towards Robust Identity Inference Under Surveillance Environments: From Still Images to Video Sequences Thesis | URL
Yongkang Wong PhD Thesis, University of Queensland, Australia, 2012.
tags: surveillance, identity inference, local features, bag of words, sparse representation, quality assessment, face synthesis.
Handling Privacy Regulation in Video Surveillance Systems. Kajal Kansal, Yongkang Wong, Wei Jian Peh, Hui Lam Ong, Mohan Kankanhalli. Technical Report, Scholarbank@NUS Repository, 27 July 2023. Link
Bridging the Intent Gap: Knowledge-Enhanced Visual Generation. Yi Cheng, Ziwei Xu, Dongyun Lin, Harry Cheng, Yongkang Wong, Ying Sun, Joo Hwee Lim, Mohan Kankanhalli. arXiv:2405.12538, 2024. arXiv
ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens. Yangyang Guo, Haoyu Zhang, Liqiang Nie, Yongkang Wong, Mohan Kankanhalli. arXiv:2309.16738, 2023. arXiv
(Machine Learning, Artificial Intelligence, Computer Vision and Pattern Recognition, Multimedia)
[C56] Improving Video Moment Retrieval via LLM Augmented Nested Adapter. Arkaprabha Bhandari, Yongkang Wong, Kajal Kansal, Jianquan Liu, Mohan Kankanhalli. Accepted for publication in The International Conference on Multimedia Information Processing and Retrieval (MIPR), 2025.
[J40] STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting. Zenghao Chai, Chen Tang, Yongkang Wong, Mohan Kankanhalli. Accepted for publication in IEEE Transactions on Visualization and Computer Graphics, 2025. arXiv | Project | GitHub | doi
[J39] Learning to Predict Gradients for Semi-Supervised Continual Learning. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Volume 36, Issue 2, Pages 2593-2607. February 2025. arXiv | doi
[J38] Implications of Privacy Regulations on Video Surveillance Systems. Kajal Kansal, Yongkang Wong, Mohan Kankanhalli. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2024 [Available Online]. doi
[C55] TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment. Wei Li, Hehe Fan, Yongkang Wong, Yi Yang, Mohan Kankanhalli. Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. Spotlight Poster. arXiv | GitHub | poster | paper
[C54] MCM: Multi-condition Motion Synthesis Framework. Zeyu Ling, Bo Han, Yongkang Wong, Han Lin, Mohan Kankanhalli, Weidong Geng. International Joint Conference on Artificial Intelligence (IJCAI), pages 1083-1091, 2024. arXiv | code
[C53] Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning. Wei Li, Hehe Fan, Yongkang Wong, Yi Yang, Mohan Kankanhalli. International Conference on Machine Learning (ICML), Article No. 1109, Pages 27732-27751, 2024. Poster | paper
[C52] Suffix Injection and Projected Gradient Descent Can Easily Fool an MLLM. Yangyang Guo, Ziwei Xu, Xiliu Xu, Yongkang Wong, Liqiang Nie, Mohan Kankanhalli. International Conference on Machine Learning (ICML) TiFA Workshop MLLM Attack Challenge, 2024. Winner | paper
[J37] KF-VTON: Keypoints-Driven Flow based Virtual Try-On. Zizhao Wu, Siyu Liu, Peioyan Lu, Ping Yang, Yongkang Wong, Xiaoling Gu, Mohan Kankanhalli. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 9, Article No.: 293, Pages 1-23. 23 September 2024. doi
[J36] Multi2Human: Controllable Human Image Generation with Multimodal Controls. Xiaoling Gu, Shengwenzhuo Xu, Yongkang Wong, Zizhao Wu, Jun Yu, Jianping Fan, Mohan Kankanhalli. Neurocomputing, Volume 587, pages 127682. 28 June 2024. doi | Early Free Access
[J35] Recurrent Appearance Flow for Occlusion-Free Virtual Try-On. Xiaoling Gu, Junkai Zhu, Yongkang Wong, Zizhao Wu, Jun Yu, Jianping Fan, Mohan Kankanhalli. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 8, Article No.: 239, Pages 1-17. 12 June 2024. doi
[J34] Rejecting Unknown Gestures based on Surface-Electromyography Using Variational Autoencoder. Qingfeng Dai, Yongkang Wong, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. IEEE Transactions on Neural Systems and Rehabilitation Engineering (TNSRE), Volume 32, pages 750-758, 30 January 2024. doi
[C51] Finetuning Text-to-Image Diffusion Models for Fairness. Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli. International Conference on Learning Representations (ICLR), 2024. Oral. paper | arXiv PrePrint | poster
[C50] Privacy-Enhancing Person Re-Identification Framework -- A Dual-Stage Approach. Kajal Kansal, Yongkang Wong, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 8543-8552, 2024. Poster | Video | paper
[J33] Unsupervised Domain Adaptation by Causal Learning for Biometric Signal based HCI. Qingfeng Dai, Yongkang Wong, Guofei Sun, Yanwei Wang, Zhou Zhou, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 2, Article No. 49, pages 1-18. 26 September 2023. doi
[J32] PAINT: Photo-realistic Fashion Design Synthesis. Xiaoling Gu, Jie Huang, Yongkang Wong, Jun Yu, Jianping Fan, Pai Peng, Mohan Kankanhalli. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 2, Article No.: 48, pages 1-23. 26 September 2023. doi
[J31] Improved Network and Training Scheme for Cross-trial sEMG-based Gesture Recognition. Qingfeng Dai, Yongkang Wong, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. Bioengineering, Volume 10, Issue 9, pages 1101, 2023. doi
[C49] NarSUM'23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos. Mohan Kankanhalli, Ioannis (Yiannis) Patras, Jianquan Liu, Yongkang Wong, Takahiro Komamizu, Satoshi Yamazaki, Karen Stephen, Kajal Kansal. ACM Multimedia (MM), 2023. doi
[C48] Narrative Graph for Narrative Generation from Long Videos. Rishabh Sheoran, Yongkang Wong, Jianquan Liu, Mohan Kankanhalli. ACM Multimedia (MM) NarSUM Workshop, 2023. Poster | Video | doi
[C47] A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023. Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli. The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) workshop, 2023. arXiv [Winner for UDA for Recognition Track]
[J30] Learning to Minimize the Remainder in Supervised Learning. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Multimedia (TMM), Volume 25, pages 1738-1748, 9 March 2022. arXiv | doi
[J29] Fair Representation: Guaranteeing Approximate Multiple Group Fairness for Unknown Tasks. Xudong Shen, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 45, Issue 1. 1 January 2023, pages 525-538. arXiv | doi
[C46] Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation. Ziwei Xu, Yogesh S Rawat, Yongkang Wong, Mohan Kankanhalli, Mubarak Shah. Annual Conference on Neural Information Processing Systems (NeurIPS), 2022. Project
[C45] Compute to Tell the Tale: Goal-Driven Narrative Generation. Yongkang Wong, Shaojing Fan, Yangyang Guo, Ziwei Xu, Rishabh Sheoran, Karen Stephen, Anusha Bhamidipati, Vivek Barsopia, Jianquan Liu, Mohan Kankanhalli. ACM Multimedia (MM) Brave New Idea Track, 2022. Poster | Video | doi [Best BNI Paper Award]
[C44] A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA. Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan Kankanhalli. ACM Multimedia (MM), 2022. arXiv | doi
[C43] Distance Matters in Human-Object Interaction Detection. Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli. ACM Multimedia (MM), 2022. arXiv | doi
[C42] NarSUM'22: 1st Workshop on User-Centric Narrative Summarization of Long Videos. Mohan Kankanhalli, Jianquan Liu, Yongkang Wong, Karen Stephen, Rishabh Sheoran, Anusha Bhamidipati. ACM Multimedia (MM), 2022. doi
[C41] Chairs Can be Stood On: Overcoming Object Bias in Human-Object Interaction Detection. Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli. European Conference on Computer Vision (ECCV), 2022. arXiv
[J28] Enhanced 3D Shape Reconstruction with Knowledge Graph of Category Concept. Guofei Sun, Yongkang Wong, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 18, Issue 3, 4 March 2022, Article No. 71, pages 1-20. Paper | doi
[J27] Semantic-aware Triplet Loss for Image Classification. Guangzhi Wang, Yangyang Guo, Ziwei Xu, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 25, 26 May 2022, pages 4563-4572. doi
[J26] Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition. Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 24, 2022, pages 3652-3664. arXiv | doi
[C40] Unsupervised Motion Representation Learning with Capsule Autoencoders. Ziwei Xu, Xudong Shen, Yongkang Wong, Mohan Kankanhalli. Annual Conference on Neural Information Processing Systems (NeurIPS), 2021. NeurIPS | arXiv | Code & Dataset release | Slide
[C39] Learning to Predict Trustworthiness with Steep Slope Loss. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. Annual Conference on Neural Information Processing Systems (NeurIPS), 2021. NeurIPS | arXiv | Code release | Slide
[C38] Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions. Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. International Conference on Computer Vision (ICCV), pages 11270-11280, 2021. Paper | Supplementary | Slide | Poster | Video
[J25] Toward Multi-Modal Conditioned Fashion Image Translation. Xiaoling Gu, Jun Yu, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 23, 2021, pages 2361-2371. doi
[J24] Direction Concentration Learning: Enhancing Congruency in Machine Learning. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 43, Issue 6, June 2021, pages 1928-1946. arXiv | Code release | doi
[J23] Scene Graph Inference via Multi-Scale Context Modeling. Ning Xu, An-An Liu, Yongkang Wong, Weizhi Nie, Yu-Ting Su, Mohan Kankanhalli. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Volume 31, Issue 3, March 2021, pages 1031-1041. doi
[C37] $n$-Reference Transfer Learning for Saliency Prediction. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science, Volume 12353, pages 502-519, 2020. arXiv | Code release | doi
[J22] DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning. Guofei Sun, Yongkang Wong, Zhiyong Cheng, Mohan Kankanhalli, Weidong Geng, Xiangdong Li. IEEE Transactions on Multimedia (TMM), Volume 23, 2020, pages 497-509. Code release | doi
[J21] Visual Social Relationship Recognition. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. International Journal of Computer Vision (IJCV), Volume 128, February 2020, Pages 1750-1765. arXiv | doi
[J20] Interact as You Intend: Intention-Driven Human-Object Interaction Detection. Bingjie Xu, Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 22, Issue 6, June 2020, pages 1423-1432. arXiv | doi
[C36] GradMix: Multi-source Transfer across Domains and Tasks. Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 3019-3027, 2020. Paper | Poster | Video | Spotlight | arXiv
[C35] Weakly-Supervised Multi-person Action Recognition in 360$^\circ$ Videos. Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 508-516, 2020. Paper | Supplementary | Video | arXiv | Dataset
[J19] Video Storytelling: Textual Summaries for Events. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 22, Issue 2, February 2020, pages 554-565. arXiv | Dataset | doi
[J18] Unsupervised Online Video Object Segmentation with Motion Property Understanding. Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Image Processing (TIP), Volume 29, 2020, pages 237-249. arXiv | doi
[J17] G-softmax: Improving Intra-class Compactness and Inter-class Separability of Features. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Volume 31, Issue 2, February 2020, pages 685-699. arXiv | doi
[J16] sEMG-based Gesture Recognition with Embedded Virtual Hand Poses and Adversarial Learning. Yu Hu, Yongkang Wong, Qingfeng Dai, Mohan Kankanhalli, Weidong Geng, Xiangdong Li. IEEE Access, Volume 7, Issue 1, December 2019, Pages 104108-104120. doi
[J15] Hierarchical Multi-View Aggregation Network for Sensor-based Human Activity Recognition. Xiheng Zhang, Yongkang Wong, Mohan Kankanhalli, Weidong Geng. PLOS ONE 14(9):e0221390, September 2019. doi
[C34] Unsupervised Domain Adaptation for 3D Human Pose Estimation. Xiheng Zhang, Yongkang Wong, Mohan Kankanhalli, Weidong Geng. ACM Multimedia (MM), pages 926-934, 2019. Slide | Poster | Project Page | doi
[C33] Self-supervised Representation Learning using 360$^\circ$ Data. Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli. ACM Multimedia (MM), pages 998-1006, 2019. Poster | doi
[C32] Human-imperceptible Privacy Protection Against Machines. Zhiqi Shen, Shaojing Fan, Yongkang Wong, Tiantsong Ng, Mohan Kankanhalli. ACM Multimedia (MM), pages 1119-1128, 2019. Project Page | doi [Best Student Paper Award]
[C31] Explainable Video Action Reasoning via Prior Knowledge and State Transitions. Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli. ACM Multimedia (MM), pages 521-529, 2019. doi | Code release
[J14] Surface Electromyography-based Gesture Recognition by Multi-View Deep Learning. Wentao Wei, Qingfeng Dai, Yongkang Wong, Yu Hu, Mohan Kankanhalli, Weidong Geng. IEEE Transactions on Biomedical Engineering (TBME), Volume 66, Issue 10, October 2019, pages 2964-2973. doi
[C30] Learning Controllable Face Generator from Disjoint Dataset. Jing Li, Yongkang Wong, Terence Sim. The International Conference on Computer Analysis of Images and Patterns (CAIP), Volume 11678 of the series Lecture Notes in Computer Science, pages 209-223, 2019. Paper | doi
[J13] Dual-Stream Recurrent Neural Network for Video Captioning. Ning Xu, An-An Liu, Yongkang Wong, Yongdong Zhang, Weizhi Nie, Yu-Ting Su, Mohan Kankanhalli. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Volume 26, Issue 8, August 2019, pages 2482-2493. doi
[C29] Learning to Detect Human-Object Interactions with Knowledge. Bingjie Xu, Yongkang Wong, Junnan Li, Qi Zhao, Mohan Kankanhalli. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2019-2028, 2019. Paper | Poster | Code release
[C28] Learning to Learn from Noisy Labeled Data. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5051-5059, 2019. Paper | arXiv | Poster | Code release
[J12] A Multi-sensor Framework for Personal Presentations Analytics. Tian Gan, Junnan Li, Yongkang Wong, Mohan Kankanhalli. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 15, Issue 2, Article No. 30, June 2019. doi
[J11] Multi-Modal and Multi-Domain Embedding Learning for Fashion Retrieval and Analysis. Xiaoling Gu, Yongkang Wong, Lidan Shou, Pai Peng, Gang Chen, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 21, Issue 6, June 2019, pages 1524-1537. doi
[J10] A Multi-Stream Convolutional Neural Network for sEMG-based Gesture Recognition in Muscle-Computer Interface. Wentao Wei, Yongkang Wong, Yu Du, Yu Hu, Mohan Kankanhalli, Weidong Geng. Pattern Recognition Letters (PRL), Volume 119, March 2019, pages 131-138. doi
[J9] LSTM-based Multi-Label Video Event Detection. An-An Liu, Zhuang Shao, Yongkang Wong, Junnan Li, Yu-Ting Su, Mohan Kankanhalli. Multimedia Tools and Applications, Volume 78, Issue 1, January 2019, pages 677-695. doi
[C27] Unsupervised Learning of View-Invariant Action Representations. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. Annual Conference on Neural Information Processing Systems (NeurIPS), pages 1260-1270, 2018. Paper | arXiv | Poster | Video
[J8] A Fine-grained Spatial-Temporal Attention Model for Video Captioning. An-An Liu, Yirui Qiu, Yongkang Wong, Yu-Ting Su, Mohan Kankanhalli. IEEE Access, Volume 6, November 2018, pages 68463-68471. doi
[J7] A Novel Attention-based Hybrid CNN-RNN Architecture for sEMG-based Gesture Recognition. Yu Hu, Yongkang Wong, Wentao Wei, Yu Du, Mohan Kankanhalli, Weidong Geng. PLOS ONE 13(10): e0206049, October 2018. doi
[J6] Hierarchical & Multimodal Video Captioning: Discovering and Transferring Multimodal Knowledge for Vision to Language. An-An Liu, Ning Xu, Yongkang Wong, Junnan Li, Yu-Ting Su, Mohan Kankanhalli. Computer Vision and Image Understanding (CVIU), Volume 163, October 2017, Pages 113-125. doi
[C26] Dual-Glance Model for Deciphering Social Relationships. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. International Conference on Computer Vision (ICCV), pages 2669-2678, 2017. Dataset | Paper | arXiv | Poster | doi
[C25] Attention Transfer from Web Images for Video Recognition. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. ACM Multimedia (MM), pages 1-9, 2017. Dataset | Paper | arXiv | Poster | doi
[C24] Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding Learning. Xiaoling Gu, Yongkang Wong, Pai Peng, Lidan Shou, Gang Chen, Mohan Kankanhalli. ACM Multimedia (MM), pages 190-198, 2017. Dataset | Paper | Supplementary | Poster | doi
[J5] Benchmarking a Multimodal and Multiview and Interactive Dataset for Human Action Recognition. An-An Liu, Ning Xu, Weizhi Nie, Yu-Ting Su, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Cybernetics, Volume 47, Issue 7, July 2017, Pages 1781-1794. doi
[C23] Semi-supervised Learning for Surface EMG-based Gesture Recognition. Yu Du, Yongkang Wong, Wentao Wei, Yu Hu, Mohan Kankanhalli, Weidong Geng. International Joint Conference on Artificial Intelligence (IJCAI), pages 1624-1630, 2017. Paper | Slide | doi
[C22] Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking. Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan Kankanhalli. IEEE Winter Conference on Applications of Computer Vision (WACV), 2017. Dataset | Paper | arXiv | Poster | Project Page | doi
[C21] Demo Paper: PreSense - An Assistive Presentation Self-Quantification System. Junnan Li, Yongkang Wong, Mohan Kankanhalli. IEEE International Symposium on Multimedia (ISM), Pages 401-402, 2016. Paper | doi
[C20] Multi-stream Deep Learning Framework for Automated Presentation Assessment. Junnan Li, Yongkang Wong, Mohan Kankanhalli. IEEE International Symposium on Multimedia (ISM), Pages 222-225, 2016. Paper | doi
[C19] Towards Protecting Biometric Templates Without Sacrificing Performance. Jing Li, Yongkang Wong, Terence Sim. International Conference on Pattern Recognition (ICPR), 2016. Paper | Poster | doi
[C18] Marker-less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps. Yu Du, Yongkang Wong, Yonghao Liu, Feiling Han, Yilin Gui, Zhen Wang, Mohan Kankanhalli, Weidong Geng. European Conference on Computer Vision (ECCV), Volume 9908 of the series Lecture Notes in Computer Science, Pages 20-36, 2016. Paper | Poster | Code release | DEMO | Project Page | doi
[C17] Multi-sensor Self-Quantification of Presentations. Tian Gan, Yongkang Wong, Bappaditya Mandal, Vijay Chandrasekhar, Mohan Kankanhalli. ACM Multimedia (MM), pages 601-610, 2015. Paper | doi
[C16] Multi-Modal & Multi-View & Interactive Benchmark Dataset for Human Action Recognition. Ning Xu, An-An Liu, Weizhi Nie, Yongkang Wong, Fuwu Li, Yu-Ting Su. ACM Multimedia (MM), pages 1195-1198, 2015. Paper | doi
[C15] Label Consistent Quadratic Surrogate Model for Visual Saliency Prediction. Yan Luo, Yongkang Wong, Qi Zhao. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5060-5069, 2015. Paper | doi
[J4] Human Action Recognition Based on Local Action Attributes. Jing Zhang, Hong Lin, Weizhi Nie, Lekha Chaisorn, Yongkang Wong, Mohan Kankanhalli. Journal of Electrical Engineering & Technology, Volume 10, Issue 3, May 2015. Paper | doi
[J3] Multi-Camera Saliency. Yan Luo, Ming Jiang, Yongkang Wong, Qi Zhao. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 37, Issue 10, 15 January 2015, Pages 2057-2070. Paper | doi
[J2] On Robust Face Recognition via Sparse Encoding: the Good, the Bad, and the Ugly. Yongkang Wong, Mehrtash Harandi, Conrad Sanderson. IET Biometrics, Volume 3, Issue 4, December 2014, Pages 176-189. arXiv | doi
[C14] Discovering Person Identity via Large-Scale Observations. Yongkang Wong, Lekha Chaisorn, Mohan Kankanhalli. Workshop on Human Identification for Surveillance, Asian Conference on Computer Vision (ACCV) Workshops, 2014. Paper | doi
[C13] Recovering Social Interaction Spatial Structure from Multiple First-Person Views. Tian Gan, Yongkang Wong, Bappaditya Mandal, Vijay Chandrasekhar, Liyuan Li, Joo-Hwee Lim, Mohan Kankanhalli. International Workshop on Socially-Aware Multimedia (IWSAM), ACM Multimedia workshops, 2014. Paper | doi
[C12] Scalable Decision-Theoretic Coordination and Control for Real-Time Active Multi-Camera Surveillance. Prabhu Natarajan, Trong Nghia Hoang, Yongkang Wong, Kian Hsiang Low, Mohan Kankanhalli. International Conference on Distributed Computing Systems (ICDCS), 2014 (invited paper). Paper
[C11] View-Invariant Feature Discovering for Multi-Camera Human Action Recognition. Hong Lin, Lekha Chaisorn, Yongkang Wong, An-An Liu, Yu-Ting Su, Mohan Kankanhalli. IEEE International Workshop on Multimedia Signal Processing (MMSP), 2014. Paper
[C10] Multi-View Action Recognition by Cross-Domain Learning. Weizhi Nie, An-An Liu, Jing Yu, Yu-Ting Su, Lekha Chaisorn, Yongkang Wong, Mohan Kankanhalli. IEEE International Workshop on Multimedia Signal Processing (MMSP), 2014. Paper
[J1] Automatic Classification of Human Epithelial Type 2 Cell Indirect Immunofluorescence Images using Cell Pyramid Matching. Arnold Wiliem, Conrad Sanderson, Yongkang Wong, Peter Hobson, Rodney F. Minchin, Brian C. Lovell. Pattern Recognition (PR), Volume 47, Issue 7, July 2014, Pages 2315-2324. Paper | doi
[C9] Video Analytics for Surveillance Camera Networks. Lekha Chaisorn, Yongkang Wong. IEEE International Conference on Networks, 2013 (invited paper). Paper | doi
[C8] Temporal Encoded F-formation System for Social Interaction Detection. Tian Gan, Yongkang Wong, Daqing Zhang, Mohan Kankanhalli. ACM Multimedia (MM), pages 937-946, 2013. Paper | doi
[C7] Classification of Human Epithelial Type 2 Cell Indirect Immunofluoresence Images via Codebook Based Descriptors. Arnold Wiliem, Yongkang Wong, Conrad Sanderson, Peter Hobson, Shaokang Chen, Brian C. Lovell. IEEE Workshop on the Applications of Computer Vision (WACV), pages 95-102, 2013. Paper | doi
[C6] Combined Learning of Salient Local Descriptors and Distance Metrics for Image Set Face Verification. Conrad Sanderson, Mehrtash Harandi, Yongkang Wong, Brian C. Lovell. IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS), pages 294-299, 2012. Paper | doi
[C5] On Robust Biometric Identity Verification via Sparse Encoding of Faces: Holistic vs Local Approaches. Yongkang Wong, Mehrtash Harandi, Conrad Sanderson, Brian C. Lovell. International Joint Conference on Neural Networks (IJCNN), 2012. Paper | doi
[C4] Patch-based Probabilistic Image Quality Assessment for Face Selection and Improved Video-based Face Recognition. Yongkang Wong, Shaokang Chen, Sandra Mau, Conrad Sanderson, Brian C. Lovell. Biometrics Workshop, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2011. Paper | Dataset 1 | Dataset 2 | doi [Highest Impact Award (presented at CVPR 2015 Biometrics Workshop)]
[C3] Dynamic Amelioration of Resolution Mismatches for Local Feature Based Identity Inference. Yongkang Wong, Conrad Sanderson, Sandra Mau, Brian C. Lovell. International Conference on Pattern Recognition (ICPR), Pages 1200-1203, 2010. Paper | doi
[C2] Regression Based Non-Frontal Face Synthesis for Improved Identity Verification. Yongkang Wong, Conrad Sanderson, Brian C. Lovell. International Conference on Computer Analysis of Images and Patterns (CAIP), Pages 116-124, 2009. Paper | doi
[C1] Narrow-Band FM-Multi-Tone FSK Modem: TMS320C6000 Based Test bed Implementation and Performance Analysis. Kandeepan Sithamparanathan, Yongkang Wong. IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), 2006. doi
[C51] Finetuning Text-to-Image Diffusion Models for Fairness. Xudong Shen, Chao Du, Tianyu Pang, Min Lin, Yongkang Wong, Mohan Kankanhalli. International Conference on Learning Representation (ICLR), 2024. Oral. paper | arXiv PrePrint | poster
[C50] Privacy-Enhancing Person Re-Identification Framework -- A Dual-Stage Approach. Kajal Kansal, Yongkang Wong, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 8543-8552, 2024. Poster | Video | paper
[J33] Unsupervised Domain Adaptation by Causal Learning for Biometric Signal based HCI. Qingfeng Dai, Yongkang Wong, Guofei Sun, Yanwei Wang, Zhou Zhou, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. ACM Transactions on Multimedia, Computing Communications and Application (TOMM), Volume 20, Issue 2, Article No. 49, pages 1- 18. 26 September 2023. doi
[J32] PAINT: Photo-realistic Fashion Design Synthesis. Xiaoling Gu, Jie Huang, Yongkang Wong, Jun Yu, Jianping Fan, Pai Peng, Mohan Kankanhalli. Transactions on Multimedia, Computing Communications and Application (TOMM), Volume 20, Issue 2, Article No.: 48, pages 1- 23. 26 September 2023. doi
[J31] Improved Network and Training Scheme for Cross-trial sEMG-based Gesture Recognition. Qingfeng Dai, Yongkang Wong, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. Bioengineering, Volume 10, Issue 9, pages 1101, 2023. doi
[C49] NarSUM'23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos. Mohan Kankanhalli, Ioannis (Yiannis) Patras, Jianquan Liu, Yongkang Wong, Takahiro Komamizu, Satoshi Yamazaki, Karen Stephen, Kajal Kansal. ACM Multimedia (MM), 2023. doi
[C48] Narrative Graph for Narrative Generation from Long Videos. Rishabh Sheoran, Yongkang Wong, Jianquan Liu, Mohan Kankanhalli. ACM Multimedia (MM) NarSUM Workshop, 2023. Poster | Video | doi
[C47] A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023. Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lim, Hehe Fan, Yongkang Wong, Ying sun, Mohan Kankanhalli. The IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR) workshop, 2023. arXiv [Winner for UDA for Recognition Track]
[J30] Learning to Minimize the Remainder in Supervised Learning. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Pattern Multimedia (TMM), Volume 25, pages 1738-1748, 9 March 2022. arXiv | doi
[J29] Fair Representation: Guranteeing Approximate Multiple Group Fairness for Unknown Tasks. Xudong Shen, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 45, Issue 1. 1 January 2023, pages 525-538. arXiv | doi
[C46] Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation. Ziwei Xu, Yogesh S Rawat, Yongkang Wong, Mohan Kankanhalli, Mubarak Shah. Annual Conference on Neural Information Processing Systems (NeurIPS), 2022. Project
[C45] Compute to Tell the Tale: Goal-Driven Narrative Generation. Yongkang Wong, Shaojing Fan, Yangyang Guo, Ziwei Xu, RishabhSheoran, Karen Stephen, Anusha Bhamidipati, Vivek Barsopia, Jianquan Liu, Mohan Kankanhalli. ACM Multimedia (MM) Brave New Idea Track, 2022. Poster | Video | doi [Best BNI Paper Award]
[C44] A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA. Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu,Zhiyong Chen, Mohan Kankanhalli. ACM Multimedia (MM), 2022. arXiv | doi
[C43] Distance Matters in Human-Object Interaction Detection. Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli. ACM Multimedia (MM), 2022. arXiv | doi
[C42] NarSUM'22: 1st Workshop on User-Centric Narrative Summarization of Long Videos. Mohan Kankanhalli, Jianquan Liu, Yongkang Wong, Karen Stephen, RishabhSheoran, Anusha Bhamidipati. ACM Multimedia (MM), 2022. doi
[C41] Chairs Can be Stood On: Overcoming Object Bias in Human-Object Interaction Detection. Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli. European Conference on Computer Vision (ECCV), 2022. arXiv
[J28] Enhanced 3D Shape Reconstruction with Knowledge Graph of Category Concept. Guofei Sun, Yongkang Wong, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. ACM Transactions on Multimedia, Computing Communications and Application (TOMM), Volume 18, Issue 3, 4 March 2022, Article No. 71, pages 1-20. Paper | doi
[J27] Semantic-aware Triplet Loss for Image Classification. Guangzhi Wang, Yangyang Guo, Ziwei Xu, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 25, 26 May 2022, pages 4563-4572. doi
[J26] Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition. Ziwei Xu, Guangzhi Wang, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 24, 2022, pages 3652-3664. arXiv | doi
[C40] Unsupervised Motion Representation Learning with Capsule Autoencoders. Ziwei Xu, Xudong Shen, Yongkang Wong, Mohan Kankanhalli. Annual Conference on Neural Information Processing Systems (NeurIPS), 2021. NeurIPS | arXiv | Code & Dataset release | Slide
[C39] Learning to Predict Trustworthiness with Steep Slope Loss. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. Annual Conference on Neural Information Processing Systems (NeurIPS), 2021. NeurIPS | arXiv | Code release | Slide
[C38] Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions. Xiheng Zhang, Yongkang Wong, Xiaofei Wu, Juwei Lu, Mohan Kankanhalli, Xiangdong Li, Weidong Geng. International Conference on Computer Vision (ICCV), pages 11270-11280, 2021. Paper | Supplementary | Slide | Poster | Video
[J25] Toward Multi-Modal Conditioned Fashion Image Translation. Xiaoling Gu, Jun Yu, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 23, 2021, pages 2361-2371. doi
[J24] Direction Concentration Learning: Enhancing Congruency in Machine Learning. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 43, Issue 6, June 2021, pages 1928-1946. arXiv | Code release | doi
[J23] Scene Graph Inference via Multi-Scale Context Modeling. Ning Xu, An-An Liu, Yongkang Wong, Weizhi Nie, Yu-Ting Su, Mohan Kankanhalli. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Volume 31, Issue 3, March 2021, pages 1031-1041. doi
[C37] $n$-Reference Transfer Learning for Saliency Prediction. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science, Volume 12353, pages 502-519, 2020. arXiv | Code release | doi
[J22] DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning. Guofei Sun, Yongkang Wong, Zhiyong Cheng, Mohan Kankanhalli, Weidong Geng, Xiangdong Li. IEEE Transactions on Multimedia (TMM), Volume 23, 2020, pages 497-509. Code release | doi
[J21] Visual Social Relationship Recognition. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. International Journal of Computer Vision (IJCV), Volume 128, February 2020, Pages 1750-1765. arXiv | doi
[J20] Interact as You Intend: Intention-Driven Human-Object Interaction Detection. Bingjie Xu, Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 22, Issue 6, June 2020, pages 1423-1432. arXiv | doi
[C36] GradMix: Multi-source Transfer across Domains and Tasks. Junnan Li, Ziwei Xu, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 3019-3027, 2020. Paper | Poster | Video | Spotlight | arXiv
[C35] Weakly-Supervised Multi-person Action Recognition in 360$^\circ$ Videos. Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli. Winter Conference on Applications of Computer Vision (WACV), pages 508-516, 2020. Paper | Supplementary | Video | arXiv | Dataset
[J19] Video Storytelling: Textual Summaries for Events. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 22, Issue 2, February 2020, pages 554-565. arXiv | Dataset | doi
[J18] Unsupervised Online Video Object Segmentation with Motion Property Understanding. Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Image Processing (TIP), Volume 29, 2020, pages 237-249. arXiv | doi
[J17] G-softmax: Improving Intra-class Compactness and Inter-class Separability of Features. Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), Volume 31, Issue 2, February 2020, pages 685-699. arXiv | doi
[J16] sEMG-based Gesture Recognition with Embedded Virtual Hand Poses and Adversarial Learning. Yu Hu, Yongkang Wong, Qingfeng Dai, Mohan Kankanhalli, Weidong Geng, Xiangdong Li. IEEE Access, Volume 7, Issue 1, December 2019, Pages 104108-104120. doi
[J15] Hierarchical Multi-View Aggregation Network for Sensor-based Human Activity Recognition. Xiheng Zhang, Yongkang Wong, Mohan Kankanhalli, Weidong Geng. PLOS ONE 14(9):e0221390, September 2019. doi
[C34] Unsupervised Domain Adaptation for 3D Human Pose Estimation. Xiheng Zhang, Yongkang Wong, Mohan Kankanhalli, Weidong Geng. ACM Multimedia (MM), pages 926-934, 2019. Slide | Poster | Project Page | doi
[C33] Self-supervised Representation Learning using 360$^\circ$ Data. Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan Kankanhalli. ACM Multimedia (MM), pages 998-1006, 2019. Poster | doi
[C32] Human-imperceptible Privacy Protection Against Machines. Zhiqi Shen, Shaojing Fan, Yongkang Wong, Tiantsong Ng, Mohan Kankanhalli. ACM Multimedia (MM), pages 1119-1128, 2019. Project Page | doi [Best Student Paper Award]
[C31] Explainable Video Action Reasoning via Prior Knowledge and State Transitions. Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan Kankanhalli. ACM Multimedia (MM), pages 521-529, 2019. doi | Code release
[J14] Surface Electromyography-based Gesture Recognition by Multi-View Deep Learning. Wentao Wei, Qingfeng Dai, Yongkang Wong, Yu Hu, Mohan Kankanhalli, Weidong Geng. IEEE Transactions on Biomedical Engineering (TBME), Volume 66, Issue 10, October 2019, pages 2964-2973. doi
[C30] Learning Controllable Face Generator from Disjoint Dataset. Jing Li, Yongkang Wong, Terence Sim. The International Conference on Computer Analysis of Images and Patterns (CAIP), Volume 11678 of the series Lecture Notes in Computer Science, pages 209-223, 2019. Paper | doi
[J13] Dual-Stream Recurrent Neural Network for Video Captioning. Ning Xu, An-An Liu, Yongkang Wong, Yongdong Zhang, Weizhi Nie, Yu-Ting Su, Mohan Kankanhalli. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Volume 29, Issue 8, August 2019, pages 2482-2493. doi
[C29] Learning to Detect Human-Object Interactions with Knowledge. Bingjie Xu, Yongkang Wong, Junnan Li, Qi Zhao, Mohan Kankanhalli. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 2019-2028, 2019. Paper | Poster | Code release
[C28] Learning to Learn from Noisy Labeled Data. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5051-5059, 2019. Paper | arXiv | Poster | Code release
[J12] A Multi-sensor Framework for Personal Presentations Analytics. Tian Gan, Junnan Li, Yongkang Wong, Mohan Kankanhalli. ACM Transactions on Multimedia, Computing Communications and Application (TOMM), Volume 15, Issue 2, Article No. 30, June 2019. doi
[J11] Multi-Modal and Multi-Domain Embedding Learning for Fashion Retrieval and Analysis. Xiaoling Gu, Yongkang Wong, Lidan Shou, Pai Peng, Gang Chen, Mohan Kankanhalli. IEEE Transactions on Multimedia (TMM), Volume 21, Issue 6, June 2019, pages 1524-1537. doi
[J10] A Multi-Stream Convolutional Neural Network for sEMG-based Gesture Recognition in Muscle-Computer Interface. Wentao Wei, Yongkang Wong, Yu Du, Yu Hu, Mohan Kankanhalli, Weidong Geng. Pattern Recognition Letters (PRL), Volume 119, March 2019, pages 131-138. doi
[J9] LSTM-based Multi-Label Video Event Detection. An-An Liu, Zhuang Shao, Yongkang Wong, Junnan Li, Yu-Ting Su, Mohan Kankanhalli. Multimedia Tools and Applications, Volume 78, Issue 1, January 2019, pages 677-695. doi
[C27] Unsupervised Learning of View-Invariant Action Representations. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. Annual Conference on Neural Information Processing Systems (NeurIPS), pages 1260-1270, 2018. Paper | arXiv | Poster | Video
[J8] A Fine-grained Spatial-Temporal Attention Model for Video Captioning. An-An Liu, Yirui Qiu, Yongkang Wong, Yu-Ting Su, Mohan Kankanhalli. IEEE Access, Volume 6, November 2018, pages 68463-68471. doi
[J7] A Novel Attention-based Hybrid CNN-RNN Architecture for sEMG-based Gesture Recognition. Yu Hu, Yongkang Wong, Wentao Wei, Yu Du, Mohan Kankanhalli, Weidong Geng. PLOS ONE 13(10): e0206049, October 2018. doi
[J6] Hierarchical & Multimodal Video Captioning: Discovering and Transferring Multimodal Knowledge for Vision to Language. An-An Liu, Ning Xu, Yongkang Wong, Junnan Li, Yu-Ting Su, Mohan Kankanhalli. Computer Vision and Image Understanding (CVIU), Volume 163, October 2017, Pages 113-125. doi
[C26] Dual-Glance Model for Deciphering Social Relationships. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. International Conference on Computer Vision (ICCV), pages 2669-2678, 2017. Dataset | Paper | arXiv | Poster | doi
[C25] Attention Transfer from Web Images for Video Recognition. Junnan Li, Yongkang Wong, Qi Zhao, Mohan Kankanhalli. ACM Multimedia (MM), pages 1-9, 2017. Dataset | Paper | arXiv | Poster | doi
[C24] Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding Learning. Xiaoling Gu, Yongkang Wong, Pai Peng, Lidan Shou, Gang Chen, Mohan Kankanhalli. ACM Multimedia (MM), pages 190-198, 2017. Dataset | Paper | Supplementary | Poster | doi
[J5] Benchmarking a Multimodal and Multiview and Interactive Dataset for Human Action Recognition. An-An Liu, Ning Xu, Weizhi Nie, Yu-Ting Su, Yongkang Wong, Mohan Kankanhalli. IEEE Transactions on Cybernetics, Volume 47, Issue 7, July 2017, Pages 1781-1794. doi
[C23] Semi-supervised Learning for Surface EMG-based Gesture Recognition. Yu Du, Yongkang Wong, Wentao Wei, Yu Hu, Mohan Kankanhalli, Weidong Geng. International Joint Conference on Artificial Intelligence (IJCAI), pages 1624-1630, 2017. Paper | Slide | doi
[C22] Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking. Wenhui Li, Yongkang Wong, An-An Liu, Yang Li, Yu-Ting Su, Mohan Kankanhalli. IEEE Winter Conference on Applications of Computer Vision (WACV), 2017. Dataset | Paper | arXiv | Poster | Project Page | doi
[C21] Demo Paper: PreSense - An Assistive Presentation Self-Quantification System. Junnan Li, Yongkang Wong, Mohan Kankanhalli. IEEE International Symposium on Multimedia (ISM), Pages 401-402, 2016. Paper | doi
[C20] Multi-stream Deep Learning Framework for Automated Presentation Assessment. Junnan Li, Yongkang Wong, Mohan Kankanhalli. IEEE International Symposium on Multimedia (ISM), Pages 222-225, 2016. Paper | doi
[C19] Towards Protecting Biometric Templates Without Sacrificing Performance. Jing Li, Yongkang Wong, Terence Sim. International Conference on Pattern Recognition (ICPR), 2016. Paper | Poster | doi
[C18] Marker-less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps. Yu Du, Yongkang Wong, Yonghao Liu, Feiling Han, Yilin Gui, Zhen Wang, Mohan Kankanhalli, Weidong Geng. European Conference on Computer Vision (ECCV), Volume 9908 of the series Lecture Notes in Computer Science, Pages 20-36, 2016. Paper | Poster | Code release | DEMO | Project Page | doi
[C17] Multi-sensor Self-Quantification of Presentations. Tian Gan, Yongkang Wong, Bappaditya Mandal, Vijay Chandrasekhar, Mohan Kankanhalli. ACM Multimedia (MM), pages 601-610, 2015. Paper | doi
[C16] Multi-Modal & Multi-View & Interactive Benchmark Dataset for Human Action Recognition. Ning Xu, An-An Liu, Weizhi Nie, Yongkang Wong, Fuwu Li, Yu-Ting Su. ACM Multimedia (MM), pages 1195-1198, 2015. Paper | doi
[C15] Label Consistent Quadratic Surrogate Model for Visual Saliency Prediction. Yan Luo, Yongkang Wong, Qi Zhao. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 5060-5069, 2015. Paper | doi
[J4] Human Action Recognition Based on Local Action Attributes. Jing Zhang, Hong Lin, Weizhi Nie, Lekha Chaisorn, Yongkang Wong, Mohan Kankanhalli. Journal of Electrical Engineering & Technology, Volume 10, Issue 3, May 2015. Paper | doi
[J3] Multi-Camera Saliency. Yan Luo, Ming Jiang, Yongkang Wong, Qi Zhao. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Volume 37, Issue 10, 15 January 2015, Pages 2057-2070. Paper | doi
[J2] On Robust Face Recognition via Sparse Encoding: the Good, the Bad, and the Ugly. Yongkang Wong, Mehrtash Harandi, Conrad Sanderson. IET Biometrics, Volume 3, Issue 4, December 2014, Pages 176-189. arXiv | doi
[C14] Discovering Person Identity via Large-Scale Observations. Yongkang Wong, Lekha Chaisorn, Mohan Kankanhalli. Workshop on Human Identification for Surveillance, Asian Conference on Computer Vision (ACCV) Workshops, 2014. Paper | doi
[C13] Recovering Social Interaction Spatial Structure from Multiple First-Person Views. Tian Gan, Yongkang Wong, Bappaditya Mandal, Vijay Chandrasekhar, Liyuan Li, Joo-Hwee Lim, Mohan Kankanhalli. International Workshop on Socially-Aware Multimedia (IWSAM), ACM Multimedia workshops, 2014. Paper | doi
[C12] Scalable Decision-Theoretic Coordination and Control for Real-Time Active Multi-Camera Surveillance. Prabhu Natarajan, Trong Nghia Hoang, Yongkang Wong, Kian Hsiang Low, Mohan Kankanhalli. International Conference on Distributed Computing Systems (ICDCS), 2014 (invited paper). Paper
[C11] View-Invariant Feature Discovering for Multi-Camera Human Action Recognition. Hong Lin, Lekha Chaisorn, Yongkang Wong, An-An Liu, Yu-Ting Su, Mohan Kankanhalli. IEEE International Workshop on Multimedia Signal Processing (MMSP), 2014. Paper
[C10] Multi-View Action Recognition by Cross-Domain Learning. Weizhi Nie, An-An Liu, Jing Yu, Yu-Ting Su, Lekha Chaisorn, Yongkang Wong, Mohan Kankanhalli. IEEE International Workshop on Multimedia Signal Processing (MMSP), 2014. Paper
[J1] Automatic Classification of Human Epithelial Type 2 Cell Indirect Immunofluorescence Images using Cell Pyramid Matching. Arnold Wiliem, Conrad Sanderson, Yongkang Wong, Peter Hobson, Rodney F. Minchin, Brian C. Lovell. Pattern Recognition (PR), Volume 47, Issue 7, July 2014, Pages 2315-2324. Paper | doi
[C9] Video Analytics for Surveillance Camera Networks. Lekha Chaisorn, Yongkang Wong. IEEE International Conference on Networks, 2013 (invited paper). Paper | doi
[C8] Temporal Encoded F-formation System for Social Interaction Detection. Tian Gan, Yongkang Wong, Daqing Zhang, Mohan Kankanhalli. ACM Multimedia (MM), pages 937-946, 2013. Paper | doi
[C7] Classification of Human Epithelial Type 2 Cell Indirect Immunofluoresence Images via Codebook Based Descriptors. Arnold Wiliem, Yongkang Wong, Conrad Sanderson, Peter Hobson, Shaokang Chen, Brian C. Lovell. IEEE Workshop on the Applications of Computer Vision (WACV), pages 95-102, 2013. Paper | doi
[C6] Combined Learning of Salient Local Descriptors and Distance Metrics for Image Set Face Verification. Conrad Sanderson, Mehrtash Harandi, Yongkang Wong, Brian C. Lovell. IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS), pages 294-299, 2012. Paper | doi
[C5] On Robust Biometric Identity Verification via Sparse Encoding of Faces: Holistic vs Local Approaches. Yongkang Wong, Mehrtash Harandi, Conrad Sanderson, Brian C. Lovell. International Joint Conference on Neural Networks (IJCNN), 2012. Paper | doi
[C4] Patch-based Probabilistic Image Quality Assessment for Face Selection and Improved Video-based Face Recognition. Yongkang Wong, Shaokang Chen, Sandra Mau, Conrad Sanderson, Brian C. Lovell. Biometrics Workshop, IEEE Conference Computer Vision and Pattern Recognition (CVPR) Workshops, 2011. Paper | Dataset 1 | Dataset 2 | doi [Highest Impact Award (presented at CVPR 2015 Biometrics Workshop)]
[C3] Dynamic Amelioration of Resolution Mismatches for Local Feature Based Identity Inference. Yongkang Wong, Conrad Sanderson, Sandra Mau, Brian C. Lovell. International Conference on Pattern Recognition (ICPR), Pages 1200-1203, 2010. Paper | doi
[C2] Regression Based Non-Frontal Face Synthesis for Improved Identity Verification. Yongkang Wong, Conrad Sanderson, Brian C. Lovell. International Conference on Computer Analysis of Images and Patterns (CAIP), Pages 116-124, 2009. Paper | doi
[C1] Narrow-Band FM-Multi-Tone FSK Modem: TMS320C6000 Based Test bed Implementation and Performance Analysis. Kandeepan Sithamparanathan, Yongkang Wong. IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), 2006. doi
Multi-Scale Multi-Region Probabilistic Histograms for HEp-2 Cell Classification. Arnold Wiliem, Yongkang Wong, Conrad Sanderson, P. Hobson, S. Shen, Brian C. Lovell. ICPR Contest 2012. [Rank 8 out of 30 submissions]
K. Stephen, S. Yamazaki, J. Liu, R. Sheoran, Y. Wong, M. Kankanhalli, Relationship Extraction Apparatus, Relationship Extracting Method, and Storage Medium, US Patent App. 18/777,827.
S. Yamazaki, K. Stephen, J. Liu, R. Sheoran, Y. Wong, M. Kankanhalli, Event Detecting Apparatus, Event Detecting Method, and Storage Medium, US Patent App. 18/777,784.
C. Sanderson, Y. Wong, Identifying Matching Images, WO2011088520A1,A8, US20120328197A1, US 9,165,184 B2
S. Chen, Y. Wong, Image Quality Assessment, WO2012109712A1, US20140044348 A1