Current research focus (March 2024)

Medical Report Generation (Or Radiology Report Generation)

The objective of medical report generation is to create computational algorithms capable of generating textual reports that articulate the findings and impressions in the images taken in modalities such as chest X-rays, CT scans, MRIs, or ultrasounds. This endeavor holds the promise of substantially alleviating the current workload burden on radiologists, enhancing their operational efficiency, mitigating examination errors, and ultimately facilitating the delivery of superior and more sustainable healthcare services to society. The primary aim of this research is to advance the development of sophisticated approaches, methodologies, and algorithms for the representation of visual and textual information, and the seamless translation of the former into the latter. Deep learning techniques, alongside large language and vision models, as well as knowledge representation and reasoning methodologies, are poised to play pivotal roles in this pursuit.

Fine-grained Vision and Language Understanding and Interaction 

The last decade has witnessed the significant progress on understanding vision, language and their interactions, with the advent of large models being a good evidence. Nevertheless, the current models are still far from being satisfactory in terms of fine-grained understanding, that is, capturing the subtle, detailed contents of image and text and associating them in a precise manner. Having the capabilities is highly desirable in many tasks involving fine-grained image and text understanding and the generation between image and text, especially when the background applications are medicine- or healthcare-related. This research aims to develop advanced approaches, methodologies, and techniques to understand image and text in a more nuanced way and better align visual and textual information in order to achieve comprehensive and accurate understanding.   

Continual Learning with the Application to Robotics 

This project endeavours to pioneer novel machine learning techniques aimed at enhancing machines' ability to leverage past experiences to tackle new tasks with limited data. Its primary objective is to mitigate the undesirable reliance of current machine learning methodologies on labeled data while substantially enhancing their performance, particularly in applications related to robotics. Anticipated outcomes of the project include the generation of new theoretical insights into continual learning, alongside the development of innovative algorithms poised to underpin the creation of the next generation of computer vision, machine learning, and robotics systems tailored to operate within open and dynamic environments. The envisioned impact of these advancements extends to benefiting science, society, and the economy through the application of these advanced intelligent systems. Deep learning, meta-learning, reinforcement learning, and the utilisation of large-scale models are expected to play pivotal roles in driving the success of this research endeavour.

Generic Image recognition

Image and object recognition has recently made significance progress with the advent of deep learning. The performance and efficiency of visual recognition have been improved to an unprecedentedly high level. Our recent research in this regard is focused on i) symmetric positive-definite (SPD) matrix-based visual representation; ii) unsupervised domain adaptation for image classification; and ii) new models and algorithms for fine-grained image recognition.

Content-based Image Retrieval

Content-based image retrieval has recently witnessed its fast development due to the pervasive use of mobile platform and the explosion of the volumes of images over the Internet. Our recent research in this regard is focused on i) query-adaptive image retrieval; ii) image retrieval via unsupervised deep learning; and iii) retrieval on archival photographic collections, by collaborating with national and regional organisations.

Grants and Awards

HDR Principal Supervisor (topics may change)