My primary research interest lies in advancing the way we explore, analyze, and interact with complex data. To this end, I develop new techniques and methodologies in machine learning and computer vision, with growing emphasis on multimodal models that connect visual and linguistic information.
My current projects focus on the algorithmic foundations of machine learning, particularly in distributed learning, semi-supervised learning, active learning, multi-task learning, and transfer learning. I also investigate new methods for capturing, processing, mining, and visualizing image and video data, as well as techniques for bridging visual and textual modalities to enable robust and interpretable multimodal understanding.