Ajay Divakaran
Bio
Ajay Divakaran, Ph.D., is the Senior Technical Director of the Vision and Learning Lab at the Center for Vision Technologies, SRI International, Princeton. Divakaran has been a principal investigator for several SRI research projects for DARPA, IARPA, ONR, and other agencies. His work includes multimodal content comprehension, multimodal conversation understanding and dialog management, multimodal analytics for social media, real-time human behavior assessment, and event detection. He has helped develop several innovative technologies for government and commercial multimodal systems, such as Passio's Meal Scan feature for MyFitnessPal and driver drowsiness detection for Toyota. He worked at Mitsubishi Electric Research Labs from 1998 to 2008, where he was the lead inventor of the world's first sports highlights playback-enabled DVR and developed several machine learning applications. Divakaran was named a Fellow of the IEEE in 2011 for his contributions to multimedia content analysis. He has authored two books, 130+ publications, and 60+ issued patents. He received his Ph.D. degree in electrical engineering from Rensselaer Polytechnic Institute.
Links
SRI Webpage
https://www.sri.com/about/people/ajay-divakaran
SRI Dish
SRI's Multimodal Content Recommendation Commercialized by SRI Spin-off Vitrina
https://medium.com/dish/vitrina-ai-the-future-of-video-licensing-transactions-a4874355ce03
SRI Food Recognition Technology Commercialized by SRI Spin-off Passio
https://blog.myfitnesspal.com/meal-scan/
SRI's Food Recognition Paper from 2015
https://pubmed.ncbi.nlm.nih.gov/25901024/
SRI Featured Innovator
https://medium.com/dish/featured-innovator-ajay-divakaran-adab82907ed
Ajay Divakaran talks about big data, social media influence, and robotic navigation
https://www.youtube.com/watch?v=y3H0hNAyFd0&feature=youtu.be
Linkedin Profile
https://www.linkedin.com/in/ajay-divakaran-3445361/
Google Scholar
dblp
https://dblp.uni-trier.de/pers/d/Divakaran:Ajay.html
Twitter
Longer Bio
Ajay Divakaran, Ph.D., is the Technical Director of the Vision and Learning Lab at the Center for Vision Technologies, SRI International, Princeton. Divakaran has been a principal investigator for several SRI research projects for DARPA, IARPA, ONR, and other agencies. His work includes multimodal social media analytics; vision and language; knowledge-guided machine learning; multimodal modeling and analysis of affective, cognitive, and physiological aspects of human behavior; interactive virtual reality-based training; tracking of individuals in dense crowds and multi-camera tracking; technology for automatic food identification and volume estimation; and analytics for event detection in open-source video. He has developed several innovative technologies for multimodal systems in both commercial and government programs during his career. Prior to joining SRI in 2008, Divakaran worked at Mitsubishi Electric Research Labs for 10 years, where he was the lead inventor of the world's first sports highlights playback-enabled DVR. He also oversaw a wide variety of product applications of machine learning. Divakaran was named a Fellow of the IEEE in 2011 for his contributions to multimedia content analysis. He developed techniques for recognition of agitated speech as part of his work on automatic sports highlights extraction from broadcast sports video. He established a sound experimental and theoretical framework for human perception of action in video sequences as lead inventor of the MPEG-7 video standard motion activity descriptor. He serves on the technical program committees of key multimedia conferences and served as an associate editor of IEEE Transactions on Multimedia from 2007 to 2010. He currently serves on the editorial board of IEEE Intelligent Systems. He has authored two books and has more than 130 publications to his credit, as well as more than 60 issued patents. He has supervised four Ph.D. theses. He was a research associate at the ECE Dept., IISc, from September 1994 to February 1995, and a scientist with Iterated Systems Incorporated, Atlanta, GA, from 1995 to 1998. Divakaran received his M.S. and Ph.D. degrees in electrical engineering from Rensselaer Polytechnic Institute. His B.E. in electronics and communication engineering is from the University of Jodhpur, India, where he was a lecturer in 1985-86.
He has taught at multiple levels, including second-grade children (math), incoming college freshmen (math), EE undergraduates (electronic circuits and control systems), and Ph.D. students. He has an interest in special needs students. He is a fluent Japanese speaker and can speak survival French. He also speaks Hindi (native), Telugu, Tamil, and Marwari, in roughly descending order of fluency. He has been learning Hindustani vocal music from the prominent Hindustani vocalist Mrs. Kumkum Sanyal since 2003.
Selected Recent Publications
Pritish Sahu, Michael Cogswell, Sara Rutherford-Quach, Ajay Divakaran
Comprehension Based Question Answering using Bloom's Taxonomy
To Appear at the 6th Workshop on Representation Learning for NLP, 2021
Pritish Sahu, Karan Sikka, Ajay Divakaran
Towards Multimodal Comprehension
ICCV 2021
Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio
Confidence Calibration for Cross-Domain Generalization under Covariate Shift (To appear at ICCV 2021)
Xiao Lin, Meng Ye, Yunye Gong, Giedrius Buracas, Nikoletta Basiou, Ajay Divakaran, Yi Yao
Modular Adaptation for Cross-Domain Few-Shot Learning
Arijit Ray, Michael Cogswell, Xiao Lin, Kamran Alipour, Ajay Divakaran, Yi Yao, Giedrius Burachas
Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Karan Sikka, Indranil Sur, Susmit Jha, Anirban Roy, Ajay Divakaran
"Detecting Trojaned DNNs Using Counterfactual Attributions," arXiv:2012.02275
Karan Sikka, Jihua Huang, Andrew Silberfarb, Prateeth Nayak, Luke Rohrer, Pritish Sahu, John Byrnes, Ajay Divakaran, Richard Rohwer, "Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings," arXiv:2011.10889
Meng Ye, Xiao Lin, Giedrius Burachas, Ajay Divakaran, Yi Yao, "Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning," arXiv submission, November 2020
https://arxiv.org/abs/2011.10082
4th Lifelong Machine Learning Workshop at ICML 2020
Raghavan, A., Hostetler, J., Sur, I., Rahman, A., & Divakaran, A. (2020). Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition, and Selective Transfer. 4th Lifelong Machine Learning Workshop, Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR, 8.
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks
ICLR 2020 Workshop on Integration of Deep Neural Models and Differential Equations.
Progressive Growing of Neural ODEs
WACV 2019
Pallabi Ghosh, Yi Yao, Larry S. Davis, Ajay Divakaran:
Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation.
WACV poster presentation at 24:25
https://www.youtube.com/watch?v=zZDhauFsOUo
FoodX-251: A Dataset for Fine-grained Food Classification
"Brain to Brain" communications
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks
MatchStax Multimodal Embedding API
Demo: Video Retrieval with MatchStax
Demo Video: Cross-Platform Retrieval with MatchStax (Twitter-Instagram)
SRI's Multimodal Content Recommendation Commercialized by SRI Spin-off Vitrina
https://medium.com/dish/vitrina-ai-the-future-of-video-licensing-transactions-a4874355ce03
EMNLP 2019
Multi-modal Document Intent in Instagram Posts
Demo Video
Demo Video: Multimodal Document Intent
ICCV 2019
Align2Ground: Weakly Supervised Phrase Grounding guided by Image-Caption Alignment
ECCV 2018
Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran:
Zero-Shot Object Detection. ECCV (1) 2018: 397-414
ICMI 2015
https://drive.google.com/open?id=0B1TzavQVNsXGcVFmR3RBNy1QdWc
ICME 2015
https://drive.google.com/open?id=0B1TzavQVNsXGdmFLVzJ4SmNldlk
Some Pertinent Videos
Understanding Group Interactions in STEM
NSF Project led by Education Division in collaboration with Center for Vision Technology, SRI International
https://stemforall2021.videohall.com/presentations/2147
MatchStax Multimodal Embedding API
Video Retrieval with MatchStax
https://www.youtube.com/watch?v=NFmM4ZlMPTY
Cross-Platform (Instagram-Twitter) Retrieval with MatchStax
Ajay Divakaran talks about big data, social media influence, and robotic navigation
https://www.youtube.com/watch?v=y3H0hNAyFd0&feature=youtu.be
Xiao Lin talks about artificial intelligence and brain-to-brain data transfer
https://www.youtube.com/watch?v=NMDEs1DybXU
Jesse Hostetler talks about Lifelong Learning and getting Robots to dream
https://www.youtube.com/watch?v=S5Co1T_uuDE
Julia Kruk talks about the evolution of human communication through social media
https://www.youtube.com/watch?v=1yQ3-fN9HV4
Karan Sikka talks about embedding knowledge in machine learning models
https://www.youtube.com/watch?v=YPrXavrlqzs
Yunye Gong talks about applying physics-based models to deep learning
https://www.youtube.com/watch?v=lxdp6d_Ih94&t=318s
WACV poster presentation at 24:25
https://www.youtube.com/watch?v=zZDhauFsOUo
Amir Tamrakar: Incorporating Conversational AI in Automotive Applications
https://www.youtube.com/watch?v=bKJideEmyss
Multimodal Document Intent in Instagram Posts
MIBA Demo, January 2015 (Final Draft)
https://youtu.be/t_CbYo5ow04
DARPA M3I Multimodal Embedding for Social Media Analytics Demo
SRI Vision and Learning
Amir Tamrakar
News
SRI's Driver Monitoring System wins AutoTech Breakthrough Award in the Sensor Systems category
https://www.sri.com/announcements/sri-selected-as-autotech-breaktrhough-award-winner/
https://autotechbreakthrough.com/2020-winners/
https://scholar.google.com/citations?user=nBUpZ-EAAAAJ&hl=en
Recent Publications
Jihua Huang, Amir Tamrakar, "ACE-Net: Fine-Level Face Alignment through Anchors and Contours Estimation"
HAI 2020
Sujeong Kim, David Salter, Luke DeLuccia, Amir Tamrakar. 2020. Study on Text-based and Voice-based Dialogue Interfaces for Human-Computer Interactions in a Blocks World. In Proceedings of the 8th International Conference on Human-Agent Interaction (HAI ’20), November 10–13, 2020, Virtual Event, NSW, Australia. ACM, New York, NY, USA, 3 pages. https://doi.org/10.1145/3406499.3418754
Anirudh Som, Sujeong Kim, Bladimir Lopez-Prado, Svati Dhamija, Nonye Alozie, Amir Tamrakar, European Conference on Computer Vision (ECCV) Workshops, 2020
Alozie, N., Dhamija, S., McBride, E., & Tamrakar, A. (2020, June). Automated collaboration assessment using behavioral analytics. International Conference of the Learning Sciences (ICLS), Nashville, TN
Alozie, N., McBride, E., Dhamija, S., & Tamrakar, A. (2020, April). Collaboration Conceptual Model to Inform the Development of Machine Learning Models Using Behavioral Analytics. San Francisco, CA: American Educational Research Association (AERA)
Incorporating Conversational AI in Automotive Applications
https://www.youtube.com/watch?v=bKJideEmyss
SMILEE developed under DARPA CwC
http://aclweb.org/anthology/N18-5018
This is the 2-minute short version for the actual submission: https://youtu.be/2iM5t7cpua0
This is the director’s cut, a.k.a. the longer version: https://youtu.be/hfE7j7PRWro
Aesop developed under DARPA CwC by Mohamed Amer
Aesop human-computer storytelling
Yi Yao
https://scholar.google.com/citations?user=iD6QaXcAAAAJ&hl=en
ICLR 2020 Workshop on Integration of Deep Neural Models and Differential Equations.
Progressive Growing of Neural ODEs
http://arxiv.org/abs/2003.03695
WACV 2019
Pallabi Ghosh, Yi Yao, Larry S. Davis, Ajay Divakaran:
Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation.
https://arxiv.org/pdf/1811.10575v1.pdf
WACV poster presentation at 24:25
https://www.youtube.com/watch?v=zZDhauFsOUo
NeurIPS 2019 (Work done by Mohamed Amer under DARPA XAI)
https://arxiv.org/abs/1905.02850
Karan Sikka
SRI's Multimodal Content Recommendation Commercialized by SRI Spin-off Vitrina
https://medium.com/dish/vitrina-ai-the-future-of-video-licensing-transactions-a4874355ce03
Karan Sikka talks about embedding knowledge in machine learning models
https://www.youtube.com/watch?v=YPrXavrlqzs
Xiao Lin
Xiao Lin talks about artificial intelligence and brain-to-brain data transfer
https://www.youtube.com/watch?v=NMDEs1DybXU
https://scholar.google.com/citations?user=zSIbUH4AAAAJ&hl=en
Xiao Lin, Meng Ye, Yunye Gong, Giedrius Buracas, Nikoletta Basiou, Ajay Divakaran, Yi Yao
Modular Adaptation for Cross-Domain Few-Shot Learning
Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio
Confidence Calibration for Cross-Domain Generalization under Covariate Shift
https://filebox.ece.vt.edu/~linxiao/
Anirban Roy
https://scholar.google.com/citations?user=N9eSuR4AAAAJ&hl=en
Jesse Hostetler
Jesse Hostetler talks about Lifelong Learning and getting Robots to dream
https://www.youtube.com/watch?v=S5Co1T_uuDE
https://jhostetler.github.io/
https://scholar.google.com/citations?user=ngJLn9EAAAAJ&hl=en
Meng Ye
Meng Ye, Xiao Lin, Giedrius Burachas, Ajay Divakaran, Yi Yao, "Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning," arXiv submission, November 2020
https://arxiv.org/abs/2011.10082
https://scholar.google.com/citations?user=YMeDRE8AAAAJ&hl=en
https://sites.google.com/site/mengye1225/
Yunye Gong
Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio
Confidence Calibration for Cross-Domain Generalization under Covariate Shift
Yunye Gong talks about applying physics-based models to deep learning
https://www.youtube.com/watch?v=lxdp6d_Ih94&t=318s
https://doerschuklab.bme.cornell.edu/people/yunye-gong/
https://www.linkedin.com/in/yunye-gong-192b5629
https://dblp.uni-trier.de/pers/hd/g/Gong:Yunye
Arijit Ray
https://filebox.ece.vt.edu/~ray93/
Arijit Ray, Michael Cogswell, Xiao Lin, Kamran Alipour, Ajay Divakaran, Yi Yao, Giedrius Burachas
Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas, Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation, 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019); also at the CVPR 2019 VQA and Visual Dialog Workshop
Arijit Ray, Yi Yao, Rakesh Kumar, Ajay Divakaran, Giedrius Burachas, Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval, 2019 AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2019)
Julia Kruk
Julia Kruk talks about the evolution of human communication through social media
https://www.youtube.com/watch?v=1yQ3-fN9HV4
EMNLP 2019
Multi-modal Document Intent in Instagram Posts
https://arxiv.org/abs/1904.09073
Demo Video
Some Former and Current Collaborators
Nick Vander Valk
https://dblp.org/pers/v/Valk:Nick_Vander.html
Mohamed Amer
Aswin Raghavan
https://sites.google.com/site/ashwinnr/home
Shih-Fu Chang
Uri Hasson
Daniel Jurafsky
VS Subrahmanian
Jure Leskovec
Regunathan Radhakrishnan
Ziyou Xiong
Lexing Xie
Kadir Peker
Huifang Sun
Anthony Vetro
Isao Otsuka
Ajay Divakaran's Music
Disciple of the prominent Hindustani vocalist Mrs. Kumkum Sanyal since 2003
https://www.youtube.com/user/ajaydivakaran
Raga Bageshri
https://www.youtube.com/watch?v=wi7Y-ULmhm8
Cafe Improv 2016
http://cafeimprov.weebly.com/best-of-cafe-improv-2016.html
Cafe Improv 2017
https://northofoxford.wordpress.com/2018/01/19/best-of-cafe-improv-2017/