Combinatorial Approaches for Visual
Data Summarization
Rishabh Iyer
Monday, January 7th 2PM - 5PM
Exabytes of visual data is created everyday
How do you organize and summarize this big data?
This tutorial will address several aspects of Visual Data Summarization including Image Collection Summarization, Video Summarization, Entity/Object Summarization in Videos/Images, Data Subset Selection, and Diversified Active Learning. We shall study a combinatorial framework (specifically via a class of discrete optimization functions called submodular functions) for the above visual data summarization problems, and motivate various summarization models and discuss how they models different aspects of summarization including diversity, coverage, representation and importance. Moreover, we shall show how we can learn combinatorial models from data. Throughout this tutorial, we shall show how summarization models defined this way, not only work well in practice and scale well to massive scale problems, but the resulting models are also interpretable and intuitive.
Visual Data in the form of images, videos and live streams have been growing at an unprecedented rate in the last few years. While this massive data is a blessing to data science by helping improve predictive accuracy, it is also a curse since humans are unable to consume this large amount of data. Moreover, today, machine generated videos (via Drones, Dash-cams, Body-cams, Security cameras, Go-pro etc.) are being generated at a rate higher than what we as humans can process. Moreover, majority of this data is plagued with redundancy. Given this data explosion, machine learning techniques which automatically understand, organize and categorize this data are of utmost importance. Visual Data summarization attempts to solve this problem in two ways.
Rishabh Iyer is currently a Research Scientist at Microsoft, where he works on several problems around computer vision, discrete optimization, online learning, contextual bandits, reinforcement learning etc. He finished his Postdoc and Ph.D from the University of Washington, Seattle, where he worked with Prof. Jeff Bilmes. His work has received best paper awards at the International Conference of Machine learning and the Neural Information Processing Systems. He also won the Microsoft Ph.D. fellowship and Facebook Ph.D. Fellowship, along with the Yang Outstanding Doctoral Student Award from University of Washington. He completed his B.Tech at the Department of Electrical Engineering at IIT Bombay in 2011, and has been a visitor at Microsoft Research, Redmond and Simon Fraser University. He has worked on several aspects of Machine Learning including discrete and convex optimization, deep learning, video/image summarization, data subset selection, active learning, online learning etc. He has applied his work in several domains including search advertisement, computer vision, text classification and speech. He has given invited talks/tutorials at numerous conferences and workshops including the AMS Sectional Meeting, International Symposium on Mathematical Programming (ISMP), Non-convex Optimization for Machine Learning (NOML) Summer School and several renowned research and academic institutions world-wide.