Speakers

Prof. Jorunn Mona Skofteland Gislefoss - University Of Agder

I have 24 years of experience from the process industry. 22 years at the same facility, the last few years as factory director for GE Healthcare Lindesnes, which makes active ingredients for X-ray contrast agents, and has approx. 45% of the world market. 2 years as factory director for REC (formerly Elkem) Solar in Kristiansand, which produces silicon for solar cells, with the most energy-efficient process and lowest carbon footprint in the world. Four months in NORCE, before I started at UiA on 1 February 2020. Several board positions over many years (Ex: Eyde chairman of the board, Norwegian Industry NHO representative board, UiA board, National program board for VRI in the Research Council). My background is Siv.ing and Dr.ing in physics from NTH/NTNU.

Key Note Speaker

Prof. B. Yegnanarayana - IIT Hyderabad

Dr. Bayya Yegnanarayana is currently INSA Hon Scientist and Emeritus Professor at IIIT Hyderabad. He was Professor Emeritus at BITS-Pilani Hyderabad Campus during 2016. He was an Institute Professor from 2012 to 2016 and Professor & Microsoft Chair from 2006 to 2012 at the International Institute of Information Technology Hyderabad (IIIT-H). He was a Professor (1980 to 2006) and Head of the CSE Dept (1985 to 1989) at IIT Madras, a visiting Associate Professor at Carnegie-Mellon University (CMU), Pittsburgh, USA (1977 to 1980), and a member of the faculty at the Indian Institute of Science (IISc), Bangalore, (1966 to 1978). He received BSc from Andhra University in 1961, and BE, ME and PhD from IISc Bangalore in 1964, 1966, and 1974, respectively. His research interests are in signal processing, speech, image processing and neural networks. He has published over 450 papers in these areas. He is the author of the book "Artificial Neural Networks" published by Prentice-Hall of India in 1999. He has supervised 37 PhD and 42 MS theses at IISc, IITM and IIIT-H. He is a Fellow of the Indian National Academy of Engineering (INAE), Indian National Science Academy (INSA), Indian Academy of Sciences (IASc), IEEE (USA), and International Speech Communications Association (ISCA). He was the recipient of the 3rd IETE Prof. S. V. C. Aiya Memorial Award in 1996. He received the Prof. S. N. Mitra Memorial Award for the year 2006 from INAE. He was awarded the 2013 Distinguished Alumnus Award from IISc Bangalore. He was awarded "The Sayed Husain Zaheer Medal (2014)" of INSA in 2014. He received Prof. Rais Ahmed Memorial Lecture Award from the Acoustical Society of Inida in 2016. He was an Associate Editor for the IEEE Transactions on Audio, Speech and Language Processing during 2003-2006. From June 2020, he is an Associate Editor for the Journal of the Acoustical Society of America. He received Doctor of Science (Honoris Causa) from Jawaharlal Nehru Technological University Anantapur in February 2019. He was the General Chair for Interspeech2018 held in Hyderabad, India, during September 2018. He was a visiting Professor at CMU Africa in Rwanda and at IIT Dharwad during 2019. He is currently Adjunct Faculty at IIT Tirupati, Distinguished Professor at IIT Hyderabad, and Distinguished Adjunct Professor at IIITNR.

Title: mmWave Radars for Automotive and Industrial Applications

The mmWave frequency modulated continuous wave (FMCW) radars provide excellent target detection and localization performance in harsh environments as well as low lighting conditions. They offer a resolution of less than 5 cm, a range detection range of hundreds of meters, and a velocity of up to 300 km/h. This talk focuses on the applications of these highly versatile sensors in automotive and industrial applications.

Prof. Om Jee Pandey - IIT BHU

Om Jee Pandey (Senior Member, IEEE) received the B.Tech. degree in electronics and communication engineering from Uttar Pradesh Technical University, Lucknow, India, in 2008, the M.Tech. degree in digital communications from the ABV-Indian Institute of Information Technology and Management, Gwalior, India, in 2013, and the Ph.D. degree from the Department of Electrical Engineering, Indian Institute of Technology Kanpur, Kanpur, India, in 2019. He worked as a Postdoctoral Fellow with the Communications Theories Research Group, Department of Electrical and Computer Engineering, University of Saskatchewan, Saskatoon, SK, Canada. Currently, he is working as an Assistant Professor with the Department of Electronics Engineering IIT BHU Varanasi, India His research interest includes the signal processing for wireless networks with a specific focus on robust sensor node localization and tracking over wireless ad hoc networks. He also works on related areas, such as low-latency data transmission, data aggregation, and distributed detection and estimation in wireless sensor networks. He is a Regular Reviewer for various reputed journals of IEEE, including the IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, IEEE ACCESS, IEEE INTERNET OF THINGS JOURNAL, and the IEEE TRANSACTIONS ON COMMUNICATIONS.

Title: Small-World Models for IoT Applications

In recent years, the Internet of Things (IoT) is getting high attention among the research community due to its various advantages in the context of social and commercial issues such as cognitive health monitoring, smart agriculture, intelligent transportation, and industry 4.0. Wireless Sensor Networks are working as the backbone for IoT, where, the data is transmitted at the base station either directly or in a multi-hop manner. Direct data transmission over long distances consumes a large amount of energy for the individual device. On the other hand, multi-hop data transfer results in increased data transmission delay, more data interference, and reduced data throughput. In order to address these issues, a novel small-world phenomenon can be introduced into the network, leading to an optimal number of hops required for the data transmission. In this talk, we are going to discuss small-world characteristics and their advantages in improving the energy efficiency and quality-of-service (QoS) of IoT networks. We will discuss various methods to introduce small-world characteristics first, subsequently, machine learning frameworks will be explored for the development of small-world wireless sensor networks for IoT applications.

Prof. Subrahmanya Sastry Challa - IIT Hyderabad

C. S. Sastry received the PhD degree in Applied Mathematics from IIT Kanpur. He worked at the University of British Columbia (Vancouver, Canada) as a postdoc fellow and at IIIT Jabalpur as an Assistant Professor. He is currently with the IIT Hyderabad. His research interests include sparsity-driven methods and their applications.

Title : Sparsity-driven learning: Theory and Applications

Abstract :

-----------

Recent developments in sparse optimization theory aim at providing certain classes of linear systems with sparse descriptions, which have the potential of being used for data-driven learning. The presentation aims at exploring the basic theory and applications of the sparsity driven learning methods.

Prof. Ole-Christoffer Granmo - University of Agder

Prof. Ole-Christoffer Granmo is the Founding Director of the Centre for Artificial Intelligence Research (CAIR), University of Agder, Norway. He obtained his master’s degree in 1999 and the PhD degree in 2004, both from the University of Oslo, Norway, and created the Tsetlin machine in 2018. Dr. Granmo has authored more than 150 refereed papers with seven best paper awards within machine learning, encompassing learning automata, bandit algorithms, Tsetlin machines, Bayesian reasoning, reinforcement learning, and computational linguistics. He has further coordinated 7+ research projects and graduated 55+ master- and eight PhD students. Dr. Granmo is also a co-founder of the Norwegian Artificial Intelligence Consortium (NORA). Apart from his academic endeavours, he co-founded the company Anzyz Technologies AS.

Title : The Tsetlin Machine - Today and Tomorrow

Abstract :

-----------

Researchers are accumulating increasing evidence that machine learning models based on correlation are brittle. These models do not distinguish between correlation and causation. Accordingly, they provide limited insight and cannot reason about the effects of actions for decision making. The traditional fields of logical engineering, causal inference, and deep learning have struggled with learning causal models at scale for several decades, each approach facing substantial obstacles. In this talk, I present the emerging paradigm of Tsetlin machines. Tsetlin machines partially unify the latter paradigms through a fundamental shift from arithmetic-based to logic-based machine learning. Like logical engineering, a Tsetlin machine produces propositional/relational Horn clauses (logical rules). However, the logical expressions are robustly learnt using finite state machines in the form of Tsetlin automata. Tsetlin machines further handle uncertainty by using multiple clauses to signify confidence. In this way, Tsetlin machines introduce the concept of logically interpretable learning, where both the learned model and the learning process are easy to follow and explain. The paradigm has enabled competitive accuracy, scalability, memory footprint, inference speed, and energy consumption across diverse tasks. Recent progress deals with classification, convolution, regression, image analysis, natural language processing (NLP), and speech understanding. At the end of the talk, I cover the Tsetlin machine research horizon, addressing the unresolved learning challenge in logical engineering, the scaling challenge of probabilistic causal models, and the correlation-reliance of deep learning. The proposed research agenda merges: (1) recursive logical Horne clauses for modeling all computable functions; (2) causal Tsetlin machine learning for distilling causal mechanisms from data into causal Horn clauses; (3) large-scale probabilistic causal reasoning over sparse truth tables in Horn clause form.

Prof. M.B. Srinivas Mandalika - BITS- Pilani, Hyderabad

M.B. Srinivas has been a Professor of Electrical and Electronics Engineering at BITS Pilani for about 14 years now. He is currently with Dubai Campus of BITS Pilani and is a University-wide Dean of Graduate Studies and Research. His research interests include advanced memory technologies, VLSI Arithmetic and machine learning to address societal problems.

Title : A Cyber-Physical Framework for Agriculture/Horticulture Doamain

Abstract :

-----------

BITS Pilani recently signed an MoU with the Telangana State Horticulture University to develop and deploy technologies to minimize the post-harvest loss of horticulture produce. With increasing population, while there is an urgent need to increase the production of fruits and vegetables, post-harvest losses mean decrease in production in real terms. In India these losses may amount to 30-40% of total production resulting in financial losses to the tune of hundreds of crores of rupees. In this talk, we present a framework that seeks to deploy technologies such as Sensors, IoT, Cloud and Machine Learning to reduce the losses and improve the productivity thereby saving the country of hundreds of crores of rupees and farmers from the financial burden.

Prof. J. Saketh Nath - IIT Hyderabad

J. SakethaNath is currently working as an associate professor in the department of CSE at IIT Hyderbad. Prior to that, he served as faculty at IIT Bombay for seven years. He is interested in machine learning, with a focus on kernel based methods. He received his master's and PhD degrees from IISc, Bengaluru and bachelor's from IIT Madras.

Title : Kantorovich-Wasserstein meets Mercer

Abstract :

-----------

This talk focuses on two families of metrics over measures, which are popular in Machine Learning: namely, the Mercer kernel based Maximum Mean Discrepancy (MMD) metrics and the Kantorovich-Wasserstein metrics.

In the first part of the talk we revise the relevant definitions, summarize basic properties, and discuss ML applications where the metrics lead to state-of-art performance. We then compare and contrast the two families side-by-side and note their complementary properties.

In the second half, we present our recent attempt to bridge the gap between the two by proposing a new family of metrics that seems to achieve the best of the both. We provide some theoretical insights and present empirical evidence to showcase the proposed metric's potential in ML applications.

Prof. Vineeth N Balasubramanian - IIT Hyderabad

Vineeth N Balasubramanian is an Associate Professor in the Department of Computer Science and Engineering at the Indian Institute of Technology, Hyderabad (IIT-H), India, and was also the Founding Head of the Department of Artificial Intelligence at IIT-H. His research interests include deep learning, machine learning, and computer vision. His research has resulted in ~150 peer-reviewed publications at various international venues, including top-tier venues such as ICML, CVPR, NeurIPS, ICCV, KDD, AAAI, and IEEE TPAMI, with Best Paper Awards at recent venues such as CODS-COMAD 2022, CVPR 2021 Workshop on Causality in Vision, etc. His PhD dissertation at Arizona State University on the Conformal Predictions framework was nominated for the Outstanding PhD Dissertation at the Department of Computer Science. He serves as a Senior PC/Area Chair for conferences such as CVPR, ICCV, AAAI, IJCAI, ECCV with recent awards including Outstanding Reviewer at ICLR 2021, CVPR 2019, ECCV 2020, etc. He is also a recipient of the Teaching Excellence Award at IIT-H (2017 and 2021), Google Research Scholar Award (2021), NASSCOM AI Gamechanger Award (2022), Google exploreCSR award (2022), among others. For more details, please see https://iith.ac.in/~vineethnb/.

Title : Causality in Explainable AI

Abstract :

-----------

The need for explainability of Deep Neural Network (DNN) models and the development of AI systems that can fundamentally reason has exponentially increased in recent years, especially with the increasing use of AI/ML models in risk-sensitive and safety-critical applications. Causal reasoning helps identify input variables that cause a certain prediction, rather than merely be correlated, and thus provide useful explanations in practice. Similarly, focusing on causal input-output relationships can help a DNN model generalize to out-of-distribution samples better, where spurious correlations in training data may otherwise mislead a model. This talk will introduce the growing field of explainable AI, summarize existing efforts and focus on one important aspect of causality in DNN models -- the notion of causal attributions between input and output variables of the model. We will do this from two perspectives -- firstly, we will study how one can "deduce" what causal input-output attributions an already-trained DNN model has learned, and provide an efficient mechanism to compute such causal attributions (based on our work published at ICML 2019). Secondly, we will explore the complementary side of this problem on how one can "induce" known prior causal information into DNN models during the training process itself (based on our work published at ICML 2022) . Both of these efforts are derived by a first-principles approach to integrating causal principles into DNN models, and can have significant implications on practice in real-world applications.

Prof. Srijith P. K. - IIT Hyderabad

Dr. Srijith P. K. is an Associate Professor at the Department of Computer Science and Engineering, IIT Hyderabad. He is interested in developing machine learning, deep learning and Bayesian learning models to solve problems arising in various domains of Artificial Intelligence such as vision, natural language processing and social media. He has published papers in top venues such as NeurIPS, AAAI, UAI, ACL, EMNLP, ECML, WACV, UMAP, ASONAM etc. He has been a reviewer for premier ML and NLP conferences such as NeurIPS, ICML, AAAI, UAI, EMNLP etc. and the senior program committee and local organizing chair for ACML. Prior to joining IITH, he worked as a post-doctoral researcher at the University of Melbourne and at the University of Sheffield. Dr. Srijith did his Ph.D. at the department of Computer Science and Automation, Indian Institute of Science, Bangalore. He holds a M.Tech degree in Computer Science from IIT Bombay and a B.Tech degree in Computer Science from NIT Calicut. More details on his research can be found at: https://sites.google.com/site/pksrijith/home.

Title: Continual Deep Learning : Lifelong Learning in Neural Networks

Abstract :

-----------


Humans learn continually throughout their lifespan by accumulating diverse knowledge and using the accumulated knowledge to efficiently learn new tasks without forgetting old tasks. Though deep learning models manage to achieve state-of-the-art performance in several Artificial intelligence tasks, in such continual learning situations and problem setups, they suffer from catastrophic forgetting where it forgets the knowledge gathered from previous tasks after learning a new task. A. deep learning model with a continual learning capability can be beneficial to several applications such as Autonomous driving, healthcare, chatbots etc. In this talk, we motivate and discuss continual learning in deep learning models and briefly mention the techniques to achieve it. We discuss the techniques that we proposed for continual learning (CL) in neural networks using hypernetworks and continual image generation techniques. We demonstrate the effectiveness of these techniques on several real world problems and setups.

Prof. Sri Rama Murty Kodukula - IIT Hyderabad

Sri Rama Murty completed his PhD from IITM in 2009 and joined the Department of Electrical Engineering at IITH. He is currently an Associate Professor in the same department. His research interests include signal processing, speech analysis, machine learning, and deep learning.

Title: A signal processing view of deep architectures

Abstract :

-----------


In the recent past, deep learning architectures have been highly successful in signal processing applications, paving paths for end-to-end systems. It would be interesting to see how these deep architectures are related to the traditional signal processing (filter design) approaches. This talk offers a signal processing view of deep learning models. It is mainly aimed at beginners in understanding the parallels between both approaches. Finally, the application of deep architectures in speech enhancement will be discussed.

Prof. M.V. Panduranga Rao - IIT Hyderabad

M V Panduranga Rao received his BTech from REC Warangal (now NIT Warangal), MTech from IIT Kanpur and PhD from IISc Bangalore. His research interests lie in quantum computing and communications, and formal methods.

Title : Quantum Combinatorial Optimization Algorithms and Applications

Abstract :

-----------


Combinatorial optimization is a problem that is widely studied in the domain of classical computation. Now, it is also receiving a lot of interest from a quantum standpoint. Many quantum algorithmic techniques are being studied, as well as applications. We will review these techniques and applications, including, if time permits, some work being done at IITH.

Dr. Debaditya Roy - A*STAR Singapore

Debaditya Roy is currently a Scientist at the Institute of High-Performance Computing, A*STAR, Singapore. His current research involves predicting human actions for enhanced Human-Robotic Engagement which is funded by AI Singapore. Previously, he was a post-doctoral researcher working at Nihon University, Japan under the M2Smart project to develop AI-based models to study traffic behavior in India. He received his Ph.D. from IIT Hyderabad in 2018 for his thesis "Representation Learning for Action Recognition." His research interests involve computer vision and machine learning with a particular curiosity on how to represent context and environment to learn/reason about human actions even with limited examples which we as humans manage to do effectively.

Title : Predicting Human Behavior - From the Kitchen to the Roads

Abstract :

-----------

In this talk, I will focus on Anticipating Human Behavior.

First, I will talk about action anticipation. The ability to anticipate future actions of humans is useful in application areas such as automated driving, robot-assisted manufacturing, and smart homes. The problem of anticipating human actions is an inherently uncertain one. However, we can reduce this uncertainty if we have a sense of the goal that the actor is trying to achieve. present an action anticipation model that leverages goal information for the purpose of reducing uncertainty in future predictions. So, predicting the next action in the sequence becomes easier once we have an idea about the goal that guides the entire activity. We present an action anticipation model that uses goal information in an effective manner.

Second, I will talk about predicting human driving behavior in traffic especially at intersections as a large proportion of road accidents occur at intersections. Especially, in India where vehicles often ply very close to each other, it is essential to determine collision-prone vehicle behavior. Existing approaches only analyze driving behavior in lane-based driving. We learn the collision propensity of a vehicle from its interaction trajectory. Interaction trajectories encapsulate hundreds of interactions for every vehicle at an intersection. The interaction trajectories that match accident characteristics are labeled as unsafe while the rest are considered safe. We also introduce the first laneless traffic aerial surveillance dataset called SkyEye to demonstrate our results.

Prof. Lalitha Vadlamani - IIIT Hyderabad

Lalitha Vadlamani received her B.E. degree in Electronics and Communication Engineering from the Osmania University, Hyderabad, in 2003 and her M.E. and Ph.D. degrees from the Indian Institute of Science (IISc), Bangalore, in 2005 and 2015 respectively. From May 2015, she is working as Assistant professor in IIIT Hyderabad, where she is affiliated to Signal Processing and Communications Research Center. From 2006 to 2008, she worked as an engineer at Qualcomm, Hyderabad (2006) and design engineer at Conexant Systems, Noida (2006-2008). Prior to joining IIIT Hyderabad, she worked as a research intern at Microsoft Research, Bangalore. Her research interests include coding for distributed storage and computing, index coding, polar codes, learning-based codes and coded blockchains. She is a recipient of Prof. I.S.N. Murthy medal from IISc, 2005 and the TCS Research Scholarship for the year 2011. Her paper won the runner up best paper award at NCC 2019.

Title : Channel Coding: 5G and beyond

Abstract :

-----------

5G standards have stringent requirements with respect to handling very high data rates, low latency and supporting diverse services. One of the use cases of 5G interest is enhanced Mobile Broad Band (eMBB). Channel coding is employed in a communication system in order to mitigate the effects of channel noise and decode the transmitted messages even in the presence of errors. In this talk, we will briefly review two classes of codes used in 5G new radio standard, which are LDPC codes and Polar codes. LDPC codes are a class of codes which can achieve rates quite close to the channel capacity. These codes are characterized by parity check matrices which are sparse in nature. LDPC codes can be decoded using belief propagation algorithms on Tanner graphs. The belief propagation algorithm itself is highly efficient and scales very well with the block length of the code. LDPC codes are used for transmissions on the data channel, where the block lengths are of the order of multiple thousands. Polar codes are the first class of codes which are provably capacity achieving, for a range of channels. Polar codes are designed based a certain polarizing transform. These codes can be encoded using a recursive algorithm since the polarizing transform is a kronecker product of a certain 2 X 2 matrix. Also, decoding of polar codes is performed using successive cancellation technique. Polar codes are used for transmissions on the control channel, where the block length roughly ranges from 256 to 1024. Recently, there have been advances in designing codes and decoders based on deep learning and we will present important results in this area. Also, there have been developments with respect to Reed-Muller codes achieving the capacity of binary symmetric channel. We will also briefly discuss these results.

Prof. Phaneendra K. Yalavarthy - IISc Bangalore

Phaneendra Yalavarthy received B.Sc. and M.Sc. degrees in physics from Sri Sathya Sai University, Puttaparthy, India in 1999 and 2001 respectively. He also obtained a M.Sc. degree in Engineering from Indian Institute of Science, Bangalore, India in 2004. He received a Ph.D., working as a U.S. Department of Defense Breast Cancer Pre-doctoral Fellow, in biomedical computation from Dartmouth College, Hanover, USA in 2007. He worked as a post-doctoral research associate in the Department of Radiation Oncology, School of Medicine, Washington University in St. Louis, USA from 2007-2008. Currently, he is working as a Professor in the Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, India.

Title 1 : Mobile Friendly Deep Learning Algorithms for Medical Image Analysis

Title 2 : The Edge of Artificial Intelligence: Self-driving Medical Care

Abstract 1 :

-----------


Light-weight Convolutional Neural Networks (CNNs) are mobile friendly models that can provide inference without the need for any specialised hardware. These models can be very effective in point-of-care settings, where the detection of disease has to be performed in real-time. The talk will highlight few developments of Medical Imaging Group (MIG) at Indian Institute of Science, especially towards COVID19 management and diagnosis. Deployment of these lightweight networks on embedded platforms to show highly versatility as well as prove optimal performance in terms of being accurate will also be highlighted. The developed models having latency in the same order as other lightweight networks without compromising the accuracy will also be shown.


Abstract 2 :

-----------


There is a lot of similarity between self-driving cars and current AI adoption in medicine, especially in Radiology. Today, AI solutions for radiology are focused towards image enhancement. However, with the advent of innovation and the positive potential impact of AI augmentation, radiologists should expect to see more of automated diagnosis in the near future. The long-term vision for AI in medical imaging should stop at autonomous vehicle level 3 automation—the radiologist will still be in charge of the majority of cases, and AI can help them drive along the "highway" of simple use cases. The talk will focus on current challenges and list opportunities for the researchers.

Prof. C. Krishna Mohan - IIT Hyderabad

Dr. C. Krishna Mohan is currently a Professor in the Department of Computer Science and Engineering and the Dean of Public and Corporate Relations at the Indian Institute of Technology Hyderabad (IIT Hyderabad), India. He has been with IIT Hyderabad since 2009. He was also the Head of the Department of Computer Science and Engineering, IIT Hyderabad from May 2010 till October 2014. Before joining IIT Hyderabad, he was a senior faculty member at the National Institute of Technology Karnataka, Surathkal (NITK Surathkal). He received his M.Tech from NITK Surathkal and Ph.D. from the Indian Institute of Technology Madras (IIT Madras).

Prof Krishna Mohan heads Visual Learning and Intelligence Lab (VIGIL) at IIT Hyderabad working in the research areas of video content analysis, computer vision, machine learning, deep learning. He is a recipient of Excellence in Teaching Award, in recognition of distinguished teaching in the year 2018 at IIT Hyderabad. He has published more than 60 papers in peer-reviewed international journals and conferences proceedings. He has guided several Ph.D., Masters, and Undergraduate students. He has successfully carried out several projects with industry, government agencies, and academia. He serves as a reviewer for several reputed international conferences and IEEE journals and is a Senior Member of IEEE, Member of ACM, AAAI Member, Life Member of ISTE, Fellow of IEI, IETE, and TAS.

Prof. Dinesh Singh - IIT Mandi

Dinesh Singh completed his Ph.D degree in Computer Science and Engineering from IIT Hyderabad under the supervision of Prof. C. Krishna Mohan. He received his M.Tech degree in Computer Engineering from NIT Surat. Prior to joining IIT Mandi, he was working as a Postdoctoral Researcher with Prof. Makoto Yamada at RIKEN Center for Advanced Intelligence Project (AIP), Kyoto University Office, Japan. His research interests include machine learning and computer vision. His research work has been published in reputed journals and prestigious conferences including Pattern Recognition, IEEE TKDE, IEEE Trans BigData, IEEE T-ITS, BMVC, IJCNN, IEEE CEC, and IEEE Big Data. He volunteered at several national and international conferences. He also served as a PC member for IEEE BigData 2018. He is consistently providing reviewing services at top venues in machine learning and computer vision including TPAMI, MLJ, TIP, T-ITS, PR, CVPR, ECCV, ICML, and NeurIPS.

Title : Nys-Newton: Nyström-Approximated Curvature for Stochastic Optimization

Abstract :

-----------

Second-order optimization methods are among the most widely used optimization approaches for convex optimization problems, and have recently been used to optimize non-convex optimization problems such as deep learning models. The widely used second-order optimization methods such as quasi-Newton methods generally provide curvature information by approximating the Hessian using the secant equation. However, the secant equation becomes insipid in approximating the Newton step owing to its use of the first-order derivatives. In this study, we propose an approximate Newton sketch-based stochastic optimization algorithm for large-scale empirical risk minimization. Specifically, we compute a partial column Hessian of size (d × m) with m ≪ d randomly selected variables, then use the Nyström method to better approximate the full Hessian matrix. To further reduce the computational complexity per iteration, we directly compute the update step (∆w) without computing and storing the full Hessian or its inverse. We then integrate our approximated Hessian with stochastic gradient descent and stochastic variance-reduced gradient methods. The results of numerical experiments on both convex and non-convex functions show that the proposed approach was able to obtain a better approximation of Newton's method, exhibiting performance competitive with that of state-of-the-art first-order and stochastic quasi-Newton methods. Furthermore, we provide a theoretical convergence analysis for convex functions.

Prof. Abhinav Kumar - IIT Hyderabad

Dr. Abhinav Kumar received the BTech+MTech (Dual Degree) and PhD degree in Electrical Engineering from the Indian Institute of Technology Delhi, in 2009 and 2013, respectively. From September to November, 2013, he was a research associate in the Indian Institute of Technology Delhi. From December 2013 to November 2014, he was a postdoctoral fellow at the University of Waterloo, Canada. Since November 2014, he has been with Indian Institute of Technology Hyderabad, India, where he is currently an Associate Professor. His research interests are in the different aspects of Machine Learning, Wireless Communications and Networking. He is a Senior member of IEEE.

Title : ML for Target Classification and Localization using Wireless Networks

Abstract :

-----------


We will discuss various applications of machine learning in wireless networks for classification and localization. This includes using existing wireless networks like Wi-Fi for 3-dimensional (3D) indoor localization. State-of-the-art multi-building 3D indoor Wi-Fi localization schemes. A proposed CNN based scheme that performs significantly better than the state-of-the-art schemes based on openly available Wi-Fi datasets. Other works like machine learning for target classification by mmWave RADARs, vehicle detection using LTE signals, etc.