Scalable Deep Learning: From Theory to Practice (Full-Day)
- Decebal Constantin Mocanu (Eindhoven University of Technology)
- Elena Mocanu (University of Twente; Eindhoven University of Technology)
- Phuong H. Nguyen (Luxembourg Institute of Science and Technology; Eindhoven University of Technology)
- Madeleine Gibescu (Utrecht University)
- Zita Vale (Polytechnic Institute of Porto)
- Damien Ernst (Université de Liège)
A fundamental task for Multi-Agent Systems (MAS) is learning. Deep Neural Networks (DNNs) have proven to cope perfectly with all learning paradigms, i.e. supervised, unsupervised, and reinforcement learning. Nevertheless, traditional deep learning approaches make use of cloud computing facilities and do not scale well to autonomous agents with low computational resources. Even in the cloud they suffer from computational and memory limitations and cannot be used to model properly large physical worlds for agents which assume networks with billion of neurons. These issues were addressed in the last few years by the emerging topics of scalable and efficient deep learning. The tutorial covers these topics focusing on theoretical advancements, practical applications, and hands-on experience, in two parts:
Part I - Scalable Deep Learning: from pruning to evolution. The first part of the tutorial focuses on theory. We first revise how many agents make use of deep neural networks nowadays. We then introduce the basic concepts of neural networks and we draw a parallel between artificial and biological neural networks from a functional and topological perspective. We continue by introducing the first papers on efficient neural networks coming from early '90s which make use either of sparsity enforcing penalties or weights pruning of fully connected networks based on various saliency criterion. Afterwards, we review some of the recent works which start from fully connected networks and make use of prune-retrain cycles to compress deep neural networks and to make them more efficient in the inference phase. We then discuss an alternative approach, i.e. NeuroEvolution of Augmenting Topologies (NEAT) and its follow-ups, to grow efficient deep neural networks using evolutionary computing. Further on, we introduce the topic of Scalable Deep Learning (SDL) which builds on efficient deep learning and put together all of the above. Herein, we discuss how DNNs are trained using the new proposed Sparse Evolutionary Training (SET) algorithm. SET-DNNs start from random sparse networks and use an evolutionary process to adapt their sparse connectivity to the data while learning. SET-DNNs offer benefits in both phases, training and inference, having quadratically lower memory-footprints and much faster running time then their fully connected counterparts. These make them the perfect match for autonomous agents or for the modeling of large physical environments which need millions (or perhaps billions) of neurons. Up to now, everything is discussed in the context of supervised and unsupervised learning. We conclude the first part of this tutorial by introducing deep reinforcement learning and by paving the ground for scalable deep reinforcement learning. We describe some very recent progresses in the field of deep reinforcement learning that could be used to foster the performances of reinforcement learning agents when confronted with environments that can exhibit sudden changes in their dynamics, as it is often the case with energy systems.
Part II - Scalable Deep Learning: agents in smart grids. The second part of the tutorial focuses on practical applications. Distributed generation, demand response, distributed storage, and electric vehicles are bringing new challenges to the power and energy sector. The tutorial addresses the current and envisioned solutions for the management of these distributed energy resources in the context of smart grids. Artificial intelligence based approaches bring important new possibilities enabling efficient individual and aggregated energy management. Such approaches can provide different players aiming to accomplish individual and common goals in the frame of a market-driven environment with advanced decision-support and automated solutions. The first presentation in the afternoon session is concluded with the description of MARTINE (Multi-Agent based Real-Time INfrastruture for Energy), a platform to support real-time energy management and simulation of buildings and smart grids. The platform will be used as the basis to present different data-driven and cognitive approaches to support efficient energy management in buildings and smart grids. Overall, in the multi-agents settings we analyze the opportunity of using different types of strategies (e.g centralized versus decentralized, cooperative versus noncooperation, and so on). Through the end, we will argue that reinforcement learning paradigm can be very powerful to solve many decision-making problems in the energy sector, as for example investment problems, the design of bidding strategies for playing with the intraday electricity market or problems related to the control of microgrids. The last presentation define the resource allocation problems as a sequential stochastic decision-making process in multi-agent learning, that considers scalable and efficient deep reinforcement learning agents. We investigate how multiple learning agents interact and influence each other in the smart grid context, what kind of global system dynamics arise, and how desired electrical behaviour can be obtained by modifying the learning algorithms used. The settings considered, range from one-on-one interactions (e.g. games) to small groups (e.g. multi-agent coordination) and large communities (e.g. interactions in social networks).
After the tutorial, the participants shall have: a basic understanding of scalable deep neural networks for MAS learning, of MAS in the smart grid context; basic hands-on experience to use these concepts in various practical applications; and some good thoughts for future research directions.
Room: MB 3.445
- Slides will be available here.
Organizers' Short Bios
- Decebal Constantin Mocanu is an Assistant Professor in Artificial Intelligence and Machine Learning within the Data Mining group, Department of Mathematics and Computer Science, Eindhoven University of Technology (TU/e) since September 2017, and a member of TU/e Young Academy of Engineering. His short-term research interest is to conceive scalable deep artificial neural network models and their corresponding learning algorithms using principles from network science, evolutionary computing, optimization and neuroscience. Such models shall have sparse and evolutionary connectivity, make use of previous knowledge, and have strong generalization capabilities to be able to learn, and to reason, using few examples in a continuous and adaptive manner. Most science carried out throughout human evolution uses the traditional reductionism paradigm, which even if it is very successful, still has some limitations. Aristotle wrote in Metaphysics ``The whole is more than the sum of its parts''. Inspired by this quote, in long term, Decebal would like to follow the alternative complex systems paradigm and study the synergy between artificial intelligence, neuroscience, and network science for the benefits of science and society. In 2017, Decebal received his PhD in Artificial Intelligence and Network Science from TU/e. During his doctoral studies, Decebal undertook three research visits: the University of Pennsylvania (2014), Julius Maximilian University of Wurzburg (2015), and University of Texas, Austin (2016). Prior to this, in 2013, he obtained his MSc in Artificial Intelligence from Maastricht University. During his master studies, Decebal also worked as a part time software developer at We Focus BV in Maastricht. In the last year of his master studies, he also worked as an intern at Philips Research in Eindhoven, where he prepared his internship and master thesis projects. Decebal obtained his Licensed Engineer degree from University Politehnica of Bucharest. While in Bucharest, between 2001 and 2010, Decebal started MDC Artdesign SRL (a software house specialized in web development), worked as a computer laboratory assistant at the University Nicolae Titulescu, and as a software engineer at Namedrive LLC.
- Elena Mocanu is an Assistant Professor in Machine Learning within the Data Science group at University of Twente and a researcher at Eindhoven University of Technology. She received her B.Sc. degree in Mathematics and Physics from Transilvania University of Brasov, Romania, in 2004. After four years as mathematics and physics teacher at high-school level, Elena moved to the university. She has been an Assistant Lecturer within the Department of Information Technology, University of Bucharest, Romania from September 2008 to January 2011. In parallel, in 2009 she started a master program in Theoretical Physics. In 2011 she obtained the M.Sc. degree with specialization in Quantum Transport from University of Bucharest, Romania. In January 2011, Elena moved from Romania to Netherlands. She obtained the M.Sc. degree in Operations Research from Maastricht University, The Netherlands, in 2013. In the last year of her master program, she did a six months Internship at Mastricht University on bioinformatics data analytics research and a six months graduation project at NXP Semiconductors, Eindhoven. In her master thesis she has investigated deep learning methods for "People detection for building automation". In October 2013, Elena started her PhD research in Machine Learning and Smart Grids at TU/e. In January 2015 she performed a short research visit at the Technical University of Denmark and, from January to April 2016 she was a visiting researcher at University of Texas at Austin, USA. In 2017, Elena received her Doctor of Philosophy degree in Machine learning and Smart Grids from TU/e.
- Phuong H. Nguyen (M’06) received the Ph.D. degree from the Eindhoven University of Technology, the Netherlands in 2010. Before joining the Environmental Research and Innovation (ERIN) department of the Luxembourg Institute of Science and Technology (LIST) in 2019 as the group leader of the Group Sustainable Energy Systems (SES), Dr Phuong Nguyen has been holding an assistant professor position with tenure at the Electrical Energy System (EES) group, Eindhoven University of Technology (TU/e). He was a visiting researcher with the Real-Time Power and Intelligent Systems (RTPIS) Laboratory, Clemson University, USA, in 2012 and 2013. Dr Phuong Nguyen has committed his research effort to realize synergies of advanced monitoring and control functions for the distribution networks along with emerging digital technologies. This distinctive combination of competences allows him to develop a research pathway crossing over various domains of mathematical programming, stochastics, data mining, and communication networks. His research of interests includes data analytics with deep learning, real-time system awareness using (IoT) data integrity, as well as predictive and corrective grid control functions.
- Madeleine Gibescu is a full professor at Utrecht University. Madeleine received her Dipl.Eng. in Power Engineering from the University Politehnica (Bucharest, Romania) in 1993. She then moved to the University of Washington (Seattle, WA, USA) where she obtained her MSEE and PhD degrees in 1995 and 2003, respectively. She has worked as a Research Engineer for ClearSight Systems, and as a Power Systems Engineer for the Alstom Grid (Washington, USA). In 2007, she was appointed Assistant Professor at the Electrical Sustainable Energy Department of Delft University of Technology (The Netherlands). In 2013 she was appointed Associate Professor with the research group Electrical Energy Systems at Eindhoven University of Technology (The Netherlands). In 2018, she was appointed Full Professor withing the Copernicus Institute of Sustainable Development at Utrecht University.
- Zita Vale is full professor at the Polytechnic Institute of Porto and the director of the Research Group on Intelligent Engineering and Computing for Advanced Innovation and Development (GECAD). She received her diploma in Electrical Engineering in 1986 and her PhD in 1993, both from University of Porto. Zita Vale works in the area of Power and Energy Systems, with special interest in the application of Artificial Intelligence techniques. She has been involved in more than 50 funded projects related to the development and use of Knowledge-Based systems, Multi-Agent systems, Genetic Algorithms, Neural networks, Particle Swarm Intelligence, Constraint Logic Programming and Data Mining. Energy resources management, distributed generation, demand response and electric vehicles are important topics of her research in the current projects. The main application fields of these projects comprise: (1) Smart Grids, accommodating an intensive use of Renewable Energy Sources, Distributed Energy Resources (DER) and Distributed Generation (DG). She addresses the management of energy resources, the impact of DER on electrical networks, the negotiation of DER in electricity markets, demand response, storage, energy management in buildings, and electrical vehicles, including the ones with gridable capability (V2G); (2) Electricity markets, addressing contracts, prices and tariffs, decision-support for market participants, aggregation, ancillary services, and wholesale and local market simulation; and (3) Control Center applications, namely intelligent alarm processing, intelligent interfaces and intelligent tutors. Zita published over 800 works, including more than 100 papers in international scientific journals, and more than 500 papers in international scientific conferences. She has surprised 17 PhD concluded thesis, and is currently supervising 8 PhD students. Se has also supervised 45 MSc concluded theses, and is currently surpervising 10 MSc thesis.
- Damien Ernst graduated in 1998 from the University of Liège. He did his master thesis on electricity networks, focusing on loss of synchronism phenomena that can lead to blackouts in a matter of seconds. He changed direction for his doctoral thesis (which he defended in 2003), to focus on reinforcement learning. ``However, I used reinforcement learning techniques to solve control and decision-making problems in electricity networks. After my thesis, I continued on that path and I have published a great deal on the subject.'' This was during his three-year post-doctoral research, funded by the FNRS and spent at CMU and MIT in the USA, and at ETH in Zurich. Damien Ernst then headed to France, where he was a professor at Supélec, an engineering school in Rennes. ``2006 was a great year,'' he recalls, ``mainly because of my contacts with the French industry.'' He then returned to the FNRS (2007-2011) as a research associate, where he continued his work on reinforcement learning. During the period 2011-2016, he has held the EDF-Luminus Chair on Smart Grids at the University of Liège. ``Thanks to this chair, I've learned a great deal from working more closely with industry. It's given me a more complete, and less academic vision of energy systems. I've also been able to develop great research projects to facilitate the integration of renewable energy into electrical networks.'' Damien Ernst is now working at the University of Liège, as a full professor. He is doing research in the fields of energy and Artificial Intelligence.
- Decebal Constantin Mocanu, Elena Mocanu, Peter Stone, Phuong H. Nguyen, Madeleine Gibescu, and Antonio Liotta. 2018. Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science. Nature Communications 9, https://www.nature.com/articles/s41467-018-04316-3
- Decebal Constantin Mocanu and Elena Mocanu. 2018. One-Shot Learning Using Mixture of Variational Autoencoders: A Generalization Learning Approach. In Proceedings of the 17th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS ’18), https://arxiv.org/abs/1804.07645
- Elena Mocanu, Decebal Constantin Mocanu, Phuong. H. Nguyen, Antonio Liotta, Michael E. Webber, Madeleine Gibescu, and J. G. Slootweg. 2018. On-line Building Energy Optimization using Deep Reinforcement Learning. IEEE Transactions on Smart Grid, https://ieeexplore.ieee.org/document/8356086
- Shiwei Liu, Decebal Constantin Mocanu, Amarsagar Reddy Ramapuram Matavalam, Yulong Pei, Mykola Pechenizkiy, 2019, Sparse evolutionary Deep Learning with over one million artificial neurons on commodity hardware, https://arxiv.org/abs/1901.09181