All times are in IST
05:00 PM - 05:30 PM
Vishnu Veerathu
Research Engineer, Cohesity
Title: The Structure of Parameterized Tournaments with Application to Ranking from Pairwise Comparisons (NeurIPS 2021)
Abstract: We consider the classical problem of finding the minimum feedback arc set on tournaments (MFAST). The problem is NP-hard in general and we study it for important classes of tournaments that arise naturally in the problem of learning to rank from pairwise comparisons. Specifically, we consider tournaments classes that arise out of parametric preference matrices that can lead to cyclic preference relations. The parametric assumption we make is a limit on the rank of the true preference matrix under a suitable link function. We investigate the structural properties of the tournaments induced by rank 2 preference matrices via forbidden sub tournament configurations. These properties are used to prove the optimality of our proposed ranking algorithm, Rank2Rank(R2R), on the rank 2 class of tournaments. We then propose the Block-Rank 2 tournament class and demonstrate its usefulness in modelling real world data. R2R can be used as a subroutine to develop another ranking algorithm, BlockRank2Rank(BR2R), which is optimal for the Block-Rank 2 tournament class. As an application, we study the problem of learning to rank from pairwise comparisons under the proposed model. Exploiting our structural characterization, we propose PairwiseBlockRank - a pairwise ranking algorithm for this class. We provide sample complexity bounds and performance comparisons with other popular ranking algorithms to show the efficacy of the proposed algorithm.
Reference: Vishnu Veerathu and Arun Rajkumar. (2021) “On The Structure of Parametric Tournaments with Application to Ranking from Pairwise Comparisons”. To appear in Thirty-fifth Annual Conference on Neural Information Processing Systems (NeurIPS 2021).
2. 05:30 PM - 06:00 PM
Dev Yashpal Sheth
Dual Degree Student, Department of Computer Science & Engg., IIT Madras
Title: Unsupervised Deep Video Denoising (ICCV 2021)
Abstract: Deep convolutional neural networks (CNNs) for video denoising are typically trained with supervision, assuming the availability of clean videos. However, in many applications, such as microscopy, noiseless videos are not available. To address this, we propose an Unsupervised Deep Video Denoiser (UDVD), a CNN architecture designed to be trained exclusively with noisy data. The performance of UDVD is comparable to the supervised state-of-the-art, even when trained only on a single short noisy video. We demonstrate the promise of our approach in real-world imaging applications by denoising raw video, fluorescence-microscopy, and electron-microscopy data. In contrast to many current approaches to video denoising, UDVD does not require explicit motion compensation. This is advantageous because motion compensation is computationally expensive, and can be unreliable when the input data are noisy. A gradient-based analysis reveals that UDVD automatically tracks the motion of objects in the input noisy videos. Thus, the network learns to perform implicit motion compensation, even though it is only trained for denoising.
Reference: Sheth, D. Y., Mohan, S., Vincent, J. L., Manzorro, R., Crozier, P. A., Khapra, M. M., ... & Fernandez-Granda, C. (2021). Unsupervised deep video denoising. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 1759-1768). PDF
3. 06:00 PM - 06:30 PM
Ananya B. Sai
PhD Scholar, Department of Computer Science & Engineering, IIT Madras
Title: Perturbation CheckLists for Evaluating NLG Evaluation Metrics (EMNLP 2021)
Abstract: Natural Language Generation (NLG) evaluation is a multifaceted task requiring the assessment of multiple desirable criteria, e.g., fluency, coherency, coverage, relevance, adequacy, overall quality, etc. However, the majority of the existing automatic evaluation metrics provide just a single overall score. In this talk, we will first discuss an overview of the techniques used for formulating automatic evaluation metrics for NLG. Across existing datasets for 6 NLG tasks, we observe that the human evaluation scores on these multiple criteria are often not correlated. For example, there is a very low correlation between human scores on fluency and data coverage for the task of structured data to text generation. This suggests that the current recipe of proposing new automatic evaluation metrics for NLG by showing that they correlate well with scores assigned by humans for a single criteria (overall quality) alone is inadequate. Indeed, we find that there is no single metric that correlates well with human scores on all desirable criteria, for most NLG tasks. Given this situation, we propose CheckLists for better design and evaluation of automatic metrics. We design templates that target specific criteria (e.g., coverage) and perturb the output such that the quality gets affected only along with these specific criteria (e.g., the coverage drops). We show that existing evaluation metrics are not robust against even such simple perturbations and disagree with scores assigned by humans to the perturbed output. The proposed templates thus allow for a fine-grained assessment of automatic evaluation metrics exposing their limitations and will facilitate better design, analysis, and evaluation of such metrics.
Reference: Ananya B. Sai, Tanay Dixit, Dev Sheth, Sreyas Mohan and Mitesh M. Khapra: Perturbation CheckLists for Evaluating NLG Evaluation Metrics. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2021). PDF
4. 06:30 PM - 07:00 PM
Dev Yashpal Sheth
Dual Degree Student, Department of Computer Science & Engg., IIT Madras
Title: PARWiS: Winner determination from Active Pairwise Comparisons under a Shoestring Budget (ICDM 2021)
Abstract: We consider the problem of determining a winner among a set of n items by actively comparing pairs of items. We focus on a practical scenario where we are given a shoestring budget (say cn for a small c such as 2 or 3) where the usual sample complexity bounds for winner recovery become useless. We focus on the Bradley-Terry-Luce model for noisy comparisons and propose PARWiS, a novel algorithm for Pairwise Active Recovery of Winner under a Shoestring budget. Our algorithm is based on an active version of a spectral ranking procedure combined with a pair selection procedure. We consider several natural disruption measures for the pair selection procedure and show that they are theoretically equivalent to moving along certain gradient directions of popular estimation objectives. We perform extensive synthetic and real-world experiments to show that our algorithm recovers the item at the top effectively with shoestring budgets and outperforms several state-of-the-art algorithms.
Reference: Dev Yashpal Sheth and Arun Rajkumar: PARWiS: Winner determination from Active Pairwise Comparisons under a Shoestring Budget, To appear as a full paper at (ICDM 2021). PDF
07:00 PM - 07:10 PM
BREAK
07:10 PM - 07:30 PM
Bharat Manvi
MS Scholar, Department of Electrical Engineering, IIT Madras
Title: Simultaneous beamforming and trajectory tracking in a multi-agent formation
Abstract: This work considers a multi-agent system in which agents are unmanned aerial vehicles(UAVs) and collect information related to applications such as surveillance, defense, etc.. The transmitting antennas present on the agents send the collected information to a distant receiver. Similarly, the receiving antennas present on the agents receive commands to operate, from a distant transmitter. The quality of signal measured by the signal-to-noise ratio (SNR) can be enhanced if the agents cooperate to transmit or receive the signal. Hence when the agents are in a regular polygonal formation, they have the desired beam pattern with maximum transmission axis (MTA) or maximum reception axis (MRA). As the centroid of the polygonal formation follows a given trajectory, the MTA/MRA needs to be aligned towards the receiver or transmitter, respectively. Hence we propose a cascade of formation control, which drives the agents to the desired formation and a geometric control to align the MTA/MRA continuously as the centroid of the formation tracks a smooth trajectory. We show that the equilibrium of the cascaded closed-loop system is locally exponentially stable. The cascaded control designed for the agents with a double integrator model is extended to the quadrotors, and the cascaded control law is validated by implementing it on a quadrotor hardware setup.
Reference: Manvi, B., Bhikkaji, B., & Mahindrakar, A. D. (2021, June). “Simultaneous beamforming and trajectory tracking in a multi-agent formation.” In 2021 29th Mediterranean Conference on Control and Automation (MED) (pp. 1114-1119). IEEE.
Karthik. V
MS Student, University of Illinois Urbana-Champaign
Title: A Joint Training Framework for Open-World Knowledge Graph Embeddings
Abstract: Knowledge Graphs(KGs) represent factual information as graphs of entities connected by relations. Knowledge graph embeddings have emerged as a popular approach to encode this information for various downstream tasks like natural language inference, question answering, and dialogue generation. As knowledge bases expand, we are presented with newer (open-world) entities, often with textual descriptions. We require techniques to embed new entities as they arrive using the textual information at hand. This task of open-world KG completion has received some attention in recent years. However, we find that existing approaches suffer from one or more of four drawbacks – 1) They are not modular with respect to the choice of the KG embedding model 2) They ignore best practices for aligning two embedding spaces 3) They do not account for differences in training strategy needed when presented with datasets with different description sizes and 4) They do not produce entity embeddings for use by downstream tasks. To address these problems, we propose FOlK (Framework for Open-World KG embeddings) - a technique that jointly learns embeddings for KG entities from descriptions and KG structure for open-world knowledge graph completion. Additionally, we modify existing data sources and make available YAGO3-10- Open and WN18RR-Open two datasets that are well suited for demonstrating the efficacy of open-world KG completion approaches. Finally, we empirically demonstrate the effectiveness of our model in improving upon state-of-the-art baselines on several tasks resulting in performance increases of up to 72% on mean reciprocal rank.
Reference: Karthik, V., Tripathi, B., Khapra, M. M., & Ravindran, B. (2021, June). A Joint Training Framework for Open-World Knowledge Graph Embeddings. In 3rd Conference on Automated Knowledge Base Construction. (AKBC 2021). PDF
Rajan Kumar Soni
MS Scholar, Department of Computer Science & Engg., IIT Madras
Title: Metric Learning for comparison of HMMs using Graph Neural Networks
Abstract: Hidden Markov models (HMMs) belong to the class of double embedded stochastic models which were originally leveraged for speech recognition and synthesis. HMMs subsequently became a generic sequence model across multiple domains like NLP, bio-informatics and thermodynamics to name a few. Literature has several heuristic metrics to compare two HMMs by factoring in their structure and emission probability distributions in HMM nodes. However, typical structure-based metrics overlook the similarity between HMMs having different structures yet similar behavior and typical behavior-based metrics rely on the representativeness of the reference sequence used for assessing the similarity in behavior. Further, little exploration has taken place in leveraging the recent advancements in deep graph neural networks for learning effective representations for HMMs. In this paper, we propose two novel deep neural network based approaches to learn embeddings for HMMs and evaluate the validity of the embeddings based on subsequent clustering and classification tasks. Our proposed approaches use a Graph variational Autoencoder and diffpooling based Graph neural network (GNN) to learn embeddings for HMMs. The graph autoencoder infers latent low-dimensional flat embeddings for HMMs in a task-agnostic manner; whereas the diffpooling based graph neural network learns class-label aware embeddings by inferring and aggregating a hierarchical set of clusters and sub-clusters of graph nodes. Empirical results reveal that the HMM embeddings learnt through the Graph variational autoencoders and diffpooling based GNN outperform the popular heuristics as measured by the cluster quality metrics and the classification accuracy in downstream tasks.
Reference: Soni, R. K., Seshadri, K., and Ravindran, B. (2021) "Metric Learning for comparison of HMMs using Graph Neural Networks". To appear in the Proceedings of the Thirteenth Asian Conference on Machine Learning (ACML 2021). PMLR.
Pavan Ravishankar
PhD Scholar, Courant Institute of Mathematical Sciences, NYU
Title: A Causal Approach for Unfair Edge Prioritization and Discrimination Removal
Abstract: In budget-constrained settings aimed at mitigating unfairness, like law enforcement, it is essential to prioritize the sources of unfairness before taking measures to mitigate them in the real world. Unlike previous works, which only serve as a caution against possible discrimination and de-bias data after data generation, this work provides a toolkit to mitigate unfairness during data generation, given by the Unfair Edge Prioritization algorithm, in addition to de-biasing data after generation, given by the Discrimination Removal algorithm. We assume that a non-parametric Markovian causal model representative of the data generation procedure is given. The edges emanating from the sensitive nodes in the causal graph, such as race, are assumed to be the sources of unfairness. We first quantify Edge Flow in any edge X --> Y, which is the belief of observing a specific value of Y due to the influence of a specific value of X along X --> Y. We then quantify Edge Unfairness by formulating a non-parametric model in terms of edge flows. We then prove that cumulative unfairness towards sensitive groups in a decision, like race in a bail decision, is non-existent when edge unfairness is absent. We prove this result for the non-trivial non-parametric model setting when the cumulative unfairness cannot be expressed in terms of edge unfairness. We then measure the Potential to mitigate the Cumulative Unfairness when edge unfairness is decreased. Based on these measurements, we propose the Unfair Edge Prioritization algorithm that can then be used by policymakers. We also propose the Discrimination Removal Procedure that de-biases a data distribution by eliminating optimization constraints that grow exponentially in the number of sensitive attributes and values taken by them. Extensive experiments validate the theorem and specifications used for quantifying the above measures.
Reference: Ravishankar, P., Malviya, P., and Ravindran, B. (2021) "A Causal Approach for Unfair Edge Prioritization and Discrimination Removal". To appear in the Proceedings of the Thirteenth Asian Conference on Machine Learning (ACML 2021). PMLR.
5. 07:30 PM - 08:00 PM
Anasua Mitra
PhD Scholar, Indian Institute of Technology, Guwahati
Title: Semi-Supervised Deep Learning for Multiplex Networks (KDD 2021)
Abstract: Multiplex networks are complex graph structures in which a set of entities are connected to each other via multiple types of relations, each relation representing a distinct layer. Such graphs are used to investigate many complex biological, social, and technological systems. In this work, we present a novel semi-supervised approach for structure-aware representation learning on multiplex networks. Our approach relies on maximizing the mutual information between local node-wise patch representations and label correlated structure-aware global graph representations to model the nodes and cluster structures jointly. Specifically, it leverages a novel cluster-aware, node-contextualized global graph summary generation strategy for effective joint-modeling of node and cluster representations across the layers of a multiplex network. Empirically, we demonstrate that the proposed architecture outperforms state-of-the-art methods in a range of tasks: classification, clustering, visualization, and similarity search on seven real-world multiplex networks for various experiment settings.
Reference: Mitra, A., Vijayan, P., Sanasam, R., Goswami, D., Parthasarathy, S., & Ravindran, B. (2021, August). Semi-Supervised Deep Learning for Multiplex Networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (pp. 1234-1244). PDF
6. 08:00 PM - 08:30 PM
Rohan Saphal
Applied Research Scientist, JPMorgan Chase & Co.
Title: SEERL: Sample Efficient Ensemble Reinforcement Learning (AAMAS 2021)
Abstract: Ensemble learning is a very prevalent method employed in machine learning. The relative success of ensemble methods is attributed to their ability to tackle a wide range of instances and complex problems that require different low-level approaches. However, ensemble methods are relatively less popular in reinforcement learning owing to the high sample complexity and computational expense involved in obtaining a diverse ensemble. We present a novel training and model selection framework for model-free reinforcement algorithms that use ensembles of policies obtained from a single training run. These policies are diverse in nature and are learned through directed perturbation of the model parameters at regular intervals. We show that learning and selecting an adequately diverse set of policies is required for a good ensemble while extreme diversity can prove detrimental to overall performance. The selection of an adequately diverse set of policies is done through our novel policy selection framework. We evaluate our approach to challenging discrete and continuous control tasks and also discuss various ensembling strategies. Our framework is substantially sampled efficient, computationally inexpensive, and is seen to outperform state-of-the-art (SOTA) scores in Atari 2600 and Mujoco.
Reference: Saphal, R., Ravindran, B., Mudigere, D., Avancha, S., & Kaul, B. (2021). SEERL: Sample Efficient Ensemble Reinforcement Learning. In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (pp. 1100–1108). (AAMAS 2021) PDF
7. 08:30 PM - 09:00 PM
Harsha Kokel
PhD Scholar, The University of Texas at Dallas
Title: RePReL: Integrating Relational Planning and Reinforcement Learning for Effective Abstraction (ICAPS 2021)
Abstract: State abstraction enables sample-efficient learning and better task transfer in complex reinforcement learning environments. State abstraction is especially beneficial in relational settings, where the number and/or types of objects are not fixed apriori. Inspired by the benefits of state abstraction in hierarchical planning and learning, we recently proposed a framework that can obtain task-specific state abstractions from humans. I will first briefly present the proposed RePReL model, a hierarchical framework that integrates relational planning and reinforcement learning. Then demonstrate how we use an SRL model to obtain the prior domain knowledge from humans that provides effective task-specific state abstractions.
Reference: Kokel, H., Manoharan, A., Natarajan, S., Ravindran, B., & Tadepalli, P. (2021, May). RePReL: Integrating Relational Planning and Reinforcement Learning for Effective Abstraction. In Proceedings of the International Conference on Automated Planning and Scheduling (Vol. 31, pp. 533-541). PDF