Readings

Paper presentation & scribe information

In the first class period, we have handed out paper cards to assign paper presentations to students for the entire semester. Here are a few important administrative notes:

Please let us know your paper assignments: This google doc keeps track of which students hold which paper assignments. Please update this document with your assignments.
If you are registered for the course and did NOT get paper assignments, please email the instructors at akbc-692a-organizers@iesl.cs.umass.edu
If you are dropping the course and have paper assignments, please remove yourself from the assignment google doc and arrange a time to return the green paper cards by emailing akbc-692a-organizers@iesl.cs.umass.edu.
Each student should be assigned 2-3 presentations (no more than two fifteen minute presentations)
If you are trading paper assignments with another student, all that you need to do is update the google doc. Nothing further is required (e.g. emailing instructors).
Each paper assignment card points to a particular paper assignment using a code of the format [week_id:paper_id]. Each lecture has its corresponding [week_id] in square braces. The assigned paper codes are listed in the schedule. PLEASE ensure you've located the dates of your presentation. For example, "[Type+Link:P2]" corresponds to "[Type+Link:P2] DeepType: Multilingual Entity Linking by Neural Type System Evolution. Jonathan Raiman, Olivier Raiman · AAAI · 2018" on the schedule page which is on 09.20.2019. Note that spotlight presentation cards indicate the week_id in square braces and are of the format, [Type+Link] Spotlight 1, [Type+Link] Spotlight2, [Type+Link] Spotlight 3 for the three spotlight presentations in the [Type+Link] week on 9.20.
Please inform your classmates (and instructors) of your spotlight paper selection: by adding it to paper assignment google doc.
Please ensure you don't select a spotlight paper that someone else has already selected.

As described in the first meeting, there are two kinds of presentations:

15-minute presentations -
- A pair of students will work together to prepare and deliver a 15 minute presentation about a research paper or papers (in some cases two papers are assigned).
- This presentation should assume that other students in the course have read the paper.
- Interactive class exercises as part of presentation are encouraged.
- The structure of the presentation is flexible, students should cover the research questions addressed by each paper, the contributions of the paper, methodologies introduced by the paper, empirical results, related work, future work, work impacted by the paper, etc.
3-minute spotlight presentations -
- A single student will deliver a 3-minute presentations on the presenter's choice of paper from the suggested reading of the given week.
- The spotlight presentation is meant to highlight contributions of the paper and inform other students about the main ideas of the work.
- Check this paper assignment google doc to make sure you haven't selected paper that others have selected.

In addition to presenting papers, students will be assigned to scribe duties. Scribe notes are taken individually and are shared on the course website. These notes should provide a written summary of the assigned presentation. This google doc stores scribe assignments. Here are example scribe notes: example 1, example 2, example 3. Note that the topics of these examples may be more technical and broad than some of the topics discussed in this class. It is fine to have notes that are more brief than these examples if more detail is not required to convey the subject matter.

Scribe notes are due within 1 week of the lecture and should be submitted here: https://docs.google.com/forms/d/e/1FAIpQLSc4N4vcebAMN6__jMoceKLNidlfgkgYiDsye6pOBzGmVZwCrA/viewform?usp=sf_link
If you cannot use Overleaf to typeset your notes, please email the instructors.
Link to latex template: https://drive.google.com/file/d/1aAoIrLSHTTDXk7tHKNrK5IlTC_8djbrV/view?usp=sharing

Additions? Corrections? Changes?

If you have a paper you would like to add to this list or have found a mistake in the information in this list, please email akbc-692a-organizers@iesl.cs.umass.edu

Meeting 1 (9.6.2019)

Introduction to KBs, automated methods for KB construction

Meeting 2 (9.13.2019)

[KR] Knowledge Representations; Course Project Overview

Assigned Reading

[KR:P1] Cyc: toward programs with common sense. Douglas B. Lenat, Ramanathan V. Guha, Karen Pittman, Dexter Pratt, Mary Shepherd. Commun. ACM. 1990
[KR:P2] Learning distributed representations of concepts. Geoffrey Hinton. Proceedings of the eighth annual conference of the cognitive science society. 1986.
[KR:P3] Identity. Stanford Encyclopedia of Philosophy. https://plato.stanford.edu/entries/identity/

Cyc Topic Map

Datalog Clause

Meeting 3 (9.20.2019)

[Type+Link] Mention Segmentation, Entity Typing, Entity Linking

Assigned Reading

[Type+Link:P1] Neural Architectures for Named Entity Recognition. Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, Chris Dyer · HLT-NAACL · 2016 and Design Challenges and Misconceptions in Named Entity Recognition. Lev-Arie Ratinov, Dan Roth · CoNLL · 2009
[Type+Link:P2] DeepType: Multilingual Entity Linking by Neural Type System Evolution. Jonathan Raiman, Olivier Raiman · AAAI · 2018
[Type+Link:P3] Deep Joint Entity Disambiguation with Local Neural Attention. Octavian-Eugen Ganea, Thomas Hofmann · EMNLP · 2017

LSTM-CRF

DeepType Entity Linking

Meeting 4 (9.27.2019)

[Clustering] Entity Resolution & Clustering

Assigned Reading

[Clustering:P1] End-to-end Neural Coreference Resolution. Kenton Lee, Luheng He, Mike Lewis, Luke Zettlemoyer. EMNLP. 2017
[Clustering:P2] Robust Entity Clustering via Phylogenetic Inference. Nicholas Andrews, Jason Eisner, Mark Dredze. ACL 2014
[Clustering:P3] Affinity Clustering: Hierarchical Clustering at Scale. Mohammadhossein Bateni, Soheil Behnezhad, Mahsa Derakhshan, MohammadTaghi Hajiaghayi, Raimondas Kiveris, Silvio Lattanzi, Vahab Mirrokni. NeurIPS 2017.

Entity Resolution & Clustering

Within Document Coreference

End-to-end Deep Reinforcement Learning Based Coreference Resolution. Hongliang Fei, Xu Li, Dingcheng Li, Ping Li. ACL 2019.
Higher-order Coreference Resolution with Coarse-to-fine Inference. Kenton Lee, Luheng He, Luke S. Zettlemoyer. NAACL-HLT2018
Improving Coreference Resolution by Learning Entity-Level Distributed Representations. Kevin Clark and Christopher D. Manning, ACL 2016.
Learning Global Features for Coreference Resolution Sam Wiseman, Alexander M. Rush, Stuart M. Shieber. HLT-NAACL, 2016
Learning Anaphoricity and Antecedent Ranking Features for Coreference Resolution Sam Wiseman, Alexander M. Rush, Stuart M. Shieber, Jason Weston. ACL. 2015
Entity-Centric Coreference Resolution with Model Stacking. Kevin Clark, Christopher D. Manning. ACL. 2015
A Joint Framework for Coreference Resolution and Mention Head Detection. Haoruo Peng, Kai-Wei Chang, Dan Roth. CoNLL 2015
Latent Structures for Coreference Resolution. Sebastian Martschat, Michael Strube. Transactions of the Association for Computational Linguistics. 2015
Prune-and-Score: Learning for Greedy Coreference Resolution. Chao Ma, Janardhan Rao Doppa, John Walker Orr, Prashanth Mannem, Xiaoli Z. Fern, Thomas G. Dietterich, Prasad Tadepalli EMNLP. 2014
Easy Victories and Uphill Battles in Coreference Resolution. Greg Durrett, Dan Klein. EMNLP, 2013
Decentralized Entity-Level Modeling for Coreference Resolution. Greg Durrett, David Hall, Dan Klein. ACL2013
Easy-first Coreference Resolution. Veselin Stoyanov, Jason Eisner. COLING. 2012

Cross-Document Coreference

Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution. Shany Barhom, Vered Shwartz, Alon Eirew, Michael Bugert, Nils Reimers, Ido Dagan. ACL 2019
Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization. Kian Kenyon-Dean, Jackie Chi Kit Cheung, Doina Precup *SEM. 2018
CESI: Canonicalizing Open Knowledge Bases using Embeddings and Side Information. Shikhar Vashishth, Prince Jain, Partha Talukdar. WWW 2018.
Revisiting the Evaluation for Cross Document Event Coreference. Shyam Upadhyay, Nitish Gupta, Christos Christodoulopoulos, Dan Roth. COLING2016
Event Detection and Co-reference with Minimal Supervision. Haoruo Peng, Yangqiu Song, Dan Roth. EMNLP. 2016
Twitter at the Grammys: A Social Media Corpus for Entity Linking and Disambiguation. Mark Dredze, Nicholas Andrews, Jay DeYoungPublished in SocialNLP@EMNLP 2016
Cross-document Event Coreference Resolution based on Cross-media Features. Tongtao Zhang, Hongzhi Li, Heng Ji, Shih-Fu Chang. EMNLP 2015
A Hierarchical Distance-dependent Bayesian Model for Event Coreference Resolution. Bishan Yang, Claire Cardie, Peter I. Frazier. TACL. 2015
Cross-Document Co-Reference Resolution using Sample-Based Clustering with Knowledge Enrichment. Sourav Dutta, Gerhard Weikum. TACL. 2015
Robust Entity Clustering via Phylogenetic Inference. Nicholas Andrews, Jason Eisner, Mark Dredze. ACL 2014
Cross-Document Coreference Resolution Using Latent Features. Axel-Cyrille Ngonga Ngomo, Michael Röder, Ricardo Usbeck. LD4IE@ISWC. 2014
Entity Clustering Across Languages. Spence Green, Nicholas Andrews, Matthew R. Gormley, Mark Dredze, Christopher D. Manning. HLT-NAACL. 2012
Joint Entity and Event Coreference Resolution across Documents. Heeyoung Lee, Marta Recasens, Angel X. Chang, Mihai Surdeanu, Daniel Jurafsky. EMNLP-CoNLL. 2012
A Discriminative Hierarchical Model for Fast Coreference at Large Scale. Michael L. Wick, Sameer Singh, Andrew McCallum. ACL. 2012
Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models. Sameer Singh, Amarnag Subramanya, Fernando Pereira, Andrew McCallum. ACL 2011
Streaming Cross Document Entity Coreference Resolution Delip Rao, Paul McNamee, Mark Dredze COLING 2010
Author Disambiguation using Error-driven Machine Learning with a Ranking Loss Function. Aron Culotta, Pallika Kanani, Robert Hall, Michael Wick, Andrew McCallum. 2007
Weakly supervised learning for cross-document person name disambiguation supported by information extraction. Niu, C.; Li, W.; and Srihari, R. K. ACL. 2004
Unsupervised personal name disambiguation. Mann and Yarowsky NAACL. 2003;
Entity-Based Cross-Document Coreferencing Using the Vector Space Model Amit Bagga, Breck Baldwin. COLING-ACL 1998

Clustering Methodology

Analysis of Ward's Method. Anna Großwendt, Heiko Röglin, Melanie Schmidt. SODA. 2019
DBSCAN++: Towards fast and scalable density clustering. Jennifer Jang · Heinrich Jiang. ICML 2019.
Scalable Hierarchical Clustering with Tree Grafting. Nicholas Monath, Ari Kobren, Akshay Krishnamurthy, Michael R. Glass, Andrew McCallum. KDD 2019
Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space. Nicholas Monath, Manzil Zaheer, Daniel Silva, Andrew McCallum, Amr Ahmed. KDD 2019
Hierarchical Clustering better than Average-Linkage. Moses Charikar, Vaggos Chatziafratis, Rad Niazadeh. SODA. 2018
Hierarchical Clustering with Structural Constraints. Vaggos Chatziafratis, Rad Niazadeh, Moses Charikar. ICML 2018
Hierarchical Clustering: Objective Functions and Algorithms. Vincent Cohen-Addad, Varun Kanade, Frederik Mallmann-Trenn, Claire Mathieu. SODA 2018
Canopy Fast Sampling with Cover Trees. Manzil Zaheer, Satwik Kottur, Amr Ahmed, José M. F. Moura, Alexander J. Smola. ICML 2017
A Hierarchical Algorithm for Extreme Clustering. Ari Kobren, Nicholas Monath, Akshay Krishnamurthy, Andrew McCallum. KDD 2017
Can clustering scale sublinearly with its clusters? A variational EM acceleration of GMMs and k-means Dennis Forster, Jörg Lücke. AISTATS. 2017
One-Shot Coresets: The Case of k-Clustering. Olivier Bachem, Mario Lucic, Silvio Lattanzi AISTATS 2017
Hierarchical Clustering Beyond the Worst-Case. Vincent Cohen-Addad, Varun Kanade, Frederik Mallmann-Trenn NIPS. 2017
A Dual-Tree Algorithm for Fast k-means Clustering With Large k. Ryan R. Curtin. SDM 2017.
Training Gaussian Mixture Models at Scale via Coresets. Mario Lucic, Matthew Faulkner, Andreas Krause, Dan Feldman J. Mach. Learn. Res.2017
Exponential Stochastic Cellular Automata for Massively Parallel Inference. Manzil Zaheer, Michael Wick, Jean-Baptiste Tristan, Alexander J. Smola, Guy L. Steele AISTATS 2016
Approximate K-Means++ in Sublinear Time. Olivier Bachem, Mario Lucic, S. Hamed Hassani, Andreas Krause. AAAI. 2016
Fast and Provably Good Seedings for k-Means. Olivier Bachem, Mario Lucic, Seyed Hamed Hassani, Andreas Krause. NIPS. 2016
A cost function for similarity-based hierarchical clustering. Sanjoy Dasgupta. STOC. 2016
Finding Planted Partitions in Nearly Linear Time using Arrested Spectral Clustering. Nader H. Bshouty, Philip M. Long. ICML 2010
Web-scale k-means clustering D. Sculley WWW 2010
A discriminative framework for clustering via similarity functions. Maria-Florina Balcan, Avrim Blum, Santosh S. Vempala STOC 2008
BIRCH: A New Data Clustering Algorithm and Its Applications. Tian Zhang, Raghu Ramakrishnan, Miron Livny. Data Mining and Knowledge Discovery 1997

Meeting 5 (10.04.2019)

[RE] Relation Extraction, Semantic Role Labeling, & Frames

Assigned Reading

[RE:P1] Document-Level N-ary Relation Extraction with Multiscale Representation Learning. Robin Jia, Cliff Wong, Hoifung Poon, ACL 2019 and Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction. Patrick Verga, Emma Strubell, Andrew McCallum. NAACL-HLT. 2018
[RE:P2] Training Classifiers with Natural Language Explanations. Braden Hancock, Paroma Varma, Stephanie Wang, Martin Bringmann, Christopher Re´. ACL. 2018 and Joint Concept Learning and Semantic Parsing from Natural Language Explanations. Shashank Srivastava, Igor Labutov, Tom Mitchell. EMNLP. 2017
[RE:P3] AMR Parsing as Graph Prediction with Latent Alignment. Chunchuan Lyu, Ivan Titov · ACL · 2018

(Srivastava et al, 2017)

(Lyu & Titov, 2018)

Meeting 6 (10.11.2019)

[Emb] Embedding Methods: Points, Gaussian, Cones, Boxes, Cones, Hyperbolic-space methods

Assigned Reading

[Emb:P1] Hyperbolic Entailment Cones for Learning Hierarchical Embeddings. Octavian-Eugen Ganea, Gary Bécigneul, Thomas Hofmann · ICML · 2018
[Emb:P2] Generalizing Point Embeddings using the Wasserstein Space of Elliptical Distributions. Boris Muzellec, Marco Cuturi · NeurIPS · 2018
[Emb:P3] Word Representations via Gaussian Embedding Luke Vilnis, Andrew McCallum · ICLR · 2015

Box Representations (Vilnis et al, 2018)

Gaussian Embedding (Vilnis & McCallum, 2015)

Meeting 7 (10.18.2019)

Midpoint Project Presentations

No Readings

Meeting 8 (10.25.2019)

Midpoint Project Presentations

No Readings

Meeting 9 (11.01.2019)

[GNN+Index] Graph Neural Networks & Learned Index Structures

Assigned Reading

[GNN+Index:P1] The Case for Learned Index Structures. Tim Kraska, Alex Beutel, Ed Huai-hsin Chi, Jeffrey Dean, Neoklis Polyzotis in SIGMOD Conference 2017
[GNN+Index:P2] Graph convolution over pruned dependency trees improves relation extraction. Yuhao Zhang, Peng Qi, Christopher D. Manning. EMNLP. 2018 and Attention Guided Graph Convolutional Networks for Relation Extraction. Zhijiang Guo, Yan Zhang, Wei Lu. ACL 2019.
[GNN+Index:P3] Learning to Route in Similarity Graphs. Dmitry Baranchuk · Dmitry Persiyanov · Anton Sinitsin · Artem Babenko. ICML 2019

Meeting 10 (11.08.2019)

[QA] Question Answering, Reasoning, & Pretrained language models

Assigned Reading

1. Learning to Compose Neural Networks for Question Answering. Jacob Andreas, Marcus Rohrback, Trevor Darrell and Dan Klein. , NAACL 2016

2. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova, NAACL 2018

3. End-to-End Differentiable Proving. Tim Rocktäschel, Sebastian Riedel, Neurips 2017

4. Language Models as Knowledge Bases. Fabio Petroni, Tim Rocktaschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu. Alexander H. Miller, Sebastian Riedel. EMNLP 2019

Meeting 11 (11.15.2019)

[Fair] Fairness & Adversarial Attacks

Assigned Reading

[Fair:P1] Compositional Fairness Constraints for Graph Embeddings. Avishek Joey Bose, William L. Hamilton ICML 2019
[Fair:P2] Fairness in Relational Domains. Golnoosh Farnadi, Behrouz Babaki, Lise Getoor AIES 2018
[Fair:P3] Mitigating Gender Bias in Natural Language Processing: Literature Review. Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth M. Belding-Royer, Kai-Wei Chang, William Yang Wang ACL 2019

Meeting 12 (11.22.2019)

Guest Lecture - Marco Serafini

Meeting 13 (12.06.2019)

Final Project Poster Presentations

No Readings

Page updated

Report abuse

Readings

Paper presentation & scribe information

Additions? Corrections? Changes?

Meeting 1 (9.6.2019)

Introduction to KBs, automated methods for KB construction

Suggested Reading

Meeting 2 (9.13.2019)

[KR] Knowledge Representations; Course Project Overview

Assigned Reading

Suggested Reading

Meeting 3 (9.20.2019)

[Type+Link] Mention Segmentation, Entity Typing, Entity Linking

Assigned Reading

Suggested Reading

Meeting 4 (9.27.2019)

[Clustering] Entity Resolution & Clustering

Assigned Reading

Entity Resolution & Clustering

Meeting 5 (10.04.2019)

[RE] Relation Extraction, Semantic Role Labeling, & Frames

Assigned Reading

Suggested Reading

Meeting 6 (10.11.2019)

[Emb] Embedding Methods: Points, Gaussian, Cones, Boxes, Cones, Hyperbolic-space methods

Assigned Reading

Suggested Reading

Meeting 7 (10.18.2019)

Midpoint Project Presentations

Meeting 8 (10.25.2019)

Midpoint Project Presentations

Meeting 9 (11.01.2019)

[GNN+Index] Graph Neural Networks & Learned Index Structures

Assigned Reading

Suggested Reading

Meeting 10 (11.08.2019)

[QA] Question Answering, Reasoning, & Pretrained language models

Assigned Reading

Suggested Reading

Meeting 11 (11.15.2019)

[Fair] Fairness & Adversarial Attacks

Assigned Reading

Suggested Reading

Meeting 12 (11.22.2019)

Guest Lecture - Marco Serafini

Meeting 13 (12.06.2019)

Final Project Poster Presentations