Keyword Extraction in Scientific Documents

at SwissText 2022

 

Announcements

Welcome to the workshop! 


If you have not done so yet, please fill in this background survey (Google form) so that we can get to know you better!

Also join the Slack channel [updated!] for more streamlined communication!


About the Workshop

The number of scientific publications is growing exponentially, which makes it increasingly challenging to keep track of trends and changes. Understanding scientific documents is an important step for downstream tasks such as knowledge graph construction, text mining, and discipline classification.

In this workshop, we aim to provide a better understanding of keyword and keyphrase extraction from the abstracts of scientific publications. Beyond this workshop, the methods are also applicable to other kinds of text, such as media articles.
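
To make the task concrete, here is a minimal sketch of a purely frequency-based keyword extractor. It is an illustrative baseline only and not one of the systems covered in the workshop; the function name `extract_keywords`, the tiny stopword list, and the sample abstract are all assumptions made for this example.

```python
import re
from collections import Counter

# Deliberately tiny, hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {
    "a", "an", "and", "are", "as", "at", "be", "by", "for", "from", "in",
    "is", "it", "of", "on", "or", "that", "the", "this", "to", "we", "with",
}


def extract_keywords(abstract: str, top_k: int = 5) -> list[str]:
    """Return the top_k most frequent non-stopword tokens of an abstract."""
    # Lowercase and keep alphabetic tokens only.
    tokens = re.findall(r"[a-z]+", abstract.lower())
    # Drop stopwords and very short tokens, then count what remains.
    counts = Counter(t for t in tokens if t not in STOPWORDS and len(t) > 2)
    return [word for word, _ in counts.most_common(top_k)]


sample_abstract = (
    "Keyword extraction identifies the terms that best describe a document. "
    "Keyword extraction methods rank candidate terms, and the top terms are "
    "returned as keywords."
)
top_terms = extract_keywords(sample_abstract, top_k=3)
print(top_terms)
```

Such a baseline only returns single words and ignores multi-word keyphrases, which is one reason more elaborate methods are worth studying.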

Program Overview

The workshop is part of SwissText 2022 and will take place in Lugano-Viganello at the East Campus of USI-SUPSI, in Room A1.03.

It is scheduled for Wednesday, 8 June 2022, from 13:00 to 16:30.

The tentative schedule is as follows:




Google Colaboratory

We will work exclusively in Google Colaboratory, so no local installation on your laptop is necessary.


The Jupyter notebooks for the individual systems can be found at the following links:

To run the code, you must first copy the notebook to your own Google Drive, which requires signing in with a Google account.


Furthermore, we also provide an evaluation function for all systems at the following link:
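
The provided evaluation function is not reproduced here. As a rough sketch of how extracted keywords are commonly scored against author-assigned ones, the following computes exact-match precision, recall, and F1 after simple normalization. The names `normalize` and `evaluate`, the normalization rule, and the example keyword lists are assumptions for illustration and may differ from the workshop's actual evaluation.

```python
def normalize(keyword: str) -> str:
    """Lowercase and collapse whitespace so 'Text  Mining' matches 'text mining'."""
    return " ".join(keyword.lower().split())


def evaluate(predicted: list[str], gold: list[str]) -> dict[str, float]:
    """Exact-match precision, recall, and F1 between two keyword lists."""
    pred = {normalize(k) for k in predicted}
    ref = {normalize(k) for k in gold}
    hits = len(pred & ref)  # keywords found in both sets
    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(ref) if ref else 0.0
    # F1 is the harmonic mean; define it as 0 when there is no overlap.
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical system output and gold keywords for a single abstract.
scores = evaluate(
    predicted=["Keyword Extraction", "neural networks", "text mining"],
    gold=["keyword extraction", "text mining", "knowledge graph", "abstracts"],
)
print(scores)
```

Exact matching is strict: a prediction such as "keyword extractor" would count as a miss against "keyword extraction", so stemmed or partial matching is sometimes used instead.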



Data

We will use the abstracts and keywords of approximately 40,000 scientific papers as our dataset.

The complete dataset can be found in this ETH Polybox folder.



About the Organizers

This workshop is organized by ETH Zürich and Neue Zürcher Zeitung (NZZ).