Workshop Program

Pre-conference workshop at GSCL Conference 2015
Essen, Germany
September 29, 2015




NLP 4 CMC 2015:
2nd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media

Location: University of Duisburg-Essen’s Campus Essen, Room T03 R02 D82


10:00

Coffee @ NLP4CMC

10:30

Opening

Building and annotating CMC corpora

10:30–11:00

The DiDi Corpus of South Tyrolean CMC Data

Jennifer-Carmen Frey, Aivars Glaznieks and Egon Stemle
(EURAC, Bozen, IT)

11:00–11:30

Collection, Description, and Visualization of the German Reddit Corpus

Adrien Barbaresi (BBAW Berlin, DE)

11:30–12:00

Adding Value to CMC Corpora: CLARINification and Part-of-speech Annotation of the Dortmund Chat Corpus

Michael Beißwenger, Eric Ehrhardt, Andrea Horbach, Harald Lüngen, Diana Steffen and Angelika Storrer
(U Dortmund, U Mannheim, IDS Mannheim, U Saarbrücken, DE)

12:00–12:30

Building and Annotating a Corpus of German-Language Newsgroups

Jasmin Schröck and Harald Lüngen (IDS Mannheim, DE)

12:30–14:00

Lunch break

Using NLP for analysing CMC corpora

14:00–14:30

Using discursive information to disentangle French language chat

Matthieu Riou, Nicolas Hernandez and Soufian Salim (U Nantes, FR)

14:30–15:00

Text-based Geolocation of German Tweets

Johannes Gontrum and Tatjana Scheffler (U Potsdam, DE)

15:00–15:30

Modes of Communication in Social Media for Emergency Management

Sabine Gruender-Fahrer and Antje Schlaf (U Leipzig, DE)

15:30–16:00

Coffee break

Adapting the NLP toolkit to CMC genres

16:00–16:30

Unsupervised Induction of Part-of-Speech Information for OOV Words in German Internet Forum Posts

Jakob Prange, Stefan Thater and Andrea Horbach (U Saarbrücken, DE)

16:30–17:00

Bootstrapped Extraction of Index Terms from Normalized User-Generated Content

 Piroska Lendvai and Thierry Declerck (U Saarbrücken, DE)

17:00–17:15

Preview of EmpiriST2015:
Empirikom/GSCL Shared Task on Automatic Linguistic Processing of Computer-Mediated Communication:
Annotation Guidelines and Trial Data

17:15–

Concluding discussion


Talks will last 20 minutes (with 10 minutes for discussion).
A PDF volume with proceedings will be published online together with the workshop.