Social MT 2017

First workshop on Social Media and User Generated Content Machine Translation

Co-located with EAMT 2017, Prague, Czech Republic

Extended versions of the best papers may be published into an upcoming special issue of “Translating User Generated Content” on Machine Translation Journal

Important Dates

January 30, 2017 : First Call for Workshop Papers

March 8, 2017: Second Call for Workshop Papers

~~March 24, 2017 (~~extended to April 3, 2017): Workshop Paper Due Date

~~April 14, 2017~~ (extended to April 20, 2017): Notification of Acceptance

May 12, 2017: Camera-ready papers due

May 31, 2017: Workshop Date (half-day workshop)

Invited Speaker

Houda Bouamor, Carnegie Mellon University, Qatar

Translating user generated content: an overview and Arabic dialect translation as a case study

Workshop Organizers

General Chair: Andy Way, ADAPT Centre, Dublin City University

Program Chair: Haithem Afli, ADAPT Centre, Dublin City University

Workshop Description

With the widespread adoption of social media and online forums, individual users have been able to actively participate in the generation of online content in different languages and dialects. As a result, user-generated content (UGC) has seen an enormous growth in the recent years. The nature of UGC means that it can be generated at any time and in non-standard language or formats. Compared to professionally edited text, it is often more noisy, and likely to take some liberty with commonly established grammar, punctuation and spelling norms. All this can make it difficult to translate but UGC can also be incredibly valuable. This workshop will explore the multifarious aspects of effective MT of data extracted from social media.

The workshop aims to provide a research platform dedicated to new method and techniques on translating user-generated content and exploring the use of such transition on social media analytics. The workshop will solicit original research contributions related to the theme, which includes (but is not limited to):

Models and Tools Development for Social MT
Machine translation on Microblogs
Multi-lingual social analytics
Neural MT for UGC translation
Multilingual crowdsourcing
Building resources for UGC translation
Sentiment translation of UGC
Analyzing the diffusion of multilingual information
Using MT for monitoring emergency responses among social crowds
Multilingual Social-based web platform for disaster management
Multilingual and language-specific Information Retrieval on Social Web
Crosslingual document alignment using UGC data
Named entity transliteration on social media content
Code-mixed UGC translation
MT for Big social data analysis

Submissions may include work in progress as well as finished work. Submissions must have a clear focus on specific issues pertaining to UGC and its translation. Descriptions of commercial systems are welcome, but authors should be willing to discuss the details of their work.

Additionally, the workshop will aim to bring together researchers from diverse fields, such as Machine Translation, Big Data and Machine Learning, Natural Language Processing, and Computational Social Sciences, who can potentially contribute to improving the quality of UGC translation and its utilisation in research and industrial data analytics tasks.

Paper Submission Instructions

Submission Format: Submissions must conform to the official style guidelines for EAMT 2017 (https://ufal.mff.cuni.cz/pbml/instructions-authors).

Overleaf Project to Clone: https://www.overleaf.com/read/jgvrjmrqnwct

MS Word Template: https://ufal.mff.cuni.cz/eamt2017/files/templates/eamt17.doc

Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding eight (8) pages plus any number of pages for references. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding twelve (12) pages plus any number of pages for references.

Reviewing will be double blind, so the papers should not reveal the authors’ identity. Accepted papers will be published in the workshop proceedings.

Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.

Submission Website: https://easychair.org/conferences/?conf=socialmt2017

For further information, please contact Haithem Afli mailto:haithem.afli(at)adaptcentre(dot)ie

Program @ EAMT (half-day workshop)

Weds afternoon of 31st May

14:30-14:40

Opening and Welcome

Andy Way and Haithem Afli

14:40-15:20

Keynote Speech

Houda Bouamor

Translating user generated content: an overview and Arabic dialect translation as a case study.

15:20-15:40

Oral presentation 1

Chao-Hong Liu, Declan Groves, Akira Hayakawa, Alberto Poncelas and Qun Liu. Understanding Meanings in Multilingual Customer Feedback.

15:40-16:00

Oral presentation 2

Fatma Mallek, Ngoc Tan Le and Fatiha Sadat. Improved Machine Translation for Arabic Tweets on Scarce-resource settings.

16:00-16:20

Coffee Break and Networking

16:20-16:40

Oral presentation 3

Meghan Dowling, Teresa Lynn and Andy Way. A Crowd-sourcing Approach for Translations of Minority Language User-Generated Content

16:40-17:00

Oral presentation 4

Imane Guellil, Azouaou Faical, Mourad Abbas and Fatiha Sadat. Towards an automatic Machine Translation for Arabic using Arabizi Neural Machine Transliteration

17:00-17:45

Panel discussion on open and upcoming challenges in translating UGC

17:45-18:00

Closing

Program Committee

Loïc Barrault (LIUM, Le Mans University)
Laurent Besacier (LIG, Grenoble University)
Philipp Koehn (Johns Hopkins University)
Abdelkarim Mars (Grenoble University)
Matteo Negri (FBK)
Houda Bouamor (CMU Qatar)
Yvette Graham (ADAPT Centre, Dublin City University)
Dimitar Shterionov (KantanMT)
Marco Turchi (FBK)
Antonio Toral (University of Groningen)
Lucia Specia (University of Sheffield)
Kashif Shah (eBay)
Rejwanul Haque (Lingo24)
Barry Haddow (University of Edinburgh)
Jinhua Du (ADAPT Centre, Dublin City University)
Daniel Stein (eBay)
Mohammed Hasanuzzaman (ADAPT Centre, Dublin City University)
Walid Aransa (LIUM, Le Mans University)
Pintu Lohar (ADAPT Centre, Dublin City University)

Workshop Venue

The workshop will be held at the Faculty of Mathematics and Physics, Charles University, Malostranské náměstí 25, 11800 Prague 1, Czech Republic.

Google Sites

Report abuse