January 30, 2017 : First Call for Workshop Papers
March 8, 2017: Second Call for Workshop Papers
March 24, 2017 (extended to April 3, 2017): Workshop Paper Due Date April 14, 2017 (extended to April 20, 2017): Notification of Acceptance
May 12, 2017: Camera-ready papers due
May 31, 2017: Workshop Date (half-day workshop)
General Chair: Andy Way, ADAPT Centre, Dublin City University
Program Chair: Haithem Afli, ADAPT Centre, Dublin City University
With the widespread adoption of social media and online forums, individual users have been able to actively participate in the generation of online content in different languages and dialects. As a result, user-generated content (UGC) has seen an enormous growth in the recent years. The nature of UGC means that it can be generated at any time and in non-standard language or formats. Compared to professionally edited text, it is often more noisy, and likely to take some liberty with commonly established grammar, punctuation and spelling norms. All this can make it difficult to translate but UGC can also be incredibly valuable. This workshop will explore the multifarious aspects of effective MT of data extracted from social media.
The workshop aims to provide a research platform dedicated to new method and techniques on translating user-generated content and exploring the use of such transition on social media analytics. The workshop will solicit original research contributions related to the theme, which includes (but is not limited to):
- Models and Tools Development for Social MT
- Machine translation on Microblogs
- Multi-lingual social analytics
- Neural MT for UGC translation
- Multilingual crowdsourcing
- Building resources for UGC translation
- Sentiment translation of UGC
- Analyzing the diffusion of multilingual information
- Using MT for monitoring emergency responses among social crowds
- Multilingual Social-based web platform for disaster management
- Multilingual and language-specific Information Retrieval on Social Web
- Crosslingual document alignment using UGC data
- Named entity transliteration on social media content
- Code-mixed UGC translation
- MT for Big social data analysis
Submissions may include work in progress as well as finished work. Submissions must have a clear focus on specific issues pertaining to UGC and its translation. Descriptions of commercial systems are welcome, but authors should be willing to discuss the details of their work.
Additionally, the workshop will aim to bring together researchers from diverse fields, such as Machine Translation, Big Data and Machine Learning, Natural Language Processing, and Computational Social Sciences, who can potentially contribute to improving the quality of UGC translation and its utilisation in research and industrial data analytics tasks.
Paper Submission Instructions
Submission Format: Submissions must conform to the official style guidelines for EAMT 2017 (https://ufal.mff.cuni.cz/pbml/instructions-authors).
Overleaf Project to Clone: https://www.overleaf.com/read/jgvrjmrqnwct
MS Word Template: https://ufal.mff.cuni.cz/eamt2017/files/templates/eamt17.doc
Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding eight (8) pages plus any number of pages for references. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding twelve (12) pages plus any number of pages for references.
Reviewing will be double blind, so the papers should not reveal the authors’ identity. Accepted papers will be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.
Submission Website: https://easychair.org/conferences/?conf=socialmt2017
For further information, please contact Haithem Afli mailto:haithem.afli(at)adaptcentre(dot)ie
Program @ EAMT (half-day workshop)
Weds afternoon of 31st May
Opening and Welcome
Andy Way and Haithem Afli
Translating user generated content: an overview and Arabic dialect translation as a case study.
Chao-Hong Liu, Declan Groves, Akira Hayakawa, Alberto Poncelas and Qun Liu. Understanding Meanings in Multilingual Customer Feedback.
Oral presentation 2
Fatma Mallek, Ngoc Tan Le and Fatiha Sadat. Improved Machine Translation for Arabic Tweets on Scarce-resource settings.
Coffee Break and Networking
Meghan Dowling, Teresa Lynn and Andy Way. A Crowd-sourcing Approach for Translations of Minority Language User-Generated Content
Oral presentation 4
Imane Guellil, Azouaou Faical, Mourad Abbas and Fatiha Sadat. Towards an automatic Machine Translation for Arabic using Arabizi Neural Machine Transliteration
Panel discussion on open and upcoming challenges in translating UGC
- Loïc Barrault (LIUM, Le Mans University)
- Laurent Besacier (LIG, Grenoble University)
- Philipp Koehn (Johns Hopkins University)
- Abdelkarim Mars (Grenoble University)
- Matteo Negri (FBK)
- Houda Bouamor (CMU Qatar)
- Yvette Graham (ADAPT Centre, Dublin City University)
- Dimitar Shterionov (KantanMT)
- Marco Turchi (FBK)
- Antonio Toral (University of Groningen)
- Lucia Specia (University of Sheffield)
- Kashif Shah (eBay)
- Rejwanul Haque (Lingo24)
- Barry Haddow (University of Edinburgh)
- Jinhua Du (ADAPT Centre, Dublin City University)
- Daniel Stein (eBay)
- Mohammed Hasanuzzaman (ADAPT Centre, Dublin City University)
- Walid Aransa (LIUM, Le Mans University)
- Pintu Lohar (ADAPT Centre, Dublin City University)