Subword & Character Level Models in NLP
The 1st Workshop on Subword and Character level models in NLP (SCLeM) will be held on September 7 at EMNLP 2017 in Copenhagen, Denmark.
The workshop provides a forum for discussing recent advances and future directions in subword and character-level natural language processing and representation learning. The ideas and motivation behind this workshop are described in detail here.
We thank Google Research for sponsoring our workshop in part.
News
- 14 Sep: the schedule is updated with the slides from four invited talks
- 10 Sep: link to the proceedings: http://www.aclweb.org/anthology/W/W17/#4100
Important Dates
- Deadline for paper submission: June 10, 2017
- Notification of acceptance: June 30, 2017
- Camera ready submission due: July 21, 2017
- Early registration deadline: August 12, 2017
- Workshop: September 7, 2017
Invited Speakers
- Kyunghyun Cho, NYU
- Karen Livescu, TTIC
- Tomas Mikolov, Facebook
- Noah Smith, Univ of Washington
Topics
- tokenization-free models
- character-level machine translation
- character-ngram information retrieval
- transfer learning for character-level models
- models of within-token and cross-token structure
- natural language generation (e.g., of words not seen in training)
- out of vocabulary words
- morphology & segmentation
- relationship between morphology & character-level models
- stemming and lemmatization
- inflection generation
- orthographic productivity
- form-meaning representations
- true end-to-end learning
- spelling correction
- efficient and scalable character-level models
Anti-harassment Policy
Our workshop highly values the open exchange of ideas, the freedom of thought and expression, and respectful scientific debate. We support and uphold the ACL Anti-Harassment policy. Any workshop participant should feel free to contact any of the NAACL Board members or Priscilla Rasmussen in case of any issues.