Subword & Character Level Models in NLP

The 1st Workshop on Subword and Character level models in NLP (SCLeM) will be held on September 7 at EMNLP 2017 in Copenhagen, Denmark.

The workshop provides a forum for discussing recent advances as well as future directions on sub-word and character-level natural language processing and representation learning. A detailed discussion on ideas and motivation behind this workshop are described here.

We thank Google Research for sponsoring our workshop in part.

News

14 Sep: the schedule is updated with the slides from four invited talks
10 Sep: link to the proceedings: http://www.aclweb.org/anthology/W/W17/#4100

Important Dates

Deadline for paper submission:~~June 10, 2017~~

Notification of acceptance: ~~June 30, 2017~~
Camera ready submission due: ~~July 21, 2017~~
Early registration deadline: ~~August 12, 2017~~
Workshop: ~~September 7, 2017~~

Invited Speakers

Kyunghyun Cho, NYU
Karen Livescu, TTIC
Tomas Mikolov, Facebook
Noah Smith, Univ of Washington

Topics

tokenization-free models
character-level machine translation
character-ngram information retrieval
transfer learning for character-level models
models of within-token and cross-token structure
NL generation (of words not seen in training etc)
out of vocabulary words
morphology & segmentation
relationship b/w morphology & character-level models
stemming and lemmatization
inflection generation
orthographic productivity
form-meaning representations
true end-to-end learning
spelling correction
efficient and scalable character-level models

Anti-harassment Policy

Our workshop highly values the open exchange of ideas, the freedom of thought and expression, and respectful scientific debate. We support and uphold the ACL Anti-Harassment policy, and any workshop participant should feel free to contact any of the NAACL Board members or Priscilla Rasmussen, in case of any issues.