Subword & Character Level Models in NLP

The 1st Workshop on Subword and Character level models in NLP (SCLeM) will be held on September 7 at EMNLP 2017 in Copenhagen, Denmark.

The workshop provides a forum for discussing recent advances as well as future directions on sub-word and character-level natural language processing and representation learning. A detailed discussion on ideas and motivation behind this workshop are described here.

We thank Google Research for sponsoring our workshop in part.

Important Dates

Deadline for paper submission: June 2, 2017 (new deadline: June 10, 2017)

  • Notification of acceptance: June 30, 2017
  • Camera ready submission due: July 14, 2017 (new deadline: July 21, 2017)
  • Early registration deadline: TBD
  • Workshop: September 7, 2017

Invited Speakers

Topics

  • tokenization-free models
  • character-level machine translation
  • character-ngram information retrieval
  • transfer learning for character-level models
  • models of within-token and cross-token structure
  • NL generation (of words not seen in training etc)
  • out of vocabulary words
  • morphology & segmentation
  • relationship b/w morphology & character-level models
  • stemming and lemmatization
  • inflection generation
  • orthographic productivity
  • form-meaning representations
  • true end-to-end learning
  • spelling correction
  • efficient and scalable character-level models

Anti-harassment Policy

Our workshop highly values the open exchange of ideas, the freedom of thought and expression, and respectful scientific debate. We support and uphold the ACL Anti-Harassment policy, and any workshop participant should feel free to contact any of the NAACL Board members or Priscilla Rasmussen, in case of any issues.