SIGDAT is the Association for Computational Linguistics special interest group for linguistic data and corpus-based approaches to natural language processing.  SIGDAT organizes the EMNLP Conference.


  • September 17-21, 2015 (workshops September 17-18, main conference September 19-21)
  • Culturgest, Lisbon, Portugal
  • General chair:  Lluis Marquez
  • Program co-chairs:  Chris Callison-Burch and Jian Su
  • Deadline:  May 31, 2015 (long papers), June 15, 2015 (short papers)


SIGDAT was founded in 1993 and is one of ACL's oldest SIGs.

Since its inception, SIGDAT's primary mission has been to organize a series of conferences and workshops, including EMNLP (Conference on Empirical Methods in Natural Language Processing) and WVLC (Workshop on Very Large Corpora). These meetings have become quite popular, and and EMNLP is now typically a 3-day conference with 250-500 attendees and 800-1200 page proceedings.

SIGDAT is generally focused on corpus-based and statistical methods in Natural Language Processing, and encourages initiatives in support of this broader mission from its members.