ILLC-NLP 2024

First Workshop on NLP for Indigenous Languages of Lusophone Countries 

March 12, 2024

Thank you everyone that attended ILLC-NLP 2024, the first edition of our workshop! 

The panel discussion can be watched back here: https://drive.google.com/file/d/1dax00Ey2gIGTxPxlt3hKjYU7Nz98IAfA/view?usp=sharing

The keynote and online paper presentations can be viewed here: https://drive.google.com/file/d/1RqFCWCx4kPm2EqhqcLiVzFzK_Ixq86bW/view?usp=sharing

Welcome to ILLC-NLP 2024, the first Workshop on NLP for Indigenous Languages of Lusophone Countries, co-located with PROPOR 2024 in Santiago de Compostela, Galicia!

Get in touch at:  illc-nlp-2024@googlegroups.com

The Lusophone community includes nine Portuguese-speaking nations on four different continents; Portuguese is the sole official language in seven of them and one of the official languages for the other two. While Portuguese may be spoken widely in these countries, they also have many indigenous and minority languages spoken natively by large populations (e.g. Umbundu in Angola has ~7 million native speakers, Makhuwa in Mozambique ~9 million, and Fang in Equatorial Guinea ~1 million). There are also many languages spoken natively by smaller populations which nevertheless have an important cultural history (e.g. Brazil alone has 217 recognized indigenous languages). 

Despite their prevalence and importance, these languages are seriously under-resourced and under-researched, and many face extinction. As advances in NLP-based technologies have started to reach more spheres of society with applications in diverse domains, these include indigenous languages, with tools aiding in promoting and preserving these languages. However, the focus on indigenous languages of Lusophone countries has been limited. As such, ILLC-NLP targets an area for which research, resources, and tools are in dire need of development and promotion.

Lusophone Countries, where Portuguese is an official language (src: wikipedia.org)


Call for Papers

The aim of ILLC-NLP is to encourage the development and application of Natural Language Processing (NLP) techniques to indigenous languages of countries where Portuguese is an official language.  Such languages are under-resourced and marginalized, despite often having a large number of native speakers, and hence there is a strong need to develop techniques which: preserve these languages, and hence their indigenous cultures; improve visibility of marginalized communities; and improve communication and access to information for such communities. Addressing this need, the ILLC-NLP 2024 workshop brings together researchers and practitioners from academia and industry to share their work, including annotated datasets, methods, trained models and applications. The workshop will also provide a forum for researchers and practitioners to collaborate on new projects. The workshop will feature a combination of keynote presentations, panel discussions, paper presentations, and interactive hands-on sessions. We will encourage participants to collaborate and develop concrete projects and initiatives during the workshop.

We call for papers describing work on any topic related to computational language and speech processing of indigenous languages of Lusophone countries by researchers in industry or academia. We also welcome work on low-resource languages of Lusophone countries which don't necessarily fall under the indigenous header, including Portuguese creoles (e.g. Cape Verdean Creole, Guinea-Bissau Creole, Papiamento).  Topics of interest include, but are not limited to:

ILLC-NLP 2024 will be co-located with  PROPOR 2024, which will be held at the University of Santiago de Compostela (Santiago de Compostela – Galicia, Spain) from March 14th to March 15th. 

Submissions should describe original, unpublished work. Authors are invited to submit two kinds of papers:

Submissions should be written in English. At submission time, papers must be in PDF format only. For the final versions, authors of accepted papers will be given 1 extra content page to take the reviews into account. Authors of accepted papers will be requested to send the source files for the production of the proceedings. All submitted papers must conform to the official ACL style guidelines (Latex or Word)

Both long and short papers will be published in the ACL Anthology. 

Submission site: Papers should be submitted via Easy Chair by either selecting the track ILLC-NLP2024 Long Paper or ILLC-NLP2024  Short Paper.

Reviewing format: At least two reviewers will evaluate each submission. The reviewing format will be single-blind. 


Dates

All deadlines are 23:59 A.o.E

Organisers

Program Committee

Program

Fontán Building (room 7)