Submission Deadline : April 20, 2022 (Extended until June 20 - CLOSED)

About the Speaker

NGUYEN Thi Minh Huyen

VNU University of Science, Hanoi

INVITED SPEAKER

Towards Universal Syntactic-Semantic Resources for Vietnamese

Invited Lecture

Despite the fact that Vietnamese is spoken by around 100 million persons all over the world, it remains a low resource language in terms of gold datasets for natural language processing (NLP). Although many NLP applications have been developed in recent years thanks to the emergence of deep learning and embedding methods, it is important to build sustainable linguistic resources, using sophisticated linguistic annotation frameworks for the Vietnamese language, in harmony with universal frameworks. In this talk, I will present our work on the construction of Vietnamese syntactic-semantic resources, including a VerbNet-based lexicon and annotated corpora for syntactic and semantic parsing.

Bionote

Huyen Nguyen started to work on Vietnamese text processing in the 2000s. She obtained her PhD in 2006 at LORIA, working on linguistic resources and tools for French - Vietnamese text alignment. For several years, she has been a key member of the Vietnamese board for Language and Speech Processing (VLSP). In 2020, she became the first president of the Association for VLSP. She has contributed to many projects for building Vietnamese lexical, syntactic and semantic resources and tools shared with the VLSP research community. Since 2012, she has been the organizing co-chair of eight editions of the international VLSP workshop series in Vietnam. She is currently interested in building Vietnamese syntactic-semantic linguistic resources and medical text processing.

Page updated

Google Sites

Report abuse