Syntactic Parsing and MultiWord Expressions in Italian

Welcome to the official website of the PARSEME-IT project.

The PARSEME-IT project aims at improving linguistic representativeness, precision, robustness and computational efficiency of Natural Language Processing (NLP) applications, in particular for the Italian language. The project focuses on a major bottleneck of these applications: MultiWord Expressions (MWEs), that is, groups of words that must be treated as units at some level of linguistic processing, such as hot dog, hard disk, kick the bucket, United Nations and pay attention.

The main aim of the project is to bridge the gap between linguistic precision and computational efficiency in NLP applications by investigating the syntactic and semantic representation of MWEs in language resources, the integration of MWE analysis in syntactic parsing and translation technology. Expected deliverables include mainly enhanced monolingual language resources (lexicons, grammars and annotated corpora) in Italian or multilingual linguistic resources with the Italian language. This project is a spin-off of the European IC1207 COST action, PARSEME.