Resources

Through my involvement in the PARSEME shared tasks 1.0 and 1.1, I am a co-author of the multilingual annotated corpora associated with those shared tasks.

  • Version 1.0 can be retrieved from the CLARIN/LINDAT repository and cited as: Savary, Agata; Ramisch, Carlos; Cordeiro, Silvio Ricardo; et al., 2017, Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.0), LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11372/LRT-2282.
  • Version 1.1 can be retrieved from the CLARIN/LINDAT repository and cited as: Ramisch, Carlos; Cordeiro, Silvio Ricardo; Savary, Agata; et al., 2018, Annotated corpora and tools of the PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions (edition 1.1), LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11372/LRT-2842.

During my PhD, I developed a domain-specific German-Spanish parallel corpus aligned at the sentence level. It has been released through CLARINO under a CC-BY-NC-SA license. There are three versions, corresponding to successive updates made during my PhD. If you want the most recent one, use v0.3:

  1. Parra Escartín, Carla, 2012, Parallel Corpus of documents from the Technical Regulations Information System for German-Spanish (v0.3), Common Language Resources and Technology Infrastructure Norway (CLARINO) Bergen Repository, http://hdl.handle.net/11509/79.
  2. Parra Escartín, Carla, 2012, Parallel Corpus of documents from the Technical Regulations Information System for German-Spanish (v0.2), Common Language Resources and Technology Infrastructure Norway (CLARINO) Bergen Repository, http://hdl.handle.net/11509/38.
  3. Parra Escartín, Carla, 2011, Parallel Corpus of documents from the Technical Regulations Information System for German-Spanish (v0.1), Common Language Resources and Technology Infrastructure Norway (CLARINO) Bergen Repository, http://hdl.handle.net/11509/3.