Corpora

CORPORA

Rosés Labrada, J. E. (2016-present). Piaroa Documentation Corpus (Data collection ongoing). Deposited with the Endangered Languages Archive (SOAS)

  • Primary data: 47:16:49 hours of video plus 11:44:37 hours of audio (as of June 2019)

  • Secondary data:

o 81:11:06 hours of transcription and translation of 4:43:51 hours of audiovisual recordings (as of June 2019)

Elfner, E., J. E. Rosés Labrada and P. A. Shaw (2016, 2019). “Segmental Contrasts in Kwak’wala” Corpus

  • Primary data: 19:45:19 hours of audio

Hashimoto, E. & Rosés Labrada, J. E. (2018). Time-aligned Corpus of Makah Stories (based on William Jacobsen’s documentation). Deposited with the California Languages Archive (collection # 2018-32)

  • Time-aligned data: 1 hour of audio + 94 pages of handwritten transcription/translation

Rosés Labrada, J. E., Thiago Chacon & Francia Medina (2017). Arutani and Shirián Documentation Corpus. Deposited with Archive of the Indigenous Languages of Latin America (University of Texas, Austin)

  • Primary data: 13:25:24 hours of audio and 2:37:41 hours of video

Rosés Labrada, J. E. & Francia Medina (2017). Sapé Documentation Corpus. Deposited with the Endangered Language Fund (Yale University) and Archive of the Indigenous Languages of Latin America (University of Texas, Austin)

  • Primary data: 6:14:48 hours of video

  • Other material: 00:51:52 hours of Shirián (ISO: shb) discourse

Rosés Labrada, J. E. (2012-2015). Mako Documentation Corpus. (Partly) Deposited with the Endangered Languages Archive (SOAS)

  • Primary data: 54:40:46 hours of audio and 23:05:07 hours of video

  • Secondary data:

o 178:42:14 hours of transcription and translation of 10:55:37 hours of audio recordings

o 1,202 pages (23.5 cm x 18.4 cm) of transcription and translation notes

Tennant, J., D. Heap, J.E. Rosés Labrada, A. Hernández et al. (Ongoing) Cuban Spanish Corpus

  • Primary data: 10:65:33 hours of recorded interviews with speakers of Holguín Spanish

  • Secondary data: 7:14:8 hours coded in Praat

CURATION OF LEGACY COLLECTIONS

Hortensia Estrada Ramírez’s Collection of Colombian Piaroa lexical data, gathered in 2008. (now deposited at ELAR)

o Primary data: 10:10:10 hours of audio

Jon Landaburu’s Collection of Sáliba materials, gathered 1968. (now digitized)

      • Primary data: 718 items (pages + slips) and 11:29:24 of audio

Lajos Bóglar & Istvan Halmos’s Collection of Piaroa materials, gathered 1967-1968 and housed at the Institute for Musicology, Hungarian Academy of Sciences. (now segmented in SayMore)

      • Primary data: 96:56:38 hours of audio

Larry Krute’s Collection of Piaroa materials, gathered 1963. (now archived at ELAR)

      • Primary data: 03:09:46 hours of audio

Daisy Barreto’s Collection of Mako photos and negatives, gathered 1973-1974. (now digitized and awaiting archiving at AILLA)

      • Primary data: 261 items