Corpora
CORPORA
Rosés Labrada, J. E. (2016-present). Piaroa Documentation Corpus (Data collection ongoing). Deposited with the Endangered Languages Archive (SOAS)
Primary data: 47:16:49 hours of video plus 11:44:37 hours of audio (as of June 2019)
Secondary data:
o 81:11:06 hours of transcription and translation of 4:43:51 hours of audiovisual recordings (as of June 2019)
Elfner, E., J. E. Rosés Labrada and P. A. Shaw (2016, 2019). “Segmental Contrasts in Kwak’wala” Corpus
Primary data: 19:45:19 hours of audio
Hashimoto, E. & Rosés Labrada, J. E. (2018). Time-aligned Corpus of Makah Stories (based on William Jacobsen’s documentation). Deposited with the California Languages Archive (collection # 2018-32)
Time-aligned data: 1 hour of audio + 94 pages of handwritten transcription/translation
Rosés Labrada, J. E., Thiago Chacon & Francia Medina (2017). Arutani and Shirián Documentation Corpus. Deposited with Archive of the Indigenous Languages of Latin America (University of Texas, Austin)
Primary data: 13:25:24 hours of audio and 2:37:41 hours of video
Rosés Labrada, J. E. & Francia Medina (2017). Sapé Documentation Corpus. Deposited with the Endangered Language Fund (Yale University) and Archive of the Indigenous Languages of Latin America (University of Texas, Austin)
Primary data: 6:14:48 hours of video
Other material: 00:51:52 hours of Shirián (ISO: shb) discourse
Rosés Labrada, J. E. (2012-2015). Mako Documentation Corpus. (Partly) Deposited with the Endangered Languages Archive (SOAS)
Primary data: 54:40:46 hours of audio and 23:05:07 hours of video
Secondary data:
o 178:42:14 hours of transcription and translation of 10:55:37 hours of audio recordings
o 1,202 pages (23.5 cm x 18.4 cm) of transcription and translation notes
Tennant, J., D. Heap, J.E. Rosés Labrada, A. Hernández et al. (Ongoing) Cuban Spanish Corpus
Primary data: 10:65:33 hours of recorded interviews with speakers of Holguín Spanish
Secondary data: 7:14:8 hours coded in Praat
CURATION OF LEGACY COLLECTIONS
Hortensia Estrada Ramírez’s Collection of Colombian Piaroa lexical data, gathered in 2008. (now deposited at ELAR)
o Primary data: 10:10:10 hours of audio
Jon Landaburu’s Collection of Sáliba materials, gathered 1968. (now digitized)
Primary data: 718 items (pages + slips) and 11:29:24 of audio
Lajos Bóglar & Istvan Halmos’s Collection of Piaroa materials, gathered 1967-1968 and housed at the Institute for Musicology, Hungarian Academy of Sciences. (now segmented in SayMore)
Primary data: 96:56:38 hours of audio
Larry Krute’s Collection of Piaroa materials, gathered 1963. (now archived at ELAR)
Primary data: 03:09:46 hours of audio
Daisy Barreto’s Collection of Mako photos and negatives, gathered 1973-1974. (now digitized and awaiting archiving at AILLA)
Primary data: 261 items