Skip to Content

Corpora list

B2W-Reviews01

Linkhttps://github.com/b2wdigital/b2w-reviews01
TitleB2W-Reviews01 - A Brazilian Portuguese reviews corpus
Presented byReal, L. , Oshiro, M. Mafra, A.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2019

The CIEMPIESS Proper-Names Pronouncing Dictionary

Linkhttps://mega.nz/#!NoZ3XY4Y!kROQmlt0tlhDZUngvauC7NnWi3HyDYD87jEUkyiRStE
TitleThe CIEMPIESS Proper-Names Pronouncing Dictionary
Presented byHernández-Mena, C.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusavailable
Typetext
Year2019

Corpus Reacción

Linkhttps://github.com/lyr-uam/CorpusReaccion
TitleCorpus Reacción: consumers engagement in Facebook posts
Presented byRosas-Quezada, E. , Ramírez-de-la-Rosa, G. Villatoro-Tello, E.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusavailable
Typetext
Year2019

COPENOR

Linkhttps://gitlab.com/manuel.wortens/copenor
TitleConstrucción del Corpus Periodístico del Noroeste de México (COPENOR)
Presented bySánchez-Fernández, M. Medina-Urrea, A.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusin development
Typetext
Year2019

AIRA

Linkhttps://aira.iimas.unam.mx
TitleAIRA: Acoustic Interactions for Robot Audition
Presented byRascón, C. Velez, I.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusavailable
Typespeech
Year2019

IESC-Child

TitleIESC-Child: An Interactive Emotional Children’s Speech Corpus
Presented byPérez-Espinosa, H. , Martínez-Miranda, J. , Espinosa-Curiel, I. , Rodríguez-Jacobo, J. , Villaseñor-Pineda, L. Avila-George, H.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusavailable
Typespeech
Year2019

Obras Brasileiras

Linkhttp://www.linguateca.pt/
TitleOBras: a fully annotated and partially human-revised corpus of Brazilian literary works in the public domain
Presented bySantos, D. , Freitas, C. Bick, E.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2018

Fake.Br

Linkhttps://sites.google.com/icmc.usp.br/opinando/
TitleThe Fake.Br corpus – a corpus of fake news for Brazilian Portuguese
Presented bySantos, R. , Monteiro, R. Pardo, T.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2018

The Bosque Corpus

Linkhttps://github.com/UniversalDependencies/UD_Portuguese-Bosque/
TitlePortuguese Universal Dependencies via Bosque
Presented bydePaiva, V. , Freitas, C. , Rademaker, A. , Real, L. Chalub, F.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2018

Corpus of Southern Qichwa

Linkhttps://siminchikkunarayku.pe/raw_audio.html
TitleOn the Building of the Large Scale Corpus of Southern Qichwa
Presented byCamacho, L. , Zevallos, R. Melgarejo., N.
LanguageSouthern Qichwa
Language codequ-PE
Categoryresource
Statusavailable
Typespeech
Year2018

Corpora of Mexican Sign Language (MSL)

Linkhttp://cienciadedatosupiita.com/content/bd_creation.sql
TitleMultimedia Corpora of Mexican Sign Language (MSL) with Syntactic Functions.
Presented byPichardo-Lagunas, O. Martinez-Seis, B.
LanguageMexican Sign Language
Language codesgn-MX
Categoryresource
Statusavailable
Typemultimedia
Year2018

CorPop

Linkhttp://www.ufrgs.br/textecc/porlexbras/corpop/index.php
TitleCorPop: a corpus of popular Brazilian Portuguese
Presented byPasqualini, B. Finatto., M.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2018

HWxPI

Linkhttps://competitions.codalab.org/competitions/18362
TitleHWxPI: A Multimodal Spanish Corpus for Personality Identification
Presented byRamírez-De-La-Rosa, G. , Villatoro-Tello, E. Jiménez-Salazar, H.
LanguageMexican Spanish
Language codees-MX
Categoryresource
Statusavailable
Typeimage
Year2018

SICK-BR

Linkhttps://github.com/livyreal/SICK-BR
TitleA brief description of SICK-BR
Presented byReal, L. , Rodrigues, A. , Silva, A. , Thalenberg, B. , Guide, B. , Silva, C. , Câmara, I. , Lima, G. , Souza, R. Paiva, V.
LanguageBrazilian Portuguese
Language codept-BR
Categoryresource
Statusavailable
Typetext
Year2018

The Wixarika-Spanish Parallel Corpus

Linkhttps://github.com/pywirrarika/wixarikacorpora
TitleThe Wixarika-Spanish Parallel Corpus
Presented byMager, J. , Carrillo, D. Meza, I.
LanguageWixarika
Language codehch-MX (iso 639-3)
Categoryresource
Statusavailable
Typetext
Year2018