Corpus PI-YALLI de documents nahuatl

The PI-YALLI corpus of Nahutal documents | El corpus PI-YALLI de documentos nahuatl


"NAHUATL"

The Nahuatl corpus PI-YALLI contains a nahuatl documents (mix of several Nahuatl docuuments), and a set of static embeddings. The source document have several sentences that belong to several topics.

The corpus was manually generated by a lot of persons of Université d'Avignon, Universidad Veracruzana and independent researchers).

This corpus is well suitable for test and learning systems working with the Nahuatl language New versions, with more human texts, will be aggregated periodically.

The PUCES corpus (in utf8 format) is distributed under LGPL license.