Collection of NLP resources related to K’iche’ Mayan Language
Published:
K’iche’ is one of the most spoken Mayan language and in Guatemala with approximately one million of speakers. Nonetheless, this language is poorly supported digitally and many native K’iche’ speakers who has not learnt spanish struggle using digital media, this is the reality for many other Mayan language speakers from the Mexico southern states to Central America.
Indeed, such languages are difficult to promote in digital space due the lack of resources and investment from official authorities. However, few efforts has already been done from academic environments and non-profit organizations. This post only collects those that are directly related to Natural Language Processing (NLP) works.
Last updated: June 8, 2026
A morphological analyser for K’iche
Richardson, I. (2021). A morphological analyser for K’iche. Proces. Del Leng. Natural. https://doi.org/10.26342/2021-66-8
A corpus of K’iche’ annotated for morphosyntactic structure
Tyers, F. M., & Henderson, R. (2021). A corpus of K’iche’ annotated for morphosyntactic structure. In M. Mager, A. Oncevay, A. Rios, I. V. M. Ruiz, A. Palmer, G. Neubig, & K. Kann (Eds.), Proceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021 (pp. 10-20). (Proceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.americasnlp-1.2
Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars
Andrés Lou, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, and Víctor Sánchez-Cartagena. 2024. Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 2838–2850, Mexico City, Mexico. Association for Computational Linguistics.
