AmericasNLP 2025 Shared Task 1: Machine Translation Systems for Indigenous Languages
What?
The AmericasNLP 2025 Shared Task on machine translation systems for Indigenous languages is a competition aimed at encouraging the development of machine translation (MT) systems for Indigenous languages of the Americas. Participants will build systems that translate between Spanish and an Indigenous language.
Why?
Many of the Indigenous languages of the Americas are so-called low-resource languages: the parallel data needed to train MT systems is limited. This means that many approaches designed for translating between high-resource languages, such as English and Chinese, are not directly applicable or perform poorly. Additionally, many Indigenous languages exhibit linguistic properties uncommon among the languages frequently studied in natural language processing (NLP). For instance, many are polysynthetic, which poses an additional difficulty. The goal of the AmericasNLP 2025 shared task on machine translation systems for Indigenous languages is to motivate researchers to take on the challenge of developing MT systems for these languages.
How?
For 2025, we are including new languages, and we are also expanding the shared task to cover both translation from Spanish into an Indigenous language and translation from an Indigenous language into Spanish. These directions are represented as two tracks within the shared task:
- Track 1: Systems which translate from Spanish into an Indigenous language (Es-XX)
- Track 2: Systems which translate from an Indigenous language into Spanish (XX-Es)
Which languages?
The following language pairs are featured in the shared task:
- Awajun–Spanish
- Hñähñu–Spanish
- Wixarika–Spanish
- Nahuatl–Spanish
- Guaraní–Spanish
- Bribri–Spanish
- Rarámuri–Spanish
- Quechua–Spanish
- Aymara–Spanish
- Shipibo-Konibo–Spanish
- Asháninka–Spanish
- Chatino–Spanish
- Wayuunaiki–Spanish
Important Dates
- Release of training and development sets: January 30th, 2025
- Release of baseline systems and baseline results: February 15th, 2025
- Release of test inputs: March 14th, 2025
- Submission of results and system description (shared task deadline): March 21st, 2025
- Announcement of winners: March 22nd, 2025
- Notification of acceptance: March 23rd, 2025
- Camera-ready papers due: March 27th, 2025
- Workshop: May 4th, 2025
Organizers
Abteen Ebrahimi, Arturo Oncevay, Pavel Denisov, Robert Pugh, Ona de Gibert Bonet, Raúl Vázquez, Manuel Mager, Luis Chiruzzo, Rolando Coto-Solano, Katharina von der Wense, Shruti Rijhwani
Contact: americas.nlp.workshop@gmail.com

We thank our sponsors
- Platinum: Amazon Web Services
- Gold: Google Research
- Bronze: Aditu
Design: Rebeca Guerrero and Manuel Mager