We are working on speech- and text-based technologies that aim to assist the stabilization, revitalization and reclamation of Indigenous languages by supporting Indigenous language educators and students, promoting the accessibility of audio recordings, and supporting Indigenous language translators, transcribers and other language professionals.
- Language-independent technology (such as software) will be released to communities as open-source software.
- We will be working under the direction and advice of an Advisory committee, and in close collaboration and partnership with Indigenous community organizations and Indigenous communities across Canada.
- All research done within this project will be compliant with the Tri-Council Research Ethics Policy.
- Budget 2017 invested $89.9M over three years to support Indigenous languages and cultures. We were granted $6M of this funding.
- This project is managed by the NRC's Digital Technologies Research Centre.
- Project to create Inuktut language software and perform new text alignment of the Nunavut Legislative Assembly proceedings
- Project to create online Indigenous language courses
- Project to develop Mohawk verb conjugator and related technologies for Indigenous languages
- Project to segment and index audio recordings of Indigenous languages
- Project to update the Algonquian dictionaries, linguistic atlas, and other learning tools for Indigenous languages
- Project to upgrade the FirstVoices Language Tutor software and create predictive text software for FirstVoices Keyboards
- There are thousands of hours of recordings of Indigenous languages from across the country.
- The recordings can be difficult for Indigenous communities to access and make use of because they are not always fully transcribed, and sometimes are missing metadata (information about what languages are being spoken, who is speaking, etc.).
- To create software that will automatically segment and label audio files while they're being recorded (or shortly afterwards).
- To build and test audio-indexation software that makes it possible to search through existing recordings, including recordings made decades ago, to find key words or phrases.
- The complexity of words in Indigenous languages – in which single, long words made up of many small pieces know as morphemes, can often express what other languages express with entire clauses – poses difficulties for software applications (including both educational and professional software) that lack language-specific word-handling capabilities.
- Teaching how to form words is a central concern in Indigenous language education.
- Word complexity, and, in some languages, the complexity of the writing systems, mean that writing in accordance with official community standards is difficult for many learners.
- To design, in collaboration with instructors, educational tools that support exploratory learning of word formation.
- To develop tools for spell-checking and grammar-checking, for integration with desktop and mobile applications, to help language users at all levels to follow their community's writing standards.
We are taking a "first deep, then broad" approach. Each software tool we build will initially be specialized to one or two Indigenous languages in Canada, but built in a way that allows customization for additional languages.
We are currently working with:
- Cree (several dialects)
- Michif (the Métis language)
- Mohawk (several dialects)
Through thoughtful design, and subsequent testing, we will attempt to ensure that the tools we develop in this way will be adaptable to many different languages after this initial development period.
The following is a list of selected publications by the project team and their collaborators relating to research in Indigenous languages technology.
- Anna Kazantseva, Owennatekha Brian Maracle, Ronkwe'tiyóhstha Josiah Maracle, and Aidan Pine. Kawennón:nis: theWordmaker for Kanyen'kéha. Proceedings of Workshop on Polysynthetic Languages, pages 53–64. Santa Fe, New Mexico, USA, August 20-26, 2018.
- Patrick Littell, Anna Kazantseva, Roland Kuhn, Aidan Pine, Antti Arppe, Christopher Cox, and Marie-Odile Junker. Indigenous language technologies in Canada: Assessment, challenges, and successes. Proceedings of the 27th International Conference on Computational Linguistics, pages 2620–2632. Santa Fe, New Mexico, USA, August 20-26, 2018.
Our project team
Anna Kazantseva, PhD
Computational linguistics of literature (novels and stories); modeling discourse structure of long informal documents; computational linguistics of Iroquoian languages.
Roland Kuhn, PhD (project lead)
Automatic speech recognition; machine translation.
Patrick Littell, PhD (project advisor)
Computational linguistics of low-resource languages; he has worked with several Indigenous languages, including Kwak'wala/Bak'wamk'ala, Gitksan, and Nłeʔkepmxcín (Thompson River Salish).
Development of software for supporting Indigenous languages; he has developed tools in collaboration with Gitksan & Heiltsuk communities.
Software engineering; Applied language modeling; Unicode wrangler.
Natural language processing applications developer. Leads the WeBInuk project, which allows translators to search large amounts of English-Inuktut parallel content.
We are committed to developing technology in collaboration with Indigenous stakeholders, and have implemented an Indigenous Advisory committee to advise on collaborative methodologies and evaluate project implementations.
Chair of the NRC's Indigenous Languages Technology Project Advisory Committee
Secretary-Treasurer, Prairies to Woodlands Indigenous Language Revitalization Circle
Heather is currently directing a new Master-Apprentice Program in Manitoba and is the Secretary-Treasurer of the Prairies to Woodlands Indigenous Language Revitalization Circle. She holds a Bachelor of Arts from the University of British Columbia and a Masters of Education in Indigenous Language Revitalization from the University of Victoria. Heather is reclaiming her heritage language and, in collaboration with Elders, has published educational resources for the Michif language, such as a conversational phrase book and a college level beginner's course. Heather's interests include the use of the Internet to reach language learners in the diaspora and to create technology-mediated speech communities. She is a citizen of the Métis Nation and a member of the Manitoba Métis Federation.
Youth Ambassador, Nak'azdli Whut'en First Nation
Tessa is a member of the Nak'azdli Whut'en First Nation and an eleventh grade student at DP Todd Secondary School in Prince George, BC. Tessa is also a graduate of the First Nations' Technology Council's "Bridging to Technology" program and runs a language project called Dak'elh K'una which is organizing the creation of a Dak'elh language app and immersion summer camp.
Senior Instructor, Pirurvik Centre
Amanda is a graduate of the Nunavut Teacher Education Program (NTEP) who started her career teaching in Apex, and later at Iqaluit's Joamie School. As the Executive Director for Tumikuluit Saipaaqivik, she led Iqaluit's only Inuktut immersion daycare. Amanda grew up in a rich Inuktut speaking and cultural environment in Panniqtuuq with her maternal grandparents. As well as teaching courses in the Pirurvik Centre's Inuktut Revitalization program for Inuit and Inuktut Second Langauge, Amanda assists with the design, writing and teaching of new programs and learning resources.
Director of Programs and Student Support, Mi'kmaw Kina'matnewey
Blaire is the Director of Programs and Student Support at Mi'kmaw Kina'matnewey. She comes from the Mi'kmaq district of Unama'ki and is a proud L'nu'skw and speaker. She strives to advance the educational opportunities and rights for the Mi'kmaq people. Blaire has continued to pursue new and innovating ways to infuse language and culture into the 21st century. She is part of an inspiring team of Mi'kmaq scholars and educators whose collective and individual contributions to Mi'kmaw education have created space for Mi'kmaq innovation in the education system.
Glenn Karonhiio Morrison
Senior Advisor, Natural Resources Canada
Glenn is a Senior Advisor at Natural Resources Canada (formerly Indigenous Policy Manager, Aboriginal Peoples Program, Canadian Heritage). As former Executive Director of the First Nations Confederacy of Cultural Education Centres in the 1990s, he managed the first online presence of a First Nations organization in Canada in 1992 using a bulletin board program and 3rd-party software. More recently, Glenn chaired the interdepartmental Indigenous Languages Translation/Technology Working Group involving Canadian Heritage, Library and Archives Canada, National Research Council of Canada, Parliamentary Translation Bureau and others. He has a longstanding interest in the revitalization of Indigenous languages. Currently, he is on one of NRCan's consultation teams working with BC First Nations on the Trans Mountain pipeline expansion project. He successfully completed the first level of Onkwawenna Kentyohkwa's online Kanien'kehá:ka language program and is a member of the Mohawks of Kahnawá:ke.
Oral History and Language Lab Manager, Museum of Anthropology, University of British Columbia
Gerry is a proud member of the Heiltsuk First Nation and manages the Oral History and Language Lab at the UBC Museum of Anthropology. With over 15 years in the field of Information Management and Heritage Digitization, he works to develop practical, scalable resources for Indigenous cultural heritage preservation, and to decolonize information practices. Gerry also acts as the Technology Lead for the innovative UBC Indigitization Program and sits on the Board of Directors for the First Peoples' Cultural Council.
Youth Ambassador, University of Alberta
Delaney is a fourth year undergraduate student at the University of Alberta majoring in Computer Science and Math. Since her early teens she has worked with her home community of Lac Ste. Anne documenting culture and history. She is working on an interdisciplinary project to develop a language learning system for the Y-dialect of Cree under the supervision of University of Alberta Computing Science professor Dr. Carrie Demmans Epp and University of Alberta Cree professor Dorothy Thunder.
Megan Lukaniec is Wendat from the Huron-Wendat Nation of Wendake, Québec and an Assistant Professor of Indigenous Language Revitalization in the Department of Linguistics at the University of Victoria. Since 2006, she has been working with and for her community in order to reawaken and reclaim the Wendat (Iroquoian) language, which was dormant for well over a century. Within the scope of a SSHRC CURA grant (2007-2012) awarded to the Huron-Wendat Nation and Université Laval, her role as a linguist included reconstructing the language from legacy documentation, training language teachers, teaching introductory language courses, and creating pedagogical materials. In 2017, with the collaboration of the CDFM Huron-Wendat, she created the initial designs of and reconstructed content for an online trilingual dictionary (Wendat-French-English; wendatlanguage.com). She obtained her Ph.D. in Linguistics from the University of California, Santa Barbara, and for her dissertation work, she reconstructed and described the verb morphology of Wendat.
Associate Professor, University of Victoria
tânisi kiyawaw (greetings to you all). Onowa is maskékow-ininiw (a Swampy Cree person) and Scottish-Canadian, born and raised in Treaty 6 territory. She has been a grateful visitor in SENĆOŦEN and Lekwungen speaking territories for over twenty years and is an urban nêhiyâwiwin language learner and Indigenous language warrior. Onowa is an Associate Professor of Indigenous Education at University of Victoria, where she was the former Director of Indigenous Education in the Faculty of Education. Onowa is co-lead on a Social Science and Humanities Research Council of Canada (SSHRCC) Partnership Grant entitled NEȾOLṈEW̱, which is working to build capacity among Indigenous people and maximize Indigenous language revitalization resources in Canada.
Language Team Lead, University nuhelot'įne thaiyots'į nistameyimâkanak Blue Quills (UnBQ)
Marilyn is a member of the Saddle Lake Cree Nation and has worked in adult education for twenty-seven years, four years in small business and four years in Cree Immersion Head Start programming before devoting her time to Language revitalization for both Cree and Dene at UnBQ. While at UnBQ Marilyn has spearheaded the development of a Bachelor of Arts in Cree and Dene, a Masters in Indigenous Languages, an Elders Senate as well as Language Resource Department which produces audio, video and written resources in both Cree and Dene.
Tina Jules Skayda.û
Director of the Yukon Native Language Centre
Tina is the Director of the Yukon Native Language Centre for the Council of Yukon First Nations. She is of Tlingit, Mountain Slavey and Cree ancestry and is a citizen of the Teslin Tlingit Council. Her Tlingit name is Skayda.û and she belongs to the Dakhlaweidí (Eagle) clan. Tina holds a Bachelor of Education from the University of Regina and is a proud graduate of the Yukon Native Teacher Education Program. Her Master's Degree in Education for curriculum and instruction is from Simon Fraser University. She is a passionate advocate for Indigenous language revitalization and indigenized education.
Nathan Thanyehténhas Brinklow
Lecturer, Queens University
Nathan is an educator of Kanyen'kéha (Mohawk) with years of experience teaching both at the Tsi Tyónnheht Onkwawén:na Language and Cultural Centre (TTO) and at Queens University. Nathan has a strong interest in how computational methods can be applied to language revitalization and pedagogy and has been involved in the development of Indigenous Language and Mohawk Language and Culture certificates in partnership with TTO and Queens University.
Roland Kuhn, PhD
Principal Research Officer, Project Lead