This repository contains resources to support testing and development efforts around language support for BC Indigenous writing systems. Some examples of where these resources may be of use:
- Testing fonts – Is your font equipped to handle Indigenous languages in the correct way?
- Building and testing keyboards – Does your keyboard employ the Unicode characters that are correct for the language you're working with?
- Checking for Unicode discrepancies within Indigenous-language data sets – Are there 'confusables' within a document or data set that may cause issues later on?
- Improving search functionality within Indigenous-language software – Are there instances of 'confusables' in your language data that may be reducing functionality?
- Other – If you have an application for this work that we haven't considered, please let us know!
Read more about these resources and their context on our Knowledge Base.
This directory contains the FirstVoices Confusables and Adjacency Pair Databases, with frequency analysis numbers drawn from FirstVoices language data.
This directory contains information about alphabets for each variety of a BC Indigenous language that has been published on the FirstVoices platform.
These databases are published under the APACHE LICENSE, VERSION 2.0
Our team would like to track the scope of this work, and understand its broader applications as well as potentially share your work for inspiration on how to improve Indigenous language support in digital spaces. Please email hello@firstvoices.com to let us know about where and how this database is being used for your project.