Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make database format more language-agnostic #115

Open
loafofpiecrust opened this issue May 20, 2021 · 0 comments
Open

Make database format more language-agnostic #115

loafofpiecrust opened this issue May 20, 2021 · 0 comments

Comments

@loafofpiecrust
Copy link
Collaborator

There are a few areas of the back-end that we assume only Cherokee data will be represented.
We can start re-engineering some of that code to allow for extension into other languages.

  • Layers of linguistic annotation should be an extensible enumeration (enum AnnotationLayer) rather than simply optional fields (phonemic: Option<String>). This encourages a cleaner architecture where we add new variants to AnnotationLayer instead of more fields to AnnotatedWord. It also allows the data to define a specific order of layers, though that may not be desirable anyhow because generally layers of linguistic analysis are presented in a specific static order.
  • Phonetic and morpheme layers may be returned in several orthographies, which are Cherokee specific.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants