hmong-ontology

A semantic ontology resource for the Hmong language, inspired by WordNet (Fellbaum, 1998).

This resource is intended for data science and natural language processing purposes, such as computational modeling of semantics or cleaning of raw data for deep learning.

This repository currently contains the following files:

hmong_ontology.xml : The xml file containing the lexemes (nouns and verbs) and their semantic categories.
stopwords_base.txt : A text file containing Hmong stopwords, excluding classifiers and verbal bound roots/affixes.
stopwords_classifiers.txt : A text file containing Hmong classifiers.
stopwords_verbal_adjuncts.txt : A text file containing Hmong verbal adjuncts, such as bound roots and affixes.
hmongnet.py : A Python code file providing WordNet-style access to the ontology.

TODO:

Create Python code to load the Hmong ontology as a WordNet-style library. Begun as hmongnet.py.
Expand vocabulary based on additional forms attested in speaker community using an automated approach.
Label forms as White Hmong where distinct and add Green Mong forms.
Implement stopword files.
Write documentation.

To consider:

Revisit distinction between object and artifact categories: it does not seem to have significance in Hmong.
Provide an additional xml file containing roots with additional bound root possibilities.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hmong-ontology

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 147 Commits
README.md		README.md
hmong_ontology.xml		hmong_ontology.xml
hmongnet.py		hmongnet.py
stopwords_base.txt		stopwords_base.txt
stopwords_classifiers.txt		stopwords_classifiers.txt
stopwords_verbal_adjuncts.txt		stopwords_verbal_adjuncts.txt

nathanmwhite/hmong-ontology

Folders and files

Latest commit

History

Repository files navigation

hmong-ontology

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages