Skip to content

CLDF dataset derived from Allen's "Bai Dialect Survey" from 2007

License

Notifications You must be signed in to change notification settings

lexibank/allenbai

Repository files navigation

CLDF dataset derived from Allen's "Bai Dialect Survey" from 2007

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Allen, Bryan (2007): Bai Dialect Survey. Dallas: SIL International.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at http://www.sil.org/resources/publications/entry/9121

Conceptlists in Concepticon:

Notes

This dataset comprises 9 varieties of Bai, a Sino-Tibetan language whose origin is still vividly discussed among scholars. We have slightly modified the IPA representation and added morphological segmentation markers to the data as well.

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 9 (linked to 9 different Glottocodes)
  • Concepts: 499 (linked to 499 different Concepticon concept sets)
  • Lexemes: 4,546
  • Sources: 1
  • Synonymy: 1.01
  • Invalid lexemes: 0
  • Tokens: 21,931
  • Segments: 111 (0 BIPA errors, 0 CLTS sound class errors, 110 CLTS modified)
  • Inventory size (avg): 59.56

Contributors

Name GitHub user Description Role
Johann-Mattis List @LinguList maintainer Editor
Bryan Allen data collector DataCollector, Author

CLDF Datasets

The following CLDF datasets are available in cldf: