Skip to content

Latest commit

 

History

History
63 lines (39 loc) · 2.54 KB

README.md

File metadata and controls

63 lines (39 loc) · 2.54 KB

CLDF dataset derived from Allen's "Bai Dialect Survey" from 2007

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Allen, Bryan (2007): Bai Dialect Survey. Dallas: SIL International.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at http://www.sil.org/resources/publications/entry/9121

Conceptlists in Concepticon:

Notes

This dataset comprises 9 varieties of Bai, a Sino-Tibetan language whose origin is still vividly discussed among scholars. We have slightly modified the IPA representation and added morphological segmentation markers to the data as well.

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 9 (linked to 9 different Glottocodes)
  • Concepts: 499 (linked to 499 different Concepticon concept sets)
  • Lexemes: 4,546
  • Sources: 1
  • Synonymy: 1.01
  • Invalid lexemes: 0
  • Tokens: 21,931
  • Segments: 111 (0 BIPA errors, 0 CLTS sound class errors, 110 CLTS modified)
  • Inventory size (avg): 59.56

Contributors

Name GitHub user Description Role
Johann-Mattis List @LinguList maintainer Editor
Bryan Allen data collector DataCollector, Author

CLDF Datasets

The following CLDF datasets are available in cldf: