Skip to content

CLDF dataset derived from Starostin's "Annotated Swadesh Wordlists for the Karen Group" from 2017

License

Notifications You must be signed in to change notification settings

lexibank/starostinkaren

Repository files navigation

CLDF dataset derived from Starostin's "Annotated Swadesh Wordlists for the Karen Group" from 2017

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Starostin, George S. (2017): Annotated Swadesh Wordlists for the Karen Group. Moscow: The Global Lexicostatistical Database.

  • the derived dataset using the DOI of the particular released version you were using

Description

This dataset is licensed under a CC-BY-4.0 license

Available online at https://starling.rinet.ru/cgi-bin/response.cgi?root=new100&morpho=0&basename=new100\stb\krn&limit=-1

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 10 (linked to 10 different Glottocodes)
  • Concepts: 110 (linked to 110 different Concepticon concept sets)
  • Lexemes: 1,055
  • Sources: 1
  • Synonymy: 1.04
  • Cognacy: 1,055 cognates in 232 cognate sets (72 singletons)
  • Cognate Diversity: 0.13
  • Invalid lexemes: 0
  • Tokens: 3,952
  • Segments: 90 (0 BIPA errors, 0 CLTS sound class errors, 90 CLTS modified)
  • Inventory size (avg): 41.50

Contributors

Name GitHub user Description Role
Robert Forkel @xrotwang initial code Other
Christoph Rzymski @chrzyki patron Editor
Johann-Mattis List @LinguList code, profile, maintainer Editor
George S. Starostin data collection Author, DataCurator

CLDF Datasets

The following CLDF datasets are available in cldf: