Wordlist CLDF dataset derived from Starostin's "Annotated Swadesh Wordlists for the Karen Group" from 2017

CLDF Metadata: cldf-metadata.json

Sources: sources.bib

property value
dc:bibliographicCitation Starostin, George S. (2017): Annotated Swadesh Wordlists for the Karen Group. Moscow: The Global Lexicostatistical Database.
dc:conformsTo CLDF Wordlist
dc:format
  1. http://concepticon.clld.org/contributions/Starostin-1991-110
dc:identifier https://starling.rinet.ru/cgi-bin/response.cgi?root=new100&morpho=0&basename=new100\stb\krn&limit=-1
dc:license https://creativecommons.org/licenses/by/4.0/
dcat:accessURL https://github.com/lexibank/starostinkaren
prov:wasDerivedFrom
  1. lexibank/starostinkaren fe231e5
  2. Glottolog v5.0
  3. Concepticon v3.2.0
  4. CLTS v2.3.0
prov:wasGeneratedBy
  1. lingpy-rcParams: lingpy-rcParams.json
  2. python: 3.12.4
  3. python-packages: requirements.txt
rdf:ID starostinkaren
rdf:type http://www.w3.org/ns/dcat#Distribution

Table forms.csv

Raw lexical data item as it can be pulled out of the original datasets.

This is the basis for creating rows in CLDF representations of the data by

  • splitting the lexical item into forms
  • cleaning the forms
  • potentially tokenizing the form
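
The three steps above can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used to build this dataset: the function names, the separator set, and the tokenization rule are all hypothetical. The presence of the Graphemes and Profile columns suggests that the real segmentation relies on an orthography profile, which this sketch does not model.

```python
import re
import unicodedata


def split_forms(value: str) -> list[str]:
    """Split a raw lexical item into variant forms.

    Assumption: variants are separated by ',', ';', '/' or '~'.
    Separators inside brackets are not handled specially.
    """
    return [part.strip() for part in re.split(r"[,;/~]", value) if part.strip()]


def clean_form(form: str) -> str:
    """Remove bracketed remarks and normalize the form to Unicode NFC."""
    form = re.sub(r"\([^)]*\)|\[[^\]]*\]", "", form)
    return unicodedata.normalize("NFC", form).strip()


def tokenize(form: str) -> list[str]:
    """Naive segmentation: a base character plus its combining marks
    forms one segment (segments are returned in NFD form)."""
    segments: list[str] = []
    for ch in unicodedata.normalize("NFD", form):
        if unicodedata.combining(ch) and segments:
            segments[-1] += ch  # attach the diacritic to the previous base character
        elif not ch.isspace():
            segments.append(ch)
    return segments


def process_value(value: str) -> list[dict]:
    """Turn one raw Value into row dicts with Value, Form and Segments."""
    rows = []
    for raw in split_forms(value):
        form = clean_form(raw)
        if form:
            rows.append({"Value": value, "Form": form, "Segments": tokenize(form)})
    return rows
```

Calling `process_value("abc (dial.), de")` on this invented input yields two rows, one per variant, each keeping the original Value next to its cleaned Form and its Segments.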
property value
dc:conformsTo CLDF FormTable
dc:extent 1055

Columns

Name/Property Datatype Description
ID string Primary key
Local_ID string
Language_ID string References languages.csv::ID
Parameter_ID string References parameters.csv::ID
Value string
Form string
Segments list of string (separated by a space)
Comment string
Source list of string (separated by ;) References sources.bib::BibTeX-key
Cognacy string
Loan boolean
Graphemes string
Profile string
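
The list-valued and boolean columns above can be parsed with the standard library alone. The sketch below assumes the file is plain comma-separated CSV with a header row and that booleans are written as `true`/`false`. The sample rows are invented placeholders, not data from this dataset, and `parse_form_row` is a hypothetical helper name.

```python
import csv
import io

# Invented placeholder rows that mimic a subset of the forms.csv columns.
SAMPLE = """ID,Language_ID,Parameter_ID,Value,Form,Segments,Source,Loan
lang1-concept1-1,lang1,concept1,abc,abc,a b c,key1;key2,false
lang2-concept1-1,lang2,concept1,de,de,d e,,true
"""


def parse_form_row(row: dict) -> dict:
    """Convert the string cells of one row into richer Python values."""
    parsed = dict(row)
    # Segments: list of string, separated by a space.
    parsed["Segments"] = row["Segments"].split()
    # Source: list of BibTeX keys, separated by ';'. Empty cell gives an empty list.
    parsed["Source"] = [key for key in row["Source"].split(";") if key]
    # Loan: boolean. Anything other than true/false is treated as missing (None).
    parsed["Loan"] = {"true": True, "false": False}.get(row["Loan"].strip().lower())
    return parsed


forms = [parse_form_row(row) for row in csv.DictReader(io.StringIO(SAMPLE))]
```

To read the actual file, `io.StringIO(SAMPLE)` would be replaced by `open("forms.csv", encoding="utf-8", newline="")`.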
Table languages.csv

property value
dc:conformsTo CLDF LanguageTable
dc:extent 10

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Glottocode string
Glottolog_Name string
ISO639P3code string
Macroarea string
Latitude decimal ≥ -90, ≤ 90
Longitude decimal ≥ -180, ≤ 180
Family string
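
The range constraints on Latitude and Longitude can be checked with a small validator. This is a minimal sketch; the helper names are hypothetical, and treating an empty cell as a missing (and therefore acceptable) value is an assumption.

```python
from decimal import Decimal, InvalidOperation


def in_range(text: str, low: str, high: str) -> bool:
    """Return True if text is empty or a finite decimal within [low, high]."""
    if text == "":
        return True  # assumption: an empty cell means the coordinate is unknown
    try:
        value = Decimal(text)
    except InvalidOperation:
        return False  # not a decimal number at all
    return value.is_finite() and Decimal(low) <= value <= Decimal(high)


def valid_latitude(text: str) -> bool:
    return in_range(text, "-90", "90")


def valid_longitude(text: str) -> bool:
    return in_range(text, "-180", "180")
```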
Table parameters.csv

property value
dc:conformsTo CLDF ParameterTable
dc:extent 110

Columns

Name/Property Datatype Description
ID string Primary key
Name string
Concepticon_ID string
Concepticon_Gloss string
Table cognates.csv

property value
dc:conformsTo CLDF CognateTable
dc:extent 1055

Columns

Name/Property Datatype Description
ID string Primary key
Form_ID string References forms.csv::ID
Form string
Cognateset_ID string
Doubt boolean
Cognate_Detection_Method string
Source list of string (separated by ;) References sources.bib::BibTeX-key
Alignment list of string (separated by a space)
Alignment_Method string
Alignment_Source string
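
Rows of the cognate table are typically consumed by grouping them on Cognateset_ID. The sketch below shows one way to do that with the standard library. The sample rows are invented placeholders, `group_cognates` is a hypothetical helper, and the use of `-` as a gap symbol in alignments is an assumption.

```python
import csv
import io
from collections import defaultdict

# Invented placeholder rows that mimic a subset of the cognate table columns.
SAMPLE = """ID,Form_ID,Form,Cognateset_ID,Doubt,Alignment
cog1,lang1-concept1-1,abc,set1,false,a b c
cog2,lang2-concept1-1,ac,set1,false,a - c
cog3,lang3-concept1-1,de,set2,true,d e
"""


def group_cognates(rows) -> dict:
    """Map each Cognateset_ID to a list of (Form_ID, alignment) pairs."""
    sets = defaultdict(list)
    for row in rows:
        # Alignment: list of string, separated by a space.
        sets[row["Cognateset_ID"]].append((row["Form_ID"], row["Alignment"].split()))
    return dict(sets)


cognate_sets = group_cognates(csv.DictReader(io.StringIO(SAMPLE)))
```

Within one cognate set, aligned forms are expected to have the same number of alignment columns, which makes a quick consistency check possible after grouping.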