Commit

Merge pull request #13 from lexibank/new-raw-data
Retrieve raw data from ZENODO
LinguList authored Jan 30, 2019
2 parents 9984f56 + 50c53df commit 4a33bb1
Showing 16 changed files with 5,072 additions and 4,751 deletions.
408 changes: 408 additions & 0 deletions LICENSE

Large diffs are not rendered by default.

32 changes: 15 additions & 17 deletions README.md
@@ -2,30 +2,28 @@

Cite the source dataset as

>
> Lieberherr, Ismail and Bodt, Timotheus Adrianus (2017): Sub-grouping Kho-Bwa based on shared core vocabulary. Himalayan Linguistics 16(2). 26-63. URL: https://escholarship.org/uc/item/4t27h5fg
## Statistics
This dataset is licensed under a https://creativecommons.org/licenses/by-nc/4.0/ license

Available online at https://doi.org/10.5281/zenodo.1154518

## Statistics


[![Build Status](https://travis-ci.org/lexibank/lieberherrkhobwa.svg?branch=master)](https://travis-ci.org/lexibank/lieberherrkhobwa)
![Glottolog: 100%](https://img.shields.io/badge/Glottolog-100%25-brightgreen.svg "Glottolog: 100%")
![Concepticon: 100%](https://img.shields.io/badge/Concepticon-100%25-brightgreen.svg "Concepticon: 100%")
![Source: 0%](https://img.shields.io/badge/Source-0%25-red.svg "Source: 0%")
![BIPA: 98%](https://img.shields.io/badge/BIPA-98%25-green.svg "BIPA: 98%")
![CLTS SoundClass: 98%](https://img.shields.io/badge/CLTS%20SoundClass-98%25-green.svg "CLTS SoundClass: 98%")
![Source: 100%](https://img.shields.io/badge/Source-100%25-brightgreen.svg "Source: 100%")
![BIPA: 97%](https://img.shields.io/badge/BIPA-97%25-green.svg "BIPA: 97%")
![CLTS SoundClass: 97%](https://img.shields.io/badge/CLTS%20SoundClass-97%25-green.svg "CLTS SoundClass: 97%")

- **Varieties:** 20
- **Varieties:** 22
- **Concepts:** 100
- **Lexemes:** 1,935
- **Lexemes:** 2,130
- **Synonymy:** 1.00
- **Cognacy:** 1,874 cognates in 234 cognate sets
- **Cognacy:** 2,063 cognates in 243 cognate sets
- **Invalid lexemes:** 0
- **Tokens:** 7,112
- **Segments:** 143 (3 BIPA errors, 3 CTLS sound class errors, 140 CLTS modified)
- **Inventory size (avg):** 48.65

## Possible Improvements:



- Entries missing sources: 1935/1935 (100.00%)
- **Tokens:** 7,944
- **Segments:** 144 (4 BIPA errors, 4 CTLS sound class errors, 140 CLTS modified)
- **Inventory size (avg):** 47.32