If you use these data, please cite:
- the original source:
  Rosenberg, Carl Benjamin Hermann von. 1853. De Mentawei-Eilanden en Hunne Bewoners. Tijdschrift voor Indische Taal-, Land- en Volkenkunde 1. 403–440.
- the derived dataset, using the DOI of the specific released version you used.
This dataset is licensed under a CC BY-NC-SA 4.0 license (https://creativecommons.org/licenses/by-nc-sa/4.0/deed.en).
The original source is available online at https://www.digitale-sammlungen.de/en/view/bsb10433845?page=450,451
According to the Rights Statement shown at the bottom of that page, this digitised journal is under a "No Copyright - Non-Commercial Use Only" condition.
Before the CLDF conversion in Python, the materials in this repository (inside the data directory) were processed in R as an RStudio project (the R scripts are in the codes directory). The English glosses of the Dutch were generated with the DeepL translator via the deeplr R package.
As a long-time R user, I created this repository as practice in getting started with the cldfbench workflow in Python, which implements the Cross-Linguistic Data Format (CLDF). I would like to apply and extend this workflow to the Enggano lexical resources project I have been part of. A further motivation is to (i) document this legacy data in a computer-readable format, (ii) enrich its content following the CLDF standard, and (iii) contribute to ongoing research on the languages of the Barrier Islands of Sumatra, Indonesia, extending the Enggano language project.
- Varieties: 1 (linked to 1 different Glottocodes)
- Concepts: 267 (linked to 255 different Concepticon concept sets)
- Lexemes: 271
- Sources: 1
- Synonymy: 1.01
- Invalid lexemes: 0
- Tokens: 1,575
- Segments: 31 (0 BIPA errors, 0 CLTS sound class errors, 31 CLTS modified)
- Inventory size (avg): 31.00
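
The two ratio figures in the list above can be checked from the counts beside them. The sketch below assumes that synonymy is the mean number of lexemes per concept and that average inventory size is the number of segments divided by the number of varieties; these definitions are an assumption made for illustration, not taken from the tooling that generated the statistics.

```python
# Counts copied from the statistics list above.
lexemes = 271
concepts = 267
varieties = 1
segments = 31

# Assumed definition: mean number of lexemes per concept.
synonymy = lexemes / concepts

# Assumed definition: segments per variety (a single variety here).
avg_inventory = segments / varieties

print(f"Synonymy: {synonymy:.2f}")
print(f"Inventory size (avg): {avg_inventory:.2f}")
```

Under these assumptions the formatted values agree with the reported 1.01 and 31.00.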
Name | GitHub user | Description | Role |
---|---|---|---|
Gede Primahadi W. Rajeg | @gederajeg | Digitisation, Code, CLDF conversion, Concepticon mapping, Orthography profiling | Maintainer |
The following CLDF datasets are available in cldf:
- CLDF Wordlist at cldf/cldf-metadata.json
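
As a minimal sketch of how the wordlist might be inspected, the following uses only the Python standard library to read the metadata file and list the tables it declares. It assumes the metadata follows the usual CLDF layout (a top-level `tables` list whose entries carry `url` and, optionally, `dc:conformsTo`); the helper names `list_cldf_tables` and `read_rows` are hypothetical and not part of this repository.

```python
import csv
import json
from pathlib import Path


def list_cldf_tables(metadata_path):
    """Return (component, csv_path) pairs declared in a CLDF metadata file."""
    metadata_path = Path(metadata_path)
    with metadata_path.open(encoding="utf-8") as fh:
        metadata = json.load(fh)
    tables = []
    for table in metadata.get("tables", []):
        # "dc:conformsTo" typically ends in a fragment such as "#FormTable";
        # it may be absent or null for custom tables.
        component = (table.get("dc:conformsTo") or "").rsplit("#", 1)[-1]
        # Table URLs are resolved relative to the metadata file.
        tables.append((component, metadata_path.parent / table["url"]))
    return tables


def read_rows(csv_path):
    """Read one CLDF table (a plain CSV file) into a list of dicts."""
    with Path(csv_path).open(encoding="utf-8", newline="") as fh:
        return list(csv.DictReader(fh))


if __name__ == "__main__":
    meta = Path("cldf/cldf-metadata.json")
    if meta.exists():
        for component, path in list_cldf_tables(meta):
            count = len(read_rows(path)) if path.exists() else "missing"
            print(component or "(custom table)", path.name, count)
```

For real work, the pycldf package provides a fuller reader (for example `Dataset.from_metadata`) that also validates the data against the metadata; the sketch above only illustrates the file layout.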