CLDF dataset derived from Johansson et al.'s "The typology of sound symbolism" (2020)

lexibank/johanssonsoundsymbolic

CLDF validation

How to cite

If you use these data, please cite:

  • the original source

    Erben Johansson, N., Anikin, A., Carling, G., & Holmer, A. (2020). The typology of sound symbolism: Defining macro-concepts via their semantic and phonetic features. Linguistic Typology, 24(2), 253-310. https://doi.org/10.1515/lingty-2020-2034

  • the derived dataset, using the DOI of the specific released version you used

Description

This dataset is licensed under a CC-BY-4.0 license.

Available online at https://osf.io/3dsn6/

Conceptlists in Concepticon:

Statistics

Glottolog: 100% | Concepticon: 83% | Source: 99% | BIPA: 100% | CLTS SoundClass: 100%

  • Varieties: 245 (linked to 245 different Glottocodes)
  • Concepts: 344 (linked to 284 different Concepticon concept sets)
  • Lexemes: 69,963
  • Sources: 240
  • Synonymy: 1.00
  • Invalid lexemes: 0
  • Tokens: 378,472
  • Segments: 446 (0 BIPA errors, 0 CLTS sound class errors, 446 CLTS modified)
  • Inventory size (avg): 34.97

Possible Improvements:

  • Entries missing sources: 914/69963 (1.31%)
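
The counts above can in principle be recomputed from the forms table of the CLDF wordlist. The following is a minimal sketch, not the script used to generate this README: it assumes a forms table with the columns `Language_ID`, `Parameter_ID`, `Segments` (space-separated) and `Source`, and it runs on a small made-up sample instead of the real data. The function name `summarize_forms` and the sample rows are hypothetical.

```python
import csv
import io


def summarize_forms(rows):
    """Compute a few summary counts from form rows given as dicts.

    Assumes each row has Language_ID, Parameter_ID, Segments and Source keys,
    and that Segments holds space-separated sound segments.
    """
    languages, concepts, segments = set(), set(), set()
    lexemes = tokens = missing_sources = 0
    for row in rows:
        lexemes += 1
        languages.add(row["Language_ID"])
        concepts.add(row["Parameter_ID"])
        segs = row["Segments"].split()
        tokens += len(segs)          # every segment occurrence counts as a token
        segments.update(segs)        # distinct segment types
        if not row["Source"].strip():
            missing_sources += 1
    return {
        "varieties": len(languages),
        "concepts": len(concepts),
        "lexemes": lexemes,
        "tokens": tokens,
        "segments": len(segments),
        "missing_sources": missing_sources,
        "missing_sources_pct": 100 * missing_sources / lexemes if lexemes else 0.0,
    }


# Made-up sample rows for illustration only; not taken from the dataset.
SAMPLE = """ID,Language_ID,Parameter_ID,Form,Segments,Source
1,lang_a,small,iti,i t i,Smith2000
2,lang_a,big,bomo,b o m o,
3,lang_b,small,kiki,k i k i,Jones1999
"""

stats = summarize_forms(csv.DictReader(io.StringIO(SAMPLE)))
print(stats)

# Check of the share reported under "Possible Improvements".
reported_share = 100 * 914 / 69963
print(f"{reported_share:.2f}%")
```

To try this on the actual data, one would pass a `csv.DictReader` opened on the dataset's forms file instead of the sample string; the exact file name and column set should be checked against the CLDF metadata first.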

Contributors

| Name | GitHub user | Description | Role |
| --- | --- | --- | --- |
| Sacha Beniamine | @XachaB | | Other |
| Niklas Erben Johansson | | publication author | Author |
| Kristina Pianykh | @Kristina-Pianykh | concept mapping | Other |
| Johann-Mattis List | @lingulist | maintainer, cldf conversion, profile | Editor |

CLDF Datasets

The following CLDF datasets are available in cldf: