diff --git a/CHANGES.md b/CHANGES.md index 0ab1d11..a829694 100644 --- a/CHANGES.md +++ b/CHANGES.md @@ -3,6 +3,13 @@ Changes between releases of the WALS CLDF dataset. +## [v2020.4] - 2024-10-18 + +- Fixed errata in language metadata. +- Updated Glottocodes to match Glottolog 5.0. +- For a full list of changes run `git diff v2020.3 v2020.4 cldf` on the repository. + + ## [v2020.3] - 2022-12-01 - Changes to language metadata as specified in `raw/languagesMSD_22-09.csv`. diff --git a/RELEASING.md b/RELEASING.md index e805dc8..c1eed03 100644 --- a/RELEASING.md +++ b/RELEASING.md @@ -2,11 +2,11 @@ - Run ```shell - cldfbench makecldf cldfbench_wals.py --glottolog-version v4.6 + cldfbench makecldf cldfbench_wals.py --glottolog-version v5.0 ``` - Run ```shell - cldfbench cldfreadme cldfbench_wals.py` + cldfbench cldfreadme cldfbench_wals.py ``` - Run ```shell diff --git a/cldf/README.md b/cldf/README.md index f7e6810..8eddfba 100644 --- a/cldf/README.md +++ b/cldf/README.md @@ -13,8 +13,8 @@ property | value [dc:identifier](http://purl.org/dc/terms/identifier) | https://wals.info [dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by/4.0/ [dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/cldf-datasets/wals -[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) |
  1. cldf-datasets/wals v2020.2-6-g42c0da7
  2. Glottolog v4.6
-[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) |
  1. python: 3.8.10
  2. python-packages: requirements.txt
+[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) |
  1. cldf-datasets/wals v2020.2-11-g2955a01
  2. Glottolog v5.0
+[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) |
  1. python: 3.10.12
  2. python-packages: requirements.txt
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | wals [rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution diff --git a/cldf/StructureDataset-metadata.json b/cldf/StructureDataset-metadata.json index 1817fb5..79fd485 100644 --- a/cldf/StructureDataset-metadata.json +++ b/cldf/StructureDataset-metadata.json @@ -16,7 +16,7 @@ { "rdf:about": "https://github.com/cldf-datasets/wals", "rdf:type": "prov:Entity", - "dc:created": "v2020.2-3-ge16d1f6", + "dc:created": "v2020.2-11-g2955a01", "dc:title": "Repository" }, { diff --git a/cldf/docs/chapter_45.html b/cldf/docs/chapter_45.html index 7786f6d..27e517d 100644 --- a/cldf/docs/chapter_45.html +++ b/cldf/docs/chapter_45.html @@ -39,5 +39,5 @@

The avoidance of direct (linguistic) reference to the addressee in the context of face-threatening utterances is the main functional motivation for developing polite referential expressions such as vous  in French, and Sie  in German. The 2pl pronoun vous  in French presumably came into use historically as a polite form of singular address because it renders the reference less direct and less specific (cf. Malsch 1987, Helmbrecht 2002, 2003). Other possible diachronic sources for second-person polite pronouns are first-person plural pronouns (for example in Ainu (Japan)), demonstrative pronouns (for example in Sinhala (Indo-Aryan; Sri Lanka)), reflexive pronouns (for example in Hungarian), and nouns and nominal expressions designating social status (as in Spanish). All these sources of polite second-person pronouns avoid a direct second-person reference in the sense that they initially required some pragmatic inferencing before they were conventionalized as polite means for pronominal reference. For a more detailed treatment of the functional aspects of the grammaticalization of second-person polite pronouns, see Helmbrecht (2002: ch. 9).

5. Conclusions

-

The uneven distribution of politeness distinctions in pronouns a cross the languages of the world suggests that there are other conditioning factors that have to be taken into account. Language contact and the social and cultural disposition to adopt linguistic means which are used to express politeness in neighboring languages that have a high prestige seem to be more important as a determining factor than the general functional background of polite language use. It is this social and cultural disposition of the adopting society which is responsible for the selection of certain forms as politeness forms.

- \ No newline at end of file +

The uneven distribution of politeness distinctions in pronouns across the languages of the world suggests that there are other conditioning factors that have to be taken into account. Language contact and the social and cultural disposition to adopt linguistic means which are used to express politeness in neighboring languages that have a high prestige seem to be more important as a determining factor than the general functional background of polite language use. It is this social and cultural disposition of the adopting society which is responsible for the selection of certain forms as politeness forms.

+ diff --git a/cldf/languages.csv b/cldf/languages.csv index 2d41929..763b003 100644 --- a/cldf/languages.csv +++ b/cldf/languages.csv @@ -395,7 +395,7 @@ but,Buriat,Eurasia,52,108,mong1330,bxm,Altaic,,Mongolic,,bxm,false,false,CN RU M buu,Buru,Papunesia,-3.5,126.5,buru1303,mhs,Austronesian,,Central Malayo-Polynesian,,mhs,false,false,ID,Grimes-1991 Grimes-1995 Hendriks-1897,genus-centralmalayopolynesian buw,Bulu,Africa,3,11,bulu1251,bum,Niger-Congo,Benue-Congo,Bantu,,bum,false,false,CM,Alexandre-1966 Bates-1904,genus-bantu buy,Buli (in Ghana),Africa,10.5,-1.25,buli1254,bwu,Niger-Congo,Gur,Oti-Volta,,bwu,false,false,GH,Kroger-1992,genus-otivolta -bvi,Bali-Vitu,Papunesia,-4.9,149.116666667,bali1280,,Austronesian,Eastern Malayo-Polynesian,Oceanic,,bbn wiv,false,false,PG,Ross-2002c,genus-oceanic +bvi,Bali-Vitu,Papunesia,-4.9,149.116666667,unea1237,,Austronesian,Eastern Malayo-Polynesian,Oceanic,,bbn wiv,false,false,PG,Ross-2002c,genus-oceanic bwa,Bandjalang (Waalubal),Australia,-29.0833333333,152.583333333,band1339,bdy,Pama-Nyungan,,Southeastern Pama-Nyungan,,bdy,false,false,AU,Crowley-1978,genus-southeasternpamanyungan bwc,Bajau (West Coast),Papunesia,6.33333333333,116.333333333,west2560,bdr,Austronesian,,Sama-Bajaw,,bdr,false,false,MY,Miller-2007,genus-samabajaw bxj,Bayungu,Australia,-23,114,bayu1240,bxj,Pama-Nyungan,,Western Pama-Nyungan,,bxj,false,false,AU,,genus-westernpamanyungan @@ -493,7 +493,7 @@ com,Comorian,Africa,-12,44,maor1244,swb,Niger-Congo,Benue-Congo,Bantu,,swb,false coo,Coos (Hanis),North America,43.5,-124.166666667,coos1249,csz,Oregon Coast,,Coosan,,csz,false,true,US,Frachtenberg-1910 Frachtenberg-1913 Frachtenberg-1922a Nichols-1992 Pierce-1971 Stolz-1996 Zenk-1990,genus-coosan cop,Coptic,Africa,26,32,copt1239,cop,Afro-Asiatic,,Egyptian-Coptic,,cop,false,false,EG,Lambdin-1983 Layton-2000 Mallon-1956 Plisch-1999 Plumley-1948 Shisha-Halevy-1988,genus-egyptiancoptic cor,Cora,North America,22.1666666667,-104.833333333,elna1235,crn,Uto-Aztecan,,Corachol,,crn,false,false,MX,Casad-1984 Casad-1985 Langacker-1976 McMahon-1959 McMahon-1967,genus-corachol -cos,Rumsien,North America,36.8333333333333,-121.75,,,Penutian,Utian,Costanoan,,,false,false,US,Kroeber-1904,genus-costanoan +cos,Rumsien,North America,36.8333333333333,-121.75,rums1243,,Penutian,Utian,Costanoan,,,false,false,US,Kroeber-1904,genus-costanoan cpa,Campa Pajonal Asheninca,South America,-10.6666666667,-74.25,ashe1273,cjo,Arawakan,,Pre-Andine Arawakan,,cjo,false,false,PE,Pike-and-Kindberg-1956 Wise-1978,genus-preandinearawakan cpl,Chinantec (Palantla),North America,18.8333333333,-96.75,pala1351,cpa,Oto-Manguean,,Chinantecan,,cpa,false,false,MX,Bybee-et-al-1994 Merrifield-1968,genus-chinantecan cpn,Chepang,Eurasia,27.6666666667,84.75,chep1245,cdm,Sino-Tibetan,Tibeto-Burman,Himalayish,,cdm,false,false,NP,Bybee-et-al-1994 Caughley-1982 Noonan-2003c,genus-himalayish @@ -720,7 +720,7 @@ gbe,German (Bern),Eurasia,47,7.41666666667,swis1247,gsw,Indo-European,,Germanic, gbk,Gbaya (Northwest),Africa,6,15,nort2775,gya,Niger-Congo,,Gbaya-Manza-Ngbaka,,gya,false,false,CF CM,Monino-and-Roulon-1972 Roulon-1975 Roulon-Doko-1995 Tucker-and-Bryan-1966,genus-gbayamanzangbaka gbl,German (Berlin),Eurasia,52.5,13.3333333333,stan1295,deu,Indo-European,,Germanic,,deu,false,false,DE,,genus-germanic gbs,Gbaya (Southwest),Africa,4.7,14.96,sout2785,gso,Niger-Congo,,Gbaya-Manza-Ngbaka,,gso,false,false,,Tucker-and-Bryan-1966,genus-gbayamanzangbaka -gcy,Greek (Cypriot),Eurasia,34.75,33,cypr1245,ell,Indo-European,,Greek,,ell,false,false,CY,,genus-greek +gcy,Greek (Cypriot),Eurasia,34.75,33,cypr1249,ell,Indo-European,,Greek,,ell,false,false,CY,,genus-greek gdb,Gutob,Eurasia,19,83.6666666667,bodo1267,gbj,Austro-Asiatic,,Munda,,gbj,false,false,IN,Subba-Rao-and-Patnaik-1992,genus-munda gdf,Guduf,Africa,11.2333333333,13.8,gudu1252,gdf,Afro-Asiatic,Chadic,Biu-Mandara,,gdf,false,false,NG CM,,genus-biumandara gdi,Godié,Africa,5.41666666667,-5.83333333333,godi1239,god,Niger-Congo,,Kru,,god,false,false,CI,Marchese-1986a Marchese-1988,genus-kru @@ -970,7 +970,7 @@ jeb,Jebero,South America,-5.41666666667,-76.5,jebe1250,jeb,Cahuapanan,,Cahuapana jeh,Jeh,Eurasia,15.1666666667,107.833333333,jehh1245,jeh,Austro-Asiatic,Mon-Khmer,Bahnaric,,jeh,false,false,VN LA,Cohen-1966 Gradin-1966 Gradin-and-Gradin-1979,genus-bahnaric jel,Jeli,Africa,9.5,-5.66666666667,jeri1242,jek,Mande,,Western Mande,,jek,false,false,CI,Trobs-1998,genus-westernmande jem,Jemez,North America,35.8333333333,-107,jeme1245,tow,Kiowa-Tanoan,,Kiowa-Tanoan,,tow,false,false,US,Yumitani-1998,genus-kiowatanoan -jia,Jiarong,Eurasia,31.5,102,jiar1239,jya,Sino-Tibetan,Tibeto-Burman,Na-Qiangic,,jya,false,false,CN,,genus-naqiangic +jia,Jiarong,Eurasia,31.5,102,jiar1240,jya,Sino-Tibetan,Tibeto-Burman,Na-Qiangic,,jya,false,false,CN,,genus-naqiangic jib,Jibbali,Eurasia,17.5,55,sheh1240,shv,Afro-Asiatic,,Semitic,,shv,false,false,OM,Lonnet-and-Simeone-Senelle-1997,genus-semitic jin,Jino,Eurasia,22,101,jino1236,jiu,Sino-Tibetan,Tibeto-Burman,Burmese-Lolo,,jiu,false,false,CN,Gai-1986,genus-burmeselolo jiv,Jivaro,South America,-2.5,-78,shua1257,jiv,Jivaroan,,Jivaroan,,jiv,false,false,EC,Beasley-and-Pike-1957 Beuchat-and-Rivet-1909 Bybee-et-al-1994 Ghinassi-1938 Nichols-1992 Pellizzaro-1969 Turner-1958,genus-jivaroan @@ -985,7 +985,7 @@ jpn,Japanese,Eurasia,37,140,nucl1643,jpn,Japanese,,Japanese,,jpn,true,true,JP,Al jpr,Japreria,South America,10.5,-73,japr1238,jru,Cariban,,Cariban,,jru,false,false,VR,Durbin-and-Seijas-1972,genus-cariban jrn,Juruna,South America,-5,-54.5,juru1256,jur,Tupian,,Yuruna,,jur,false,false,BR,Rodrigues-1999a,genus-yuruna jrw,Jarawa (in Andamans),Eurasia,12,92.5833333333,jara1245,anq,South Andamanese,,South Andamanese,,anq,false,false,IN,Kumar-2003,genus-southandamanese -jug,Jugli,Eurasia,27.5,96.3333333333,jogl1236,nst,Sino-Tibetan,Tibeto-Burman,Brahmaputran,,nst,false,false,IN,Rekhung-1988a,genus-brahmaputran +jug,Jugli,Eurasia,27.5,96.3333333333,tase1235,nst,Sino-Tibetan,Tibeto-Burman,Brahmaputran,,nst,false,false,IN,Rekhung-1988a,genus-brahmaputran juh,Ju|'hoan,Africa,-19,21,juho1239,ktz,Kxa,,Ju-Kung,,ktz,false,true,AO NA BW,Bybee-et-al-1994 Dickens-1992 Dickens-1994 Dickens-nd Guldemann-2000 Nichols-1992 Snyman-1970 Snyman-1975 Stolz-1996,genus-jukung juk,Jukun,Africa,6.91666666667,10.4166666667,juku1254,jbu,Niger-Congo,Benue-Congo,Jukunoid,,jbu,false,false,NG,Shimuzu-1980 Welmers-1968b,genus-jukunoid jum,Júma,South America,-7.5,-64,juma1249,jua,Tupian,,Maweti-Guarani,,jua,false,false,BR,Abramson-1968,genus-mawetiguarani @@ -1626,7 +1626,7 @@ mxe,Ifira-Mele,Papunesia,-17.75,168.25,mele1250,mxe,Austronesian,Eastern Malayo- mxg,Mixtec (San Miguel el Grande),North America,17.05,-97.5666666667,sanm1295,mig,Oto-Manguean,Mixtecan,Mixtec,,mig,false,false,MX,Dyk-and-Stoudt-1973,genus-mixtec mxj,Mixtec (Jicaltepec),North America,16.3333333333,-98,pino1237,mio,Oto-Manguean,Mixtecan,Mixtec,,mio,false,false,MX,Bradley-1970,genus-mixtec mxl,Mixtec (Alacatlatzala),North America,17.25,-98.5833333333,alac1244,mim,Oto-Manguean,Mixtecan,Mixtec,,mim,false,false,MX,,genus-mixtec -mxm,Mixtec (Molinos),North America,17,-97.5833333333,sanm1259,mig,Oto-Manguean,Mixtecan,Mixtec,,mig,false,false,MX,Hunter-and-Pike-1969,genus-mixtec +mxm,Mixtec (Molinos),North America,17,-97.5833333333,sanp1259,mig,Oto-Manguean,Mixtecan,Mixtec,,mig,false,false,MX,Hunter-and-Pike-1969,genus-mixtec mxo,Mixtec (Ocotepec),North America,17.1666666667,-97.75,ocot1243,mie,Oto-Manguean,Mixtecan,Mixtec,,mie,false,false,MX,Alexander-1988,genus-mixtec mxp,Mixtec (Peñoles),North America,17.0833333333,-96.9166666667,peno1244,mil,Oto-Manguean,Mixtecan,Mixtec,,mil,false,false,MX,Daly-1973,genus-mixtec mxs,Mixtec (Silacayoapan),North America,17.5,-98.1666666667,sila1250,mks,Oto-Manguean,Mixtecan,Mixtec,,mks,false,false,MX,Shields-1988,genus-mixtec @@ -2295,7 +2295,7 @@ tli,Tlingit,North America,59,-135,tlin1245,tli,Na-Dene,,Tlingit,,tli,false,true, tll,Taulil,Papunesia,-4.41666666667,152.083333333,taul1251,tuh,Taulil,,Taulil,,tuh,false,false,PG,Stebbins-2002,genus-taulil tlo,Tobelo,Papunesia,1.5,128.5,tobe1252,tlb,North Halmaheran,,North Halmaheran,,tlb,false,false,ID,Holton-2003 Hueting-1908 Hueting-1936 den-Besten-2001,genus-northhalmaheran tlp,Tlapanec,North America,17.0833333333,-99,acat1239,tcf,Oto-Manguean,,Subtiaba-Tlapanec,,tcf,false,false,MX,Radin-1935 Suarez-1983a Suarez-1988,genus-subtiabatlapanec -tls,Talysh (Southern),Eurasia,37.5,49,tari1263,shm,Indo-European,,Iranian,,shm,false,false,IR,Stilo-2005,genus-iranian +tls,Talysh (Southern),Eurasia,37.5,49,shah1254,shm,Indo-European,,Iranian,,shm,false,false,IR,Stilo-2005,genus-iranian tma,Tama,Africa,14.5,22,tama1331,tma,Eastern Sudanic,,Taman,,tma,false,false,TD SD,Tucker-and-Bryan-1966,genus-taman tmc,Timucua,North America,30.25,-82.5,timu1245,tjm,Timucua,,Timucua,,tjm,false,false,US,Gatschet-1877 Granberry-1956 Granberry-1990 Granberry-1993,genus-timucua tmg,Tamagario,Papunesia,-6.41666666667,139.25,tama1336,tcg,Kayagar,,Kayagar,,tcg,false,false,ID,Voorhoeve-1975,genus-kayagar @@ -2387,7 +2387,7 @@ tub,Tubar,North America,27,-108,tuba1279,tbu,Uto-Aztecan,,Tubar,,tbu,false,false tuc,Tucano,South America,0.5,-69.1666666667,tuca1252,tuo,Tucanoan,,Tucanoan,,tuo,false,false,CO BR,Aikhenvald-2007a Bybee-et-al-1994 Derbyshire-and-Payne-1990 Giacone-nd Huber-and-Reed-1992 Sorensen-1969 West-1980,genus-tucanoan tug,Tuareg (Ahaggar),Africa,23,6,taha1241,thv,Afro-Asiatic,,Berber,,thv,false,false,LY NE DZ,Bybee-et-al-1994 Hanoteau-1896 Louali-Reynal-et-al-1997 Penchoen-1973b Prasse-1972 Quitout-1997,genus-berber tui,Türk Isaret Dili,Eurasia,39,34,turk1288,tsm,other,,Sign Languages,,tsm,false,false,TR,,genus-signlanguages -tuk,Tukang Besi,Papunesia,-5.5,123.5,tuka1247,,Austronesian,,Celebic,,bhq khc,true,true,ID,Donohue-1999a Donohue-1999c,genus-celebic +tuk,Tukang Besi,Papunesia,-5.5,123.5,tuka1248,,Austronesian,,Celebic,,bhq khc,true,true,ID,Donohue-1999a Donohue-1999c,genus-celebic tul,Tulu,Eurasia,12.75,75.3333333333,tulu1258,tcy,Dravidian,,Dravidian,,tcy,false,false,IN,Bhat-1967 Brigel-1982,genus-dravidian tum,Tumleo,Papunesia,-3.08333333333,142.416666667,tuml1238,tmq,Austronesian,Eastern Malayo-Polynesian,Oceanic,,tmq,false,false,PG,Schultze-1911,genus-oceanic tun,Tunica,North America,32.6666666667,-91,tuni1252,tun,Tunica,,Tunica,,tun,false,true,US,Haas-1940 Haas-1953 Nichols-1992 Swanton-1919 Swanton-1921,genus-tunica @@ -2514,7 +2514,7 @@ wog,Wogamusin,Papunesia,-4.25,142.333333333,woga1249,wog,Sepik,,Wogamusin-Chenap woi,Woisika,Papunesia,-8.25,124.833333333,kama1365,woi,Greater West Bomberai,Timor-Alor-Pantar,Alor-Pantar,,woi,false,false,ID,Stokhof-1979 Stokhof-1982,genus-alorpantar wol,Woleaian,Papunesia,7.33333333333,143.833333333,wole1240,woe,Austronesian,Eastern Malayo-Polynesian,Oceanic,,woe,false,false,FM,Sohn-1975 Sohn-and-Tawerilmang-1976,genus-oceanic wom,Womo,Papunesia,-2.91666666667,141.833333333,womo1238,wmx,Skou,,Serra Hills,,wmx,false,false,PG,,genus-serrahills -wor,Worora,Australia,-15.6666666667,124.666666667,woro1255,wro,Worrorran,,Worrorran,,wro,false,false,AU,Bybee-et-al-1994 Love-2000,genus-worrorran +wor,Worora,Australia,-15.6666666667,124.666666667,worr1237,wro,Worrorran,,Worrorran,,wro,false,false,AU,Bybee-et-al-1994 Love-2000,genus-worrorran wps,Wapishana,South America,2.66666666667,-60,wapi1253,wap,Arawakan,,Negro-Roraima,,wap,false,false,GY BR,Tracy-1972,genus-negrororaima wra,Warao,South America,9.33333333333,-61.6666666667,wara1303,wba,Warao,,Warao,,wba,true,true,VR,Osborn-1966 Osborn-1967 Romero-Figueroa-1985 Romero-Figueroa-1986 Romero-Figueroa-1997 Vaquero-1965 de-Barral-1979 de-Goeje-1930,genus-warao wrb,Warrnambool,Australia,-38.25,142.5,warr1257,gjm,Pama-Nyungan,,Southeastern Pama-Nyungan,,gjm,false,false,AU,Blake-2003,genus-southeasternpamanyungan @@ -2535,7 +2535,7 @@ wtm,Watam,Papunesia,-3.91666666667,144.5,wata1253,wax,Ramu-Lower Sepik,Ramu,Lowe wuc,Wu,Eurasia,31.6666666667,119.916666667,wuch1236,wuu,Sino-Tibetan,,Chinese,,wuu,false,false,CN,Chao-1970,genus-chinese wur,Waurá,South America,-13,-53,waur1244,wau,Arawakan,,Central Arawakan,,wau,false,false,BR,Derbyshire-1986 Derbyshire-and-Payne-1990 Wise-1990,genus-centralarawakan wwa,Waama,Africa,10.5833333333,1.66666666667,waam1244,wwa,Niger-Congo,Gur,Oti-Volta,,wwa,false,false,BJ,Peter-1990,genus-otivolta -wwr,Woiwurrung,Australia,-37.5,145.5,woiw1237,wyu,Pama-Nyungan,,Southeastern Pama-Nyungan,,wyu,false,false,AU,Blake-1991,genus-southeasternpamanyungan +wwr,Woiwurrung,Australia,-37.5,145.5,woiw1237,wyi,Pama-Nyungan,,Southeastern Pama-Nyungan,,wyi,false,false,AU,Blake-1991,genus-southeasternpamanyungan wwy,Waray-Waray,Papunesia,12,125,wara1300,war,Austronesian,,Greater Central Philippine,,war,false,false,PH,Rubino-2001c,genus-greatercentralphilippine wya,Wyandot,North America,44.3333333333,-77.5,wyan1247,wya,Iroquoian,,Northern Iroquoian,,wya,false,false,CA,Kopris-2001,genus-northerniroquoian wyn,Wayana,South America,3.25,-54.1666666667,waya1269,way,Cariban,,Cariban,,way,false,false,GF SR BR,Jackson-1972 de-Goeje-1946,genus-cariban @@ -2570,7 +2570,7 @@ ych,Yup'ik (Chevak),North America,61.5,-165.75,cent2127,esu,Eskimo-Aleut,,Eskimo ycn,Yucuna,South America,-0.75,-71,yucu1253,ycn,Arawakan,,Japura-Colombia,,ycn,false,false,CO,Huber-and-Reed-1992 Schauer-and-Schauer-1958 Schauer-and-Schauer-1967,genus-japuracolombia yct,Yucatec,North America,20,-89,yuca1254,yua,Mayan,,Mayan,,yua,false,false,MX,Arzapalo-1973 Bricker-et-al-1998 Straight-1976 Suarez-1983b Tozzer-1921,genus-mayan ydb,Yiddish (Bessarabian),Eurasia,47,28.5,east2295,ydd,Indo-European,,Germanic,,ydd,false,false,MD,,genus-germanic -ydd,Yiddish,Eurasia,52,23,yidd1255,ydd,Indo-European,,Germanic,,ydd,false,false,UA BY DE LT PL,Katz-1987,genus-germanic +ydd,Yiddish,Eurasia,52,23,east2295,ydd,Indo-European,,Germanic,,ydd,false,false,UA BY DE LT PL,Katz-1987,genus-germanic ydl,Yiddish (Lodz),Eurasia,51.75,19.4166666667,east2295,ydd,Indo-European,,Germanic,,ydd,false,false,PL,,genus-germanic yei,Yei,Papunesia,-7.91666666667,140.916666667,yeii1239,jei,Yam,,Yei,,jei,false,false,ID PG,Boelaars-1950,genus-yei yel,Yelî Dnye,Papunesia,-11.3666666667,154.166666667,yele1255,yle,Yele,,Yele,,yle,false,false,PG,Henderson-1975 Henderson-1995,genus-yele diff --git a/cldf/requirements.txt b/cldf/requirements.txt index 3e92b49..410e18d 100644 --- a/cldf/requirements.txt +++ b/cldf/requirements.txt @@ -3,7 +3,7 @@ Babel==2.11.0 bs4==0.0.1 certifi==2022.12.7 cldfbench==1.13.0 --e git+https://github.com/cldf-datasets/wals@e16d1f65352770f54b13f0394b4b2cb0d4985b9c#egg=cldfbench_wals +-e git+https://github.com/cldf-datasets/wals@2955a010f811e13778d96bb29394c46c04959375#egg=cldfbench_wals cldfcatalog==1.5.1 cldfzenodo==1.1.0 clldutils==3.18.0 diff --git a/cldf/sources.bib b/cldf/sources.bib index 7de51f0..4028e1d 100644 --- a/cldf/sources.bib +++ b/cldf/sources.bib @@ -1001,8 +1001,7 @@ @book{Berry-nd publisher = {Heffer}, title = {The Pronunciation of Ga}, wals_code = {ga}, - wals_ref_name = {Berry n.d.}, - year = {1000} + wals_ref_name = {Berry n.d.} } @book{Bhat-1967, @@ -9327,7 +9326,7 @@ @book{Polome-nd title = {Swahili Language Handbook}, wals_code = {swa}, wals_ref_name = {Polomé n.d.}, - year = {1000} + year = {1967} } @incollection{Popjes-and-Popjes-1986, @@ -13210,7 +13209,7 @@ @book{Zigmond-et-al-1990-1991 volume = {119}, wals_code = {kws}, wals_ref_name = {Zigmond et al. 1990-1991}, - year = {1000} + year = {1990} } @incollection{Zimmer-and-Orgun-1999, @@ -14268,7 +14267,7 @@ @book{Echols-and-Shadily-1961 title = {Kamus Indonesia Inggris: an Indonesian-English Dictionary}, wals_code = {ind}, wals_ref_name = {Echols and Shadily 1961}, - year = {1000} + year = {1961} } @book{Egli-1990, @@ -14579,8 +14578,7 @@ @unpublished{Grinevald-nd title = {A grammar of Rama}, type = {manuscript}, wals_code = {ram; jak}, - wals_ref_name = {Grinevald n.d.}, - year = {1000} + wals_ref_name = {Grinevald n.d.} } @book{Gudava-1964, @@ -16463,7 +16461,7 @@ @book{Singh-1906 title = {Khasi-English Dictionary}, wals_code = {khs}, wals_ref_name = {Singh 1906}, - year = {1000} + year = {1906} } @incollection{Skorik-1986, @@ -17809,7 +17807,7 @@ @book{Danusugondo-1975 title = {Bahasa Indonesian: Indonesian for Beginners. Volumes 1 and 2}, wals_code = {ind}, wals_ref_name = {Danusugondo 1975}, - year = {1000} + year = {1975} } @misc{Davis-1992, @@ -18263,7 +18261,7 @@ @article{Koite-Herschel-1981-1982 volume = {10-11}, wals_code = {xas}, wals_ref_name = {Koite-Herschel 1981-1982}, - year = {1000} + year = {1982} } @book{Kraft-and-Kirk-Greene-1973, @@ -22422,10 +22420,9 @@ @misc{Haviland-et-al-nd iso_code = {tzz}, olac_field = {phonetics; typology; general_linguistics; phonology}, title = {An On-line Tzotzil Grammar}, - url = {http://www.cerf.net/esteban/Tzotzil/}, + url = {https://theswissbay.ch/pdf/Books/Linguistics/Mega%20linguistics%20pack/Central%20and%20Meso-America/Mayan/Tzotzil%20Grammar%20%28Haviland%2C%20Robinson%20%26%20Gutierrez%29.pdf}, wals_code = {tzz}, - wals_ref_name = {Haviland et al. n.d.}, - year = {1000} + wals_ref_name = {Haviland et al. n.d.} } @article{Hawkins-1950, @@ -22707,8 +22704,7 @@ @unpublished{Hualde-nd title = {Theoretical Consequences of Souletin Basque Accentuation}, type = {manuscript}, wals_code = {bso}, - wals_ref_name = {Hualde n.d.}, - year = {1000} + wals_ref_name = {Hualde n.d.} } @article{Hudson-and-Richards-1969, @@ -24755,7 +24751,7 @@ @book{Ngata-nd title = {Maori Grammar and Conversation}, wals_code = {mao}, wals_ref_name = {Ngata n.d.}, - year = {1000} + year = {1964} } @incollection{Nicholson-and-Nicholson-1962, @@ -24939,7 +24935,7 @@ @book{Otrebski-1958-1965 title = {Gramatyka jezyka litewskiego (3 volumes)}, wals_code = {lit}, wals_ref_name = {Otrebski 1958-1965}, - year = {1000} + year = {1965} } @book{Owens-1984, @@ -26110,7 +26106,7 @@ @article{Strong-1913 volume = {4}, wals_code = {ror}, wals_ref_name = {Strong 1913}, - year = {1000} + year = {1913} } @book{Sundermann-1913a, @@ -27327,8 +27323,7 @@ @unpublished{Austerlitz-nd title = {Class handouts on Gilyak/Nivkh}, type = {manuscript}, wals_code = {niv}, - wals_ref_name = {Austerlitz n.d.}, - year = {1000} + wals_ref_name = {Austerlitz n.d.} } @book{Awobuluyi-1978, @@ -27818,7 +27813,7 @@ @book{Judge-and-Healey-1985 title = {A reference grammar of modern French}, wals_code = {fre}, wals_ref_name = {Judge and Healey 1985}, - year = {1000} + year = {1985} } @book{Kampfe-and-Volodin-1995, @@ -29654,8 +29649,7 @@ @book{Thaeler-and-Thaeler-nd publisher = {Board of Christian Education in Nicaragua}, title = {Miskito grammar}, wals_code = {mis}, - wals_ref_name = {Thaeler and Thaeler n.d.}, - year = {1000} + wals_ref_name = {Thaeler and Thaeler n.d.} } @book{Thurston-1982, @@ -33295,7 +33289,7 @@ @article{Goddard-1903 volume = {1}, wals_code = {hup}, wals_ref_name = {Goddard 1903}, - year = {1000} + year = {1903} } @incollection{Goddard-1911, @@ -35195,7 +35189,7 @@ @book{Kroeber-1904 volume = {2.2}, wals_code = {cba; ess; yok; cos}, wals_ref_name = {Kroeber 1904}, - year = {1000} + year = {1904} } @book{Kroeker-1982, @@ -35357,7 +35351,7 @@ @article{Lanyon-Orgill-1943 volume = {11}, wals_code = {all}, wals_ref_name = {Lanyon-Orgill 1943}, - year = {1000} + year = {1943} } @book{Larochette-1958, @@ -36950,7 +36944,7 @@ @article{Niggemeyer-1951 volume = {76}, wals_code = {aln}, wals_ref_name = {Niggemeyer 1951}, - year = {1000} + year = {1951} } @book{Njie-1982, @@ -39177,8 +39171,7 @@ @unpublished{Staley-nd title = {Olo Dictionary}, type = {manuscript}, wals_code = {olo}, - wals_ref_name = {Staley n.d.}, - year = {1000} + wals_ref_name = {Staley n.d.} } @book{Stechishin-1958, @@ -41033,8 +41026,7 @@ @unpublished{Beeler-nd title = {Topics in Barbareno Chumash}, type = {manuscript}, wals_code = {cba}, - wals_ref_name = {Beeler n.d.}, - year = {1000} + wals_ref_name = {Beeler n.d.} } @article{Bell-1982, @@ -42433,8 +42425,7 @@ @unpublished{Rose-nd title = {The Formation of Ethiopian Semitic Internal Reduplication}, type = {manuscript}, wals_code = {chh; tgr; hrr; krk}, - wals_ref_name = {Rose n.d.}, - year = {1000} + wals_ref_name = {Rose n.d.} } @book{Rubino-1998a, @@ -43141,7 +43132,7 @@ @book{Skorik-1961-1977 title = {Grammatika chukotskogo jazyka. Volume 1 and 2}, wals_code = {chk}, wals_ref_name = {Skorik 1961-1977}, - year = {1000} + year = {1977} } @book{Sofroniou-1962, @@ -43531,8 +43522,7 @@ @unpublished{Dickens-nd title = {Ju|'hoan Grammar}, type = {manuscript}, wals_code = {juh}, - wals_ref_name = {Dickens n.d.}, - year = {1000} + wals_ref_name = {Dickens n.d.} } @incollection{Dobrin-1998, @@ -43924,8 +43914,7 @@ @unpublished{Mous-nd title = {Alagwa Grammar, Texts and Lexicon}, type = {manuscript}, wals_code = {agw}, - wals_ref_name = {Mous n.d.}, - year = {1000} + wals_ref_name = {Mous n.d.} } @article{Newman-1979, @@ -48717,7 +48706,7 @@ @book{Safford-1903-1905 title = {The Chamorro language of Guam}, wals_code = {cha}, wals_ref_name = {Safford 1903-1905}, - year = {1000} + year = {1905} } @book{Sibusiso-Nyembezi-1972, @@ -48824,7 +48813,7 @@ @book{Westermann-1945 title = {Pluralbildung und Nominalklassen in einigen afrikanischen Sprachen}, wals_code = {ewe}, wals_ref_name = {Westermann 1945}, - year = {1000} + year = {1945} } @incollection{den-Besten-1996, @@ -55343,7 +55332,7 @@ @book{Yates-and-Tryon-1970 title = {Thai: Basic Course. Volume 1 and 2}, wals_code = {tha}, wals_ref_name = {Yates and Tryon 1970}, - year = {1000} + year = {1970} } @book{Young-1992, @@ -56001,8 +55990,7 @@ @book{Patel-nd publisher = {Read Well Publications}, title = {Learn Gujarati in a month}, wals_code = {guj}, - wals_ref_name = {Patel n.d.}, - year = {1000} + wals_ref_name = {Patel n.d.} } @book{Perez-Martinez-1994, @@ -56815,8 +56803,7 @@ @unpublished{Nikolaeva-and-Tolskaja-nd title = {A grammar of Udihe}, type = {manuscript}, wals_code = {udh}, - wals_ref_name = {Nikolaeva and Tolskaja n.d.}, - year = {1000} + wals_ref_name = {Nikolaeva and Tolskaja n.d.} } @book{Pashkov-1963, @@ -58226,8 +58213,7 @@ @unpublished{Creider-nd title = {A Syntactic Sketch of Nandi}, type = {manuscript}, wals_code = {nan}, - wals_ref_name = {Creider n.d.}, - year = {1000} + wals_ref_name = {Creider n.d.} } @misc{Caoimhin-P-ODonnaile-pc-cited-in-Gil-1994, @@ -58239,7 +58225,7 @@ @misc{Caoimhin-P-ODonnaile-pc-cited-in-Gil-1994 type = {personal communication}, wals_code = {iri}, wals_ref_name = {Caoimhin P. O'Donnaile, p.c., cited in Gil 1994}, - year = {1000} + year = {1994} } @misc{George-Huttar-pc-cited-in-Gil-1994, @@ -58251,7 +58237,7 @@ @misc{George-Huttar-pc-cited-in-Gil-1994 type = {personal communication}, wals_code = {ndy}, wals_ref_name = {George Huttar, p.c., cited in Gil 1994}, - year = {1000} + year = {1994} } @incollection{Gil-1994a, @@ -64439,8 +64425,7 @@ @unpublished{Ammann-nd title = {Expressions of necessity: Catalan, its diachrony, and a problem for grammaticalization theory}, type = {manuscript}, wals_code = {ctl}, - wals_ref_name = {Ammann n.d.}, - year = {1000} + wals_ref_name = {Ammann n.d.} } @book{Chinggaltai-1952, @@ -66249,8 +66234,7 @@ @misc{Anonymous-4 title = {Khoisan}, url = {http://ling.cornell.edu/khoisan/index.htm}, wals_code = {xam}, - wals_ref_name = {Anonymous 4}, - year = {1000} + wals_ref_name = {Anonymous 4} } @book{Aymonier-1889, @@ -66828,7 +66812,7 @@ @article{Meinhof-1938-39 volume = {29}, wals_code = {bia}, wals_ref_name = {Meinhof 1938/39}, - year = {1000} + year = {1939} } @article{Meyer-1940, @@ -67062,8 +67046,7 @@ @unpublished{Rigden-nd-a title = {Karkar Grammar Essentials}, type = {manuscript}, wals_code = {kyr}, - wals_ref_name = {Rigden n.d. (a)}, - year = {1000} + wals_ref_name = {Rigden n.d. (a)} } @book{Ruelland-1998, @@ -69233,7 +69216,7 @@ @incollection{Nedjalkov-and-Nedjalkov-2007c title = {Reciprocals, sociatives and competitives in Karachai-Balkar}, wals_code = {krc}, wals_ref_name = {Nedjalkov and Nedjalkov 2007c}, - year = {1000} + year = {2007} } @incollection{Ogloblin-and-Nedjalkov-2007, @@ -69411,7 +69394,7 @@ @incollection{Tsunoda-2007b title = {Reciprocal-reflexive constructions in Djaru}, wals_code = {djr}, wals_ref_name = {Tsunoda 2007b}, - year = {1000} + year = {2007} } @incollection{Volodin-2007, @@ -74725,7 +74708,7 @@ @book{Giacone-nd title = {Pequena gramática e dicionário da lingua Tucana}, wals_code = {tuc}, wals_ref_name = {Giacone n.d.}, - year = {1000} + year = {1952} } @book{Gilberti-1901, @@ -75822,7 +75805,7 @@ @book{Lenkersdorf-1979a title = {B'omak'umal Kastiya-Tojol Ab'al. Volume 2: Diccionario español - tojolabal}, wals_code = {toj}, wals_ref_name = {Lenkersdorf 1979a}, - year = {1000} + year = {2010} } @book{Lenkersdorf-1979b, @@ -75834,7 +75817,7 @@ @book{Lenkersdorf-1979b title = {B'omak'umal Tojol Ab'al - Kastiya. Volume 1: Diccionario tojolabal -español}, wals_code = {toj}, wals_ref_name = {Lenkersdorf 1979b}, - year = {1000} + year = {2010} } @book{Leslau-1959, @@ -76647,7 +76630,7 @@ @book{Perry-et-al-nd title = {Western Apache Dictionary}, wals_code = {apw}, wals_ref_name = {Perry et al. n.d.}, - year = {1000} + year = {1972} } @book{Petitot-1876, @@ -77906,7 +77889,7 @@ @book{Voegelin-1938-1940 title = {Shawnee stems and the Jacob P. Dunn Miami dictionary. Part 1-5}, wals_code = {shw}, wals_ref_name = {Voegelin 1938-1940}, - year = {1000} + year = {1940} } @book{Voorhis-1988, @@ -78065,7 +78048,7 @@ @book{Wessely-nd title = {Pocket Dictionary of the English and Italian Languages}, wals_code = {ita}, wals_ref_name = {Wessely n.d.}, - year = {1000} + year = {1898} } @book{White-and-White-1990, @@ -78138,7 +78121,7 @@ @book{Xiong-et-al-nd title = {English-Mong-English Dictionary}, wals_code = {mge}, wals_ref_name = {Xiong et al. n.d.}, - year = {1000} + year = {1984} } @book{Y-Chang-1979, @@ -78164,7 +78147,7 @@ @book{Yacoubian-nd title = {English-Armenian and Armenian-English Dictionary}, wals_code = {arw}, wals_ref_name = {Yacoubian n.d.}, - year = {1000} + year = {1970} } @book{Zeisberger-1887, @@ -78190,7 +78173,7 @@ @unpublished{Zinn-and-Zinn-nd type = {manuscript}, wals_code = {poc}, wals_ref_name = {Zinn and Zinn n.d.}, - year = {1000} + year = {1970} } @book{de-Alviano-1944, @@ -78753,7 +78736,7 @@ @book{Dmitriev-1955 title = {Rusca-Tatarca süzlek/Russko-tatarskij slovar'}, wals_code = {tvo}, wals_ref_name = {Dmitriev 1955}, - year = {1000} + year = {1955} } @book{Dutton-1992, @@ -79112,14 +79095,13 @@ @book{Syamsir-et-al-1985 @misc{Sylestine-et-al-nd, author = {Sylestine, Cora and Hardy, Heather K. and Montler, Timothy}, - howpublished = {online}, iso_code = {akz}, olac_field = {general_linguistics; syntax; typology}, title = {Alabama-English Dictionary}, - url = {http://www.ling.unt.edu/%7Emontler/Alabama/Dictionary/}, + url = {https://doi.org/10.7560/730779-007}, wals_code = {abm}, wals_ref_name = {Sylestine et al. n.d.}, - year = {1000} + year = {1993} } @book{Trebilco-et-al-1974, @@ -79166,10 +79148,10 @@ @misc{Webster-and-Zibell-nd iso_code = {esi}, olac_field = {syntax; typology; general_linguistics}, title = {Interactive Iñupiaq Dictionary}, - url = {http://www.alaskool.org/Language/dictionaries/inupiaq/dictionary.htm}, + url = {http://www.alaskool.org/Language/dictionaries/inupiaq/}, wals_code = {inu}, wals_ref_name = {Webster and Zibell n.d.}, - year = {1000} + year = {1970} } @book{Wolfart-and-Ahenakew-1998, @@ -79569,8 +79551,7 @@ @unpublished{Veinberg-nd title = {Interrogation in Argentine Sign Language: Non-manual Markers}, type = {manuscript}, wals_code = {lsa}, - wals_ref_name = {Veinberg n.d.}, - year = {1000} + wals_ref_name = {Veinberg n.d.} } @book{Vogt-Svendsen-1990, diff --git a/cldfbench_wals.py b/cldfbench_wals.py index 19bb3da..755f12d 100644 --- a/cldfbench_wals.py +++ b/cldfbench_wals.py @@ -54,6 +54,7 @@ def read(self, core, extended=False, pkmap=None, key=None): def cmd_makecldf(self, args): self.create_schema(args.writer.cldf) + glangs = {l.id for l in args.glottolog.api.languoids()} pk2id = collections.defaultdict(dict) @@ -208,6 +209,9 @@ def cmd_makecldf(self, args): family = families[genus['family_pk']] iso_codes = row['iso_codes'].replace(',', '').split() glottocodes = [i[0] for i in lang2id[row['pk']].get('glottolog', [])] + gcode = glottocodes[0] if len(glottocodes) == 1 else None + if gcode: + assert gcode in glangs, 'invalid Glottocode: {}'.format(gcode) srcs = lrefs[row['pk']] if id in gbs_lg_refs: [srcs.append(s) for s in gbs_lg_refs[id] if s not in srcs] diff --git a/setup.py b/setup.py index e9afb08..2fea28d 100644 --- a/setup.py +++ b/setup.py @@ -22,7 +22,7 @@ 'pycldf>=1.19.0', 'pybtex>=0.24.0', 'beautifulsoup4>=4.9.3', - 'csvw>=1.10.1' + 'csvw>=1.10.1', ], extras_require={ 'test': [