Language (BCP 47 code) | File Name | Single Lexicon Number Entries | Single Lexicon Include POS | MWE Lexicon Number Entries |
---|---|---|---|---|
Standard Arabic (arb) | semantic_lexicon_arabic.tsv | 37,313 | ❌ | ❌ |
Mandarin Chinese (cmn) | semantic_lexicon_chi.tsv | 64,541 | ✔️ | ❌ |
Mandarin Chinese (cmn) | mwe-chi.tsv | ❌ | ❌ | 19,040 |
Czech (cs) | semantic_lexicon_cz.tsv | 28,161 | ✔️ | ❌ |
Danish (dk) | semantic_lexicon_dk.tsv | 34 | ✔️ | ❌ |
Danish (dk) | mwe-dk.tsv | ❌ | ❌ | 9 |
Dutch, Flemish (nl) | semantic_lexicon_dut.tsv | 4,220 | ✔️ | ❌ |
Finnish (fi) | semantic_lexicon_fin.tsv | 46,226 | ✔️ | ❌ |
French (fr) | semantic_lexicon_fr.tsv | 2,724 | ✔️ | ❌ |
Italian (it) | semantic_lexicon_ita.tsv | 33,091 | ✔️ | ❌ |
Italian (it) | mwe-ita.tsv | ❌ | ❌ | 5,622 |
Standard Malay (zsm) | semantic_lexicon_ms.tsv | 64,863 | ❌ | ❌ |
Portuguese (pt) | semantic_lexicon_pt.tsv | 13,942 | ✔️ | ❌ |
Portuguese (pt) | mwe-pt.tsv | ❌ | ❌ | 1,781 |
Russian (ru) | semantic_lexicon_rus.tsv | 17,443 | ✔️ | ❌ |
Russian (ru) | semantic_lexicon_rus_names.tsv | 7,643 | ✔️ | ❌ |
Russian (ru) | mwe-rus.tsv | ❌ | ❌ | 713 |
Spanish, Castilian (es) | semantic_lexicon_es.tsv | 9,709 | ✔️ | ❌ |
Spanish, Castilian (es) | mwe-es.tsv | ❌ | ❌ | 4,841 |
Swedish (sv) | semantic_lexicon_se.tsv | 18,082 | ✔️ | ❌ |
Urdu (ur) | Urdu_Semantic_Lexicon.tsv | 2,000 | ✔️ | ❌ |
Welsh (cy) | semantic_lexicon_cy.tsv | 143,292 | ✔️ | ❌ |
Welsh (cy) | mwe-welsh.tsv | ❌ | ❌ | 240 |
Indonesian (id) | semantic_lexicon_id.tsv | 284 | ✔️ | ❌ |
English (en) | semantic_lexicon_en.tsv | 54,797 | ✔️ | ❌ |
English (en) | mwe-en.tsv | ❌ | ❌ | 19,042 |