LexiLang

Simple, fast dictionary-based language detector for short texts.

Installation

pip install lexilang

Usage

from lexilang.detector import detect

print(detect("bonjour")) # ('fr', 0.45)
print(detect("学中文")) # ('zh', 0.45)
print(detect("ciao mondo")) # ('it', 0.9)
print(detect("El gato doméstico")) # ('es', 0.45)

# Optionally, specify a subset of languages to consider
print(detect("ciao", languages=["de", "ro"])) # ('de', 0.45)

detect(text, languages=[]) -> tuple (iso_639_1, confidence)

Supported Languages

Afrikaans
Albanian
Arabic
Bengali
Bulgarian
Catalan
Chinese
Czech
Danish
Dutch
English
Esperanto
Estonian
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Indonesian
Italian
Japanese
Kazakh
Korean
Latvian
Lithuanian
Macedonian
Norwegian
Polish
Portuguese
Romanian
Russian
Serbian
Slovak
Slovenian
Spanish
Swedish
Thai
Turkish
Ukrainian
Vietnamese
Farsi

Limitations

This detector was designed for handling small texts (< 20 characters). It will probably not work reliably for longer text sequences. As it relies on dictionaries, if a word is missing or mispelled, the detection will fail.

Contributing

If you want to add a new language, or improve an existing one, add more words to the respective dictionary in the dictionaries folder.

License

AGPLv3

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
dictionaries		dictionaries
lexilang		lexilang
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
publish.sh		publish.sh
setup.cfg		setup.cfg
setup.py		setup.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LexiLang

Installation

Usage

Supported Languages

Limitations

Contributing

License

About

Releases 2

Packages

Contributors 5

Languages

License

LibreTranslate/LexiLang

Folders and files

Latest commit

History

Repository files navigation

LexiLang

Installation

Usage

Supported Languages

Limitations

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 5

Languages

Packages