-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing characters in Latin -> Cyrillic transliteration #14
Comments
The main question is, what to do with them? https://en.wikipedia.org/wiki/Romanization_of_Russian#Transliteration_table
Any other/better ideas? |
In my previous company some guys made a complex tool for multi-language transliteration of toponyms. As far as I understood they had sets of rules with different probabilities which depended on the location of characters' sequences in a word (like ck -> к, ough -> о). The process of transliteration had a few stages implemented in different programming languages, but result was quite good. |
Same here: -> Когда Дигитал Еqуипмент Цорпоратион сократила количество рабочих мест на три тысячи, в ее официальном объявлении говорилось не об увольнениях, а о «вынужденных мерах». |
Characters C, Q, W, X are missing in transliteration tables for Latin -> Cyrillic transliteration.
$ python -c 'import transliterate; print transliterate.translit("CQWX", "ru")'
CQWX
The text was updated successfully, but these errors were encountered: