[WIP]
The minimum required Python version is now 3.9.
This release modernizes the codebase, delivering a 62x reduction in peak import memory (thanks to @what-in-the-nim) and achieving typed package status through improved type annotations. Additionally, the library now ensures compatibility across Python 3.9–3.14.
Many updates were AI-assisted; see Pull Requests for specific prompts and implementation details.
- Add Python 3.14 support (update nlpO3 to 1.4.0)
- Fix `royin` romanization (#1172: Fix royin romanization for Thai consonant clusters and syllable boundaries)
- Fix final consonant classification in `check_marttra()` (#1173: Fix check_marttra() Thai final consonant classification)
- Lazy load dictionaries to reduce memory usage (#1186: Reduce peak memory on import by 62x)
- Fix Kho Khon alphabet issue in `tltk` transliteration (#1187: Fix KeyError when transliterating text containing obsolete Thai letter ฅ)
- Migrate configurations to pyproject.toml (#1188: Consolidate configuration into pyproject.toml; #1226: Integrate tox.ini into pyproject.toml; #1239: Remove obsolete configuration files (.pep8speaks.yml and .pyup.yml))
- Update type hints; use Python 3.9 features (#1189: Modernize to Python 3.9: type hints with future annotations, exception handling, and code quality; #1190: Fix type hint mismatches with None defaults throughout codebase; and many more PRs)
- Remove duplicated entries in volubilis dictionary (#1200: volubilis_words_th.txt: Remove trailing whitespaces + remove duplicate entries)
- Remove star imports (#1207: Remove star imports and enable F403/F405 Ruff checks)
- Remove `requests` dependency (#1211: Replace requests with urllib.request to reduce core dependencies)
- Make package zip-safe (#1212: Make package zip-safe by migrating to importlib.resources)
- Ensure thread-safety for tokenizers (#1213: Ensure thread-safety for word_tokenize() wrapper functions)
- Improved documentation; code cleanup; more tests