- Modified
load_bigram_dictionary
to allow dictionary entries to be split into only 2 parts when using a custom separator - Added dictionary files to wheels so
pkg_resources
could be used to access them
- Added
separator
argument to allow user to choose custom separator forload_dictionary
- Added
load_bigram_dictionary
and bigram dictionaryfrequency_bigramdictionary_en_243_342.txt
- Updated
lookup_compound
algorithm - Added
Levenshtein
to compute edit distance - Added
save_pickle_stream
andload_pickle_stream
to save/load SymSpell data alongside other structure (contribution by marcoffee)
- Added
transfer_casing
tolookup
andlookup_compound
- Fixed prefix length check in
_edits_prefix
- Implemented
delete_dictionary_entry
- Improved performance by using python builtin hashing
- Added versioning of the pickle
- Fixed
include_unknown
inlookup
- Removed unused
initial_capacity
argument - Improved
_get_str_hash
performance - Implemented
save_pickle
andload_pickle
to avoid having to create the dictionary every time
- Added
create_dictionary()
feature
- Fixed
lookup_compound()
to return the correctdistance
- Added
<self._replaced_words = dict()>
to track number of misspelled words - Added
ignore_token
toword_segmentation()
to ignore words with regular expression
- Added
word_segmentation()
feature
- Added
encoding
option toload_dictionary()
- Create a package for
symspellpy
- Ported SymSpell v6.3