Skip to content

Conversation

@numdouglas
Copy link

The current sort algorithm used to sort the tokenised strings is alphabetical sort. This has shortcomings such as when two similar words begin with different letters.
I propose the addition of Levenstein sort on the tokens themselves to improve the accuracy in such instances.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant