-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bad performance for other language #76
Comments
Hi @JuanFF, I have the following 2 observations:
If you dont want to change the bert model, I would suggest to pass the vocab file (example) separately like:
|
I have a pending issue #44 on a similar topic, but lately, I have been pretty occupied. If you think you can contribute, please open a PR! The project would be glad to have your contribution! |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. |
Hello,
I'm trying to use the contextual spell checker for Spanish. I run the script in https://github.com/R1j1t/contextualSpellCheck/blob/88bbbb46252c534679b185955fd88c239ed548a7/examples/ja_example.py with the following custom configuration:
but I don't get the desired result
La economia a crecido un dos por ciento
should be corrected asLa economía ha crecido un dos por ciento
Instead, I get
La economia a crecido un dos por cento
If I use another pre-trained model (e.g. "model_name": "PlanTL-GOB-ES/roberta-large-bne") , the result keeps wrong:
Laeconomiaacrecidoundosporciento.
??I wonder if I'm using the proper script to run the spellchecker in another language.
The text was updated successfully, but these errors were encountered: