Missing / wrong results for search string containing a German umlaut #724

Open
flyfoto opened this issue Oct 10, 2024 · 1 comment

Comments

@flyfoto

flyfoto commented Oct 10, 2024

If I search for "büchelberg" (in double quotes for an exact match) at https://fly-foto.de/luftbild/suche/,
I get results that match the search term exactly. See:
[screenshot]
However, if I omit the double quotes, the results containing the search term are no longer returned; instead I get matches for 'b.'
See:
[screenshot]
If I search for "buechelberg" I get no results, but if I search for buechelberg (without quotes) I get similarly wrong results as above.
This does not happen if I search for e.g. "buchen" or buchen.
In those cases the exact-match results are included in the result sets.

I suspect the umlaut "ü" is causing this trouble?
However, with the search term "Fürth" I don't get this unexpected result.

@flyfoto
Author

flyfoto commented Nov 13, 2024

I noticed that my English website does not show this bug.

After disabling language detection for the German site by recreating the index with the CLI option "--force-language en",
the problem disappears:
[screenshot]

Apparently the stemming or segmentation (or whatever processing happens for the German language) incorrectly treats an umlaut (Ü, Ä, Ö) in the second position of the search term as a delimiter. As a result, searches such as Würzburg, Büchelberg, etc. do not return the existing exact hits.
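
To illustrate the kind of behaviour I suspect, here is a minimal Python sketch. It is not the search tool's actual code; the function names `ascii_tokens` and `unicode_tokens` are made up for this example, and the ASCII-only pattern is only an assumption about what might go wrong. It shows how a tokenizer that accepts only a-z as word characters would split a term at the umlaut and leave a one-letter token 'b', while a Unicode-aware pattern keeps the word whole.

```python
import re
import unicodedata


def ascii_tokens(text: str) -> list[str]:
    """Hypothetical buggy tokenizer: only a-z count as word characters."""
    normalized = unicodedata.normalize("NFC", text).lower()
    return re.findall(r"[a-z]+", normalized)


def unicode_tokens(text: str) -> list[str]:
    """Unicode-aware tokenizer: any letter (including ü, ä, ö) is a word character."""
    normalized = unicodedata.normalize("NFC", text).lower()
    # [^\W\d_] matches word characters that are neither digits nor underscore.
    return re.findall(r"[^\W\d_]+", normalized)


if __name__ == "__main__":
    for term in ("Büchelberg", "Würzburg", "buchen"):
        print(term, ascii_tokens(term), unicode_tokens(term))
```

With the ASCII-only pattern, "Büchelberg" ends up as the two tokens 'b' and 'chelberg', which would at least be consistent with the unquoted search matching 'b.' instead of the full word. It does not explain why "Fürth" is unaffected, so this is a guess at the mechanism only.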
