Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add BM25F function #11

Open
bohnpessatti opened this issue Sep 4, 2020 · 1 comment
Open

Add BM25F function #11

bohnpessatti opened this issue Sep 4, 2020 · 1 comment
Labels
enhancement New feature or request

Comments

@bohnpessatti
Copy link

Congratulations for the initiative, your project it's being quite useful in my work.

I would like to suggest adding a function for the BM25F method, which takes different document fields relevance into account before using BM25 saturating function.

This avoids dangerous over-estimation of terms importance when combining linearly BM25 scores from different fields [1]. Therefore, it could make your project more robust for structured text ranking.

References:
[1] https://trec.nist.gov/pubs/trec13/papers/microsoft-cambridge.web.hard.pdf
[2] https://www.researchgate.net/publication/221613382_Simple_BM25_extension_to_multiple_weighted_fields

Thank you in advance.

@dorianbrown
Copy link
Owner

Thanks for the kind words, and I'm glad this package has been of use to you!

It sounds like it would be a useful addition to the package. I don't have time to add these changes at the moment, as I no longer work with this application for my job. If you'd like to create a pull request with the changes I'd be happy to review it and add it, but otherwise I'll try and add it when I've got some time for it.

@dorianbrown dorianbrown added the enhancement New feature or request label Sep 6, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants