Skip to content
/ hacknu Public

HackNu 2021 hackathon. News search engine NLP using TF-IDF and BM25. This is a hackathon problem from Google company

Notifications You must be signed in to change notification settings

rinath/hacknu

Repository files navigation

Local Search Engine

<HackNU/2021> Hackathon

by 3.14rates

Google Challenge: Develop a local search engine to quickly search for a set of keywords through around 150,000 news articles dataset. Some examples of keywords are: "work desk", "presidential election", "Olympic closing ceremony" or "documentary". Search engine should output a ranked list of 5 articles which are the best match based on the given keywords and the time it took for the query to execute.

Our solutions mainly consists of preprocessing the input data (tokenization, lemmatization) and using ranking functions as tf-idf and bm25. We were awarded an honorable mention (TOP-3) at the hackathon.

Link to the news dataset

About

HackNu 2021 hackathon. News search engine NLP using TF-IDF and BM25. This is a hackathon problem from Google company

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published