Skip to content

Add master catalogue search index with TF-IDF NN retrieval and keyword#237

Open
falah2001 wants to merge 1 commit intoDataBytes-Organisation:mainfrom
falah2001:Falah_ML_product_search
Open

Add master catalogue search index with TF-IDF NN retrieval and keyword#237
falah2001 wants to merge 1 commit intoDataBytes-Organisation:mainfrom
falah2001:Falah_ML_product_search

Conversation

@falah2001
Copy link
Contributor

  • Implemented TF-IDF–based master catalogue search using cosine nearest neighbours for fast and scalable product retrieval.
  • Added per-category TF-IDF keyword extraction to identify high-signal terms that define each merchandise category.
  • Introduced keyword coverage analysis to measure how broadly or narrowly a term captures products within categories.
  • Implemented shared keyword detection across categories to identify non-specific or high-frequency terms affecting search precision.
  • Designed the module as a configurable, command-line tool supporting both query search and keyword exploration modes for flexible analysis workflows.

Add master catalogue search index with TF-IDF NN retrieval and keyword intelligence (per-category TF-IDF, shared keyword coverage)
Implemented TF-IDF + cosine nearest neighbour search with per-category keyword extraction and coverage analysis.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant