Skip to content

saxenanihal95/Search-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Search-Engine

A Simple Search Engine using nltk

This is a very basic search engine which we build using the following steps :

Scraping Documents from any website

Then the text of the documents will go through removal of puntuations and stopwords singular words to plural

Store the final words(keywords) into the database(mysql)

Create a dictionary of the words(keywords)

Create a document which will contain lists of list and each list will be a document we scraped and list will contain the frequency(number) of words occured in that document.

Create Tf-Idf Model

Finally search a query in our model.