A search engine for searching Wikipedia XML dumps.
python3 and nltk library is required to run the search engine.
To create the inverted index, run the following command.
./index.sh <path_to_wiki_dump_file> <path_to_index_folder>
The arguments are the absolute paths to the Wikipedia XML dump file and the folder where the inverted index is to be created and stored.
To search, run the following command.
./search.sh <path_to_index_folder>
Enter the query one by one to get the top 10 ranked results.