Book-Search-Engine

Search Engine for Books (Java, Apache Lucene, crawler4j, Apache Spark)

Crawled about 100,000 web pages using crawler4j and performed link analysis by implementing PageRank on the web graph with Apache Spark’s Graphx.
Indexed the crawled documents using Apache Lucene and ordered the documents for each query by a combination of PageRank and TF/IDF score.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src/main/java		src/main/java
.gitignore		.gitignore
Book Search Engine.iml		Book Search Engine.iml
README.md		README.md
pom.xml		pom.xml

Provide feedback