Python & Spark Collaborative Filtering Script Using the MovieLens Dataset

This repository contains a script and a dataset for finding similar movies in a large data set using Python and Spark. You pass the id of a movie, and the script finds similar movies using item-based collaborative filtering. You can change the threshold values in the script to tune which results are returned.

More here: https://realpython.com/build-recommendation-engine-collaborative-filtering/
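
To make the idea of item-based collaborative filtering with thresholds concrete, here is a minimal sketch in plain Python rather than Spark. It is not the code in movie-similarities.py: the toy ratings, the function names, the use of cosine similarity, and the two threshold parameters (score_threshold, co_occurrence_threshold) are all assumptions chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt

# Hypothetical toy ratings as (user_id, movie_id, rating); not MovieLens data.
RATINGS = [
    (1, 50, 5.0), (1, 172, 5.0), (1, 133, 1.0),
    (2, 50, 4.0), (2, 172, 4.0), (2, 133, 2.0),
    (3, 50, 5.0), (3, 172, 4.0),
]


def item_similarities(ratings):
    """Return {(movie_a, movie_b): (cosine_score, num_co_raters)}, movie_a < movie_b."""
    # Group each user's ratings so we can pair up movies rated by the same user.
    by_user = defaultdict(list)
    for user, movie, rating in ratings:
        by_user[user].append((movie, rating))

    # For every movie pair, accumulate [sum_xx, sum_yy, sum_xy, count].
    sums = defaultdict(lambda: [0.0, 0.0, 0.0, 0])
    for rated in by_user.values():
        for (m1, r1), (m2, r2) in combinations(sorted(rated), 2):
            acc = sums[(m1, m2)]
            acc[0] += r1 * r1
            acc[1] += r2 * r2
            acc[2] += r1 * r2
            acc[3] += 1

    result = {}
    for pair, (xx, yy, xy, n) in sums.items():
        denom = sqrt(xx) * sqrt(yy)
        result[pair] = (xy / denom if denom else 0.0, n)
    return result


def similar_movies(ratings, movie_id, score_threshold=0.97, co_occurrence_threshold=2):
    """List (other_movie, score, co_raters) for pairs passing both thresholds."""
    hits = []
    for (a, b), (score, n) in item_similarities(ratings).items():
        if movie_id in (a, b) and score > score_threshold and n >= co_occurrence_threshold:
            other = b if a == movie_id else a
            hits.append((other, score, n))
    # Highest similarity first.
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


if __name__ == "__main__":
    for movie, score, n in similar_movies(RATINGS, 50):
        print(movie, round(score, 4), n)
```

Raising score_threshold keeps only very close matches, while raising co_occurrence_threshold drops pairs that too few users rated together. In a Spark version the same pairing and aggregation would typically be expressed as a self-join followed by a grouped aggregation.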

Files

To Run

  • Install Spark & Python on your system.
spark-submit movie-similarities.py <id>

(<id> is the id of the movie to find similarities for; for example, 50 is Star Wars.)
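
As a rough illustration of how a script launched this way could pick up the <id> argument, here is a small sketch. The helper name parse_movie_id is hypothetical and the actual movie-similarities.py may read its argument differently.

```python
import sys


def parse_movie_id(argv):
    """Return the movie id from argv as an int, or None if missing or not a number."""
    # argv[0] is the script name; the id is expected as the first real argument.
    if len(argv) < 2:
        return None
    try:
        return int(argv[1])
    except ValueError:
        return None


if __name__ == "__main__":
    movie_id = parse_movie_id(sys.argv)
    if movie_id is None:
        print("usage: spark-submit movie-similarities.py <id>")
    else:
        print(f"Finding movies similar to id {movie_id}")
```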

Maintainers

  • Vaibhav Magon