Movie Prediction Algorithm and Dataset
- After preprocessing/cleaning the data there were around 2000 data points.
- The main task was to predict the IMDB rating of a movie.
- This was considered as a classification problem by taking 10 classes 1-10 i.e the rating.
- There were initially many features which was then reduced using the domain knowledge finally only 9 features was taken into consideration, the filtered and processed data is saved in the after csv.csv file.
- All the models are pickled in the models folder.
- Error= 0
Model | Accuracy (%) |
---|---|
K Nearest Neighbours | 37.52 |
Logitic Regression | 40.9 |
SVC | 36.35 |
- Error= +/- 1
Model | Accuracy (%) |
---|---|
K Nearest Neighbours | 80.90 |
Logitic Regression | 85.09 |
SVC | 83.91 |