Capstone Project - TrekPredict: Predicting IMDB Ratings Using NLP & Machine Learning

What if you could predict a show's ratings from its scripts alone? I tested this idea on one of my favourite TV shows, Star Trek: The Next Generation.

The project contains (in viewing order):

Summary

KatyaKogan_Capstone_Report.pdf (final report)
KatyaKogan_Final_Presentation.pdf (final presentation for tech audience)
KatyaKogan_Demo_Day.pdf (presentation file for demo day)

Notebooks

Part1_TrekPredict_CleaningEDA.ipynb
Part2_TrekPredict_Modelling.ipynb

Data

model_comp.csv (comparing models)
PCT_graph.csv (modelling dataset)
TNG.csv.gz (original dataset -> https://github.com/RTrek/startrekTNGdataset)
total_word_count.csv
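
To get a first look at the original dataset without unpacking it, a minimal sketch along these lines may help. It uses only the Python standard library; the path `Data/TNG.csv.gz`, the UTF-8 encoding, and the helper name `peek_gzip_csv` are assumptions for illustration, and the column names are not documented here, so the sketch only prints whatever header it finds.

```python
import csv
import gzip
from itertools import islice
from pathlib import Path


def peek_gzip_csv(path, n_rows=3):
    """Return (header, first n_rows data rows) of a gzip-compressed CSV file."""
    # "rt" opens the compressed file in text mode; UTF-8 is an assumption.
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])          # empty list if the file has no rows
        rows = list(islice(reader, n_rows))
    return header, rows


if __name__ == "__main__":
    # Assumed location: the Data folder of a local checkout of this repository.
    data_path = Path("Data") / "TNG.csv.gz"
    if data_path.exists():
        header, rows = peek_gzip_csv(data_path)
        print(header)
        for row in rows:
            print(row)
    else:
        print(f"{data_path} not found; adjust the path to your checkout.")
```

For full analysis, `pandas.read_csv` can also read a `.csv.gz` file directly, since it infers gzip compression from the file extension by default.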

Required libraries:

pandas
numpy
seaborn
matplotlib
scikit-learn (imported as sklearn)

Extras:

shap (https://shap.readthedocs.io/en/latest/index.html)
mord (https://pythonhosted.org/mord/)
time (part of the Python standard library, no install needed)
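
Before opening the notebooks, it can be useful to confirm that the libraries listed above are importable. The following is a rough sketch, not part of the project itself; the helper name `missing_modules` is hypothetical, and the module names are the import names (for example, `sklearn` is provided by the `scikit-learn` package).

```python
from importlib.util import find_spec

# Import names taken from the lists above.
REQUIRED = ["pandas", "numpy", "seaborn", "matplotlib", "sklearn"]
EXTRAS = ["shap", "mord"]


def missing_modules(names):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    for label, names in (("required", REQUIRED), ("extra", EXTRAS)):
        missing = missing_modules(names)
        print(f"Missing {label} modules: {missing or 'none'}")
```

Any module reported as missing would need to be installed in the environment that runs the notebooks.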
