🎬 Netflix Movie Recommendation System

A content-based movie recommendation system built using Python, Pandas, and scikit-learn, designed to suggest movies/TV shows based on similarity in metadata such as language, availability, release date, and viewing statistics.

The system supports retraining on new datasets and provides a Flask-based web interface for real-time recommendations.

📌 Features

Content-Based Filtering: Recommendations are based on similarity in categorical and numerical attributes (e.g., genre, language, popularity).
Data Cleaning Pipeline: Automatic handling of missing values, duplicate titles, and inconsistent formats.
Custom Feature Engineering: Generates Content_ID for efficient mapping and similarity computation.
Web Interface: Flask app for user-friendly recommendation queries.
Retraining Support: Easily retrain on updated or custom datasets.
Optimized for Deployment: Saves model, preprocessor, and metadata for quick loading.

⚙️ How It Works

Data Loading & Cleaning
- Reads CSV dataset
- Cleans numeric fields (Hours Viewed)
- Drops duplicate and missing titles
- Extracts Release_Year from Release Date
- Assigns unique Content_ID to each item
Feature Engineering
- Categorical Columns: One-hot encoded
- Numerical Columns: Standard scaled
- Saves metadata and preprocessor for future use
Recommendation
- Computes cosine similarity between feature vectors
- Returns top-N most similar items to a given title

🚀 Getting Started

1️⃣ Clone the Repository

git clone https://github.com/yourusername/Netflix_Movie_Recommendation_System.git
cd Netflix_Movie_Recommendation_System

2️⃣ Prepare the Dataset

Place your dataset (CSV) inside the data/ folder.
Required Columns:
Title Available Globally? Release Date Hours Viewed Language Indicator Content Type

3️⃣ Retrain the Model (Optional)

If you want to train on your dataset:

python train.py --input (datapath) --outdir ./models --neighbors 50(optional)

4️⃣ Run the Web App

python app.py

Browser View:

http://127.0.0.1:5000/

🎯 Usage Example

Example Query:

Input: wednesday

Output: Similar shows/movies based on metadata (language, year, popularity, etc.).

The recommendations are not random — they are based on metadata similarity, meaning the system suggests titles with similar language, release period, and audience engagement patterns.

🔮 Future Improvements

Include genre-based similarity from NLP on movie descriptions
Add collaborative filtering using user ratings
Support multi-language search
Deploy the app to Heroku / Render

👨‍💻 Author

Dipean Dasgupta
Computer Science Graduate | EdgeAI & ML Enthusiast

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
models		models
static		static
templates		templates
Exploratory_Data_Analysis.ipynb		Exploratory_Data_Analysis.ipynb
README.md		README.md
app.py		app.py
config.py		config.py
data_prep.py		data_prep.py
recommender.py		recommender.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎬 Netflix Movie Recommendation System

📌 Features

⚙️ How It Works

🚀 Getting Started

1️⃣ Clone the Repository

2️⃣ Prepare the Dataset

3️⃣ Retrain the Model (Optional)

4️⃣ Run the Web App

Browser View:

🎯 Usage Example

🔮 Future Improvements

👨‍💻 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎬 Netflix Movie Recommendation System

📌 Features

⚙️ How It Works

🚀 Getting Started

1️⃣ Clone the Repository

2️⃣ Prepare the Dataset

3️⃣ Retrain the Model (Optional)

4️⃣ Run the Web App

Browser View:

🎯 Usage Example

🔮 Future Improvements

👨‍💻 Author

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages