Movie Recommendation System - Content-Based Recommendation System

Overview

This project implements a content-based movie recommendation system utilizing the TMDB 5000 dataset from Kaggle. The system analyzes various movie attributes to generate personalized recommendations based on user input.

Dataset

Source: TMDB 5000 Dataset (https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata)

Version 1: Stemming + Bag of Words + Similarity Search
Version 2: Lemmatization + TFIDF + Similarity Search

Steps

Data Preprocessing: Clean and prepare the dataset for analysis.
Exploratory Data Analysis (EDA): Analyze the dataset to understand its structure and key features.
Feature Engineering: Extract meaningful features from the dataset to enhance recommendation accuracy.
Tag Creation: Generate tags based on multiple columns including Genre, Overview, Keywords, Cast, and Crew.
Text Processing: Apply stemming and lemmatization techniques, and remove stop words to refine the tags for better similarity matching.
Cosine similarity is applied to find the similar movies

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.devcontainer		.devcontainer
data		data
pickle		pickle
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Movie Recommendation System.ipynb		Movie Recommendation System.ipynb
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Recommendation System - Content-Based Recommendation System

Overview

Dataset

Steps

About

Releases

Packages

Languages

License

akshaya13/recommendation-system

Folders and files

Latest commit

History

Repository files navigation

Movie Recommendation System - Content-Based Recommendation System

Overview

Dataset

Steps

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages