Investigate the TMDb movie dataset

This project was done as part of Udacity's Data Analyst Nanodegree - Term 1.

The TMDb Movie dataset, one of Udacity's curated datasets, was selected for investigation using NumPy and pandas. The dataset contains information on around 10,000 movies. For each movie, it includes popularity, budget, revenue, cast, directors, production house, release date, runtime, and rating.

Outline of analysis

Assessed the data and brainstormed questions that could be answered using the data
Performed necessary cleaning steps to unify formats, deal with missing data and prepare the dataset for analysis
Wrangled and explored the data using pandas and NumPy to gather insights about the relationships between different aspects, created visualizations using matplotlib, and made inferences to answer the research questions
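
The cleaning steps above could look roughly like the following. This is a minimal sketch, not the notebook's actual code: the column names (`budget`, `revenue`, `release_date`), the helper `clean_movies`, and the choice to treat zero budgets and revenues as missing are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd


def clean_movies(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of the movie table (hypothetical helper)."""
    # Drop fully duplicated rows and work on a copy
    out = df.drop_duplicates().copy()

    # Assumption: a budget or revenue of 0 means "not recorded", so mark it missing
    money_cols = ["budget", "revenue"]
    out[money_cols] = out[money_cols].replace(0, np.nan)

    # Unify the date format; unparseable values become NaT
    out["release_date"] = pd.to_datetime(out["release_date"], errors="coerce")
    out["release_month"] = out["release_date"].dt.month
    return out


# Tiny made-up sample, not real TMDb data
sample = pd.DataFrame({
    "original_title": ["A", "A", "B", "C"],
    "budget": [100, 100, 0, 50],
    "revenue": [300, 300, 20, 0],
    "release_date": ["2015-06-09", "2015-06-09", "2014-12-01", "not a date"],
})
cleaned = clean_movies(sample)
```

With the real file, the same helper would be applied to `pd.read_csv("tmdb_movies.csv")`, provided the columns match the names assumed here.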

Research Questions

How have movie production trends varied over the years?
What are the top 20 highest grossing movies? What are the top 20 most expensive movies?
How do budgets correlate with revenues? Do higher budget movies have higher revenue?
Are certain release months associated with higher revenues?
Which months have seen the most releases?
How do ratings correlate with commercial success (profits)?
What run times are associated with each genre?
Who are the top 20 directors who made highly rated films? Only directors who made at least 5 movies in the period 1960 - 2015 covered by the dataset are considered.
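
As an illustration of how two of these questions might be approached with pandas, here is a hedged sketch. The column names (`director`, `vote_average`, `budget`, `revenue`) and the helper `top_directors` are assumptions, the sample rows are made up, and the notebook may take a different route (for example, handling films with several directors).

```python
import pandas as pd


def top_directors(df: pd.DataFrame, min_movies: int = 5, n: int = 20) -> pd.DataFrame:
    """Rank directors by mean rating, keeping those with enough movies (hypothetical helper)."""
    stats = df.groupby("director")["vote_average"].agg(
        movies="count", mean_rating="mean"
    )
    eligible = stats[stats["movies"] >= min_movies]
    return eligible.sort_values("mean_rating", ascending=False).head(n)


# Made-up rows for demonstration only
toy = pd.DataFrame({
    "director": ["X", "X", "Y", "Y", "Y", "Z"],
    "vote_average": [8.0, 7.0, 6.0, 6.5, 7.0, 9.0],
    "budget": [10, 20, 30, 40, 50, 60],
    "revenue": [20, 40, 60, 80, 100, 120],
})

# Threshold lowered to 2 so the toy data yields a result; the README uses 5
ranking = top_directors(toy, min_movies=2, n=20)

# Pearson correlation between budget and revenue
corr = toy["budget"].corr(toy["revenue"])
```

Director Z has the highest single rating but is excluded for having too few movies, which is the point of the minimum-count filter.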

