This repository contains Jupyter Notebooks
for detailed analysis of the Netflix titles dataset using R programming language
. The dataset offers a comprehensive overview of the diverse selection of movies and TV shows available on Netflix, providing rich details on various aspects of media offerings. This data is instrumental for in-depth analysis, making it an invaluable resource for media analysts, content strategists, and digital entertainment researchers.
- Data Preparation and Cleaning
- Importing Libraries
- Loading Dataset
- Data Overview
- Removing Unwanted Columns
- Detection of Duplicate Records
- Checking Missing Values
- Changing Data Type of Columns
- Creating New Column from "duration"
- Preview of Data
- Exploratoy Data Analysis (EDA)
- Descriptive Statistics
- Visual Data Exploration
The Jupyter Notebooks
in this repository can be viewed directly on GitHub, which allows for easy review of the analysis and outcomes without the need for local execution. For an interactive experience or to modify the analysis, it is recommended to clone the repository and work with the notebooks locally.
If you wish to execute or edit the notebooks on your own machine, ensure you have an R distribution installed, along with the packages mentioned in the notebooks. Jupyter users will need to install IRKernel
to enable the execution of R
within this environment.
If you are interested in contributing to this project, please fork the repository and consider submitting pull requests with your enhancements.