This project analyzes trending YouTube videos using Python. The dataset includes video and channel metadata such as views, likes, duration, categories, captions, and more. The goal is to understand what drives a video to trend and identify patterns that contribute to higher viewer engagement.
- Source: Kaggle - YouTube Trending Videos
- Rows: 115
- Columns: 19
- Scope: Trending videos with different categories
- Time Period: [2007-03-30 to 2016-06-04]
channelId,channelTitle,publishedAt,videoId,videoTitle,videoDescription,duration,durationSec,viewCount,likeCount,dislikeCount,commentCount,videoCategoryId,videoCategoryLabel,definition,caption,Licensed Content
To explore and analyze what makes videos trend on YouTube by:
- Examining relationships between views, likes, duration, and comments
- Identifying high-performing content features
- Visualizing trends across video characteristics
-
Data Cleaning
- Removed duplicates and handled missing values
- Converted and standardized date and duration formats
-
Exploratory Data Analysis
- Analyzed views, likes, comments, and video durations
- Visualized correlations between engagement activities
-
Insights & Conclusion
- Videos with more likes and comments tend to trend more
- Videos published between 9 AM to 12 PM and 1 AM to 4 PM get more views.
- Interestingly videos with more dislikes are reason for less views
- Comparing top 10 videos with more views with likes, dislikes and comments
- Barplots showing correlations between views and different engagement activities
- Barplots showing duration and their frequency for top 15 trending videos
- Python
- Pandas
- Matplotlib, Seaborn
- Google Colab
This project includes a Jupyter Notebook file (YouTube_Trending_Data_Analysis.ipynb) that contains the full code, analysis, and explanations.
-
Google Colab (Free and easy):
- Go to https://colab.research.google.com
- Upload the notebook file or open it directly from Google Drive.
- All code and markdown (text) cells will render properly.
-
VS Code:
- Open the folder containing the notebook in VS Code.
- Click on the
.ipynbfile to open it. - Make sure the notebook opens in the Jupyter Notebook Editor view (not plain text).
- If it opens as JSON or plain text, right-click the file, select “Reopen With” → “Jupyter Notebook Editor”.
-
Jupyter Notebook/Lab (Local environment):
- Install Jupyter via Anaconda or pip if not already installed.
- Run
jupyter notebookorjupyter labin your terminal. - Navigate to the notebook file in the browser and open it.
If you see raw JSON text when opening the notebook, it means it’s not opening in the proper notebook viewer. Use one of the above methods to view the formatted notebook with code and markdown rendered.
This analysis highlights the major traits of trending YouTube content. Creators aiming to increase visibility can focus on:
- Publish time
- Targeting high-interest categories (e.g., People & Blogs, Science & Technology)
- Optimizing video length for engagement(6 to 14 )
| File | Description |
|---|---|
| Youtube_Trending_data_analysis.ipynp | Jupyter Notebook with full code and analysis |
README.md |
Project overview |
Youtube_Trending_videos_data.csv |
Dataset used for analysis |
| cleaned_data.csv | cleaned dataset |
- Dataset from Kaggle-Trending videos on Youtube
- YouTube Trending data publicly available under fair usage
For queries, feedback, connect via \gundetisailikith@gmail.com.