This repository is part of my virtual internship in data analytics and visualization and I have explained how I carried out and completed the project.
As a data analyst, I have been tasked with analyzing the client's data to uncover insights, create visualizations, and present my findings. To accomplish this, I utilized Microsoft Excel for data cleaning and generating insights, Microsoft Power BI for data visualization, and Microsoft PowerPoint for creating presentation slides. Due to their rapid growth, the client lacks the internal resources to effectively manage their data.
click here for cleaned dataset
- The top 5 most popular categories, in descending order, were animals, science, healthy eating, technology, and food. Animals had an aggregate score of 74965. Healthy eating is more popular than food and science and technology also rank high, indicating a preference for factual and informative content.
- The chart illustrates the popularity split among the top 5 categories. The categories are close in popularity, but animals have a slightly larger share, 1.1% more, than the second-ranked category, science. This suggests that animals may continue to grow more popular than the other categories. To maintain a fair balance of content on the platform, it's important to use algorithms that distribute the categories evenly.
1. Joining the Data
Vlookup
2. Power query for data transformation
3. Aggregating scores with top 5 categories
SUMIF
4. Count of top scoring cateogry
COUNTROWS(FILTER (Reactions, Reactions[Category] = "Animals"))
5. Count of unique categories
distinct_categories = CALCULATE(DISTINCTCOUNT(Reactions[Category]))