Analysis of public opinion on the World Cup, its controversies, and the use of the popularity of the event by other groups to advertise or publicise other issues.
Jan Fiszer - fiszer@student.agh.edu.pl
Konrad Pawlik - konradpawlik@student.agh.edu.pl
A detailed description of the steps, the methods used and the conclusions of the analyses carried out can be found in the report - FIFAWorldCup2022-Report
- Python 3
- numpy
- pandas
- tweepy
- nltk
- demoji
- matplotlib
- wordcloud
- networkx
- nrclex
- vaderSentiment
- collect_data.ipynb
- preprocessing.ipynb
- exploratory_data_analysis.ipynb
- social_network_analysis.ipynb
- hashtag_usage_through_time.ipynb
- social_content_analysis.ipynb
Disclaimer: Both social network and content analysis were conducted in parallel, so the associated notebooks should not be run one after the other, but at the same time.
- data
- tweets.csv
- users.csv
Collected data is split into two datasets - tweets.csv and users.csv, which, as the names suggest, respectively contain data on tweets and the users associated with them.
Due to limitations, data can be downloaded from https://drive.google.com/drive/folders/1ujaePKHniGevIWcc4Pv1kLCYXKdTRGZI?usp=share_link