This repository contains the data and code for our paper Analyzing the Dynamics of Climate Change Discourse on Twitter: A New Annotated Corpus and Multi-Aspect Classification published at LREC-COLING 2024.
The discourse surrounding climate change on social media platforms has emerged as a significant avenue for understanding public sentiments, perspectives, and engagement with this critical global issue. The unavailability of publicly available datasets, coupled with ignoring the multi-aspect analysis of climate discourse on social media platforms, has underscored the necessity for further advancement in this area. To address this gap, in this paper, we present an extensive exploration of the intricate realm of climate change discourse on Twitter, leveraging a meticulously annotated ClimaConvo dataset comprising 15,309 tweets. Our annotations encompass a rich spectrum, including aspects like relevance, stance, hate speech, the direction of hate, and humor, offering a nuanced understanding of the discourse dynamics. We address the challenges inherent in dissecting online climate discussions and detail our comprehensive annotation methodology. In addition to annotations, we conduct benchmarking assessments across various algorithms for six tasks: relevance detection, stance detection, hate speech identification, direction and target, and humor analysis. This assessment enhances our grasp of sentiment fluctuations and linguistic subtleties within the discourse. Our analysis extends to exploratory data examination, unveiling tweet distribution patterns, stance prevalence, and hate speech trends. Employing sophisticated topic modeling techniques uncovers underlying thematic clusters, providing insights into the diverse narrative threads woven within the discourse. The findings present a valuable resource for researchers, policymakers, and communicators seeking to navigate the intricacies of climate change discussions. The dataset and resources to reproduce the experiments are available at https://github.com/shucoll/ClimaConvo.
Please refer to this link in the official Twitter API guide to get the tweet contents from the tweet IDs https://developer.twitter.com/en/docs/twitter-api/tweets/lookup/api-reference/get-tweets
Class | Terminology |
---|---|
Non-Relevant | 0 |
Relevant | 1 |
Class | Terminology |
---|---|
Support | 1 |
Denial | 2 |
Neutral | 3 |
Class | Terminology |
---|---|
No-Humor | 0 |
Humor | 1 |
Class | Terminology |
---|---|
No-Hate | 0 |
Hate | 1 |
Class | Terminology |
---|---|
Undirected | 0 |
Directed | 1 |
Class | Terminology |
---|---|
Individual | 1 |
Organization | 2 |
Community | 3 |
@inproceedings{shiwakoti2024analyzing,
title={Analyzing the dynamics of climate change discourse on twitter: A new annotated corpus and multi-aspect classification},
author={Shiwakoti, Shuvam and Thapa, Surendrabikram and Rauniyar, Kritesh and Shah, Akshyat and Bhandari, Aashish and Naseem, Usman},
booktitle={Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)},
pages={984--994},
year={2024}
}