This repository contains code and data for the workshop "Visual Disinformation and the Dark Side of Internet Memes" at the Applied Machine Learning Days EPFL 2022 (Workshop Link).
Click on the following badge to open the notebook in Google Colab (recommended):
Click on the following badge to open the notebook in Google Colab (recommended):
If you want to run the code locally, follow the instructions below to setup your environment.
Clone the repository and install dependencies.
- Clone the repository or download the notebook
- Install dependencies
pip install -r requirements_part1.txt
- set global variable
DATA_ROOT_PATH
to any directory (via notebook)
Clone the repository and install dependencies. Warning: In this version you will see the solutions to some exercises (cell-hiding is a Colab feature).
- Clone the repository. To get the data you need Git lfs while cloning the repository. Alternatively, you can download the data from this Link
(Optional) install git-lfs:
apt-get update
apt-get install git-lfs
Clone the repository:
git clone https://github.com/i4Ds/AMLD-2022-Visual-Disinformation.git
cd AMLD-2022-Visual-Disinformation
- Install the dependencies
pip install -r requirements_part2.txt
- Prepare the data (if not cloned via git-lfs)
Place the data into your preferred directory (default is ./data/) and unpack.
tar -xf ./data/GRU_202012.tar.gz --directory ./data/
- Open Notebook: In the notebook you can skip the data-fetching / unpacking steps.
Raphael Meier, Scientific Project Manager, armasuisse S+T
Marco Willi, Research Associate, FHNW
Michael Graber, Professor, FHNW
FHNW - University of Applied Sciences and Arts Northwestern Switzerland
Data for Part 1 are from:
Data for Part 2 are from:
- Twitter Transparency. Any usage is subject to Twitter's terms and conditions
Radford, Alec, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, et al. “Learning Transferable Visual Models From Natural Language Supervision.” ArXiv:2103.00020 [Cs], February 26, 2021. http://arxiv.org/abs/2103.00020.
Kiela, Douwe, Hamed Firooz, Aravind Mohan, Vedanuj Goswami, Amanpreet Singh, Pratik Ringshia, and Davide Testuggine. “The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes.” ArXiv:2005.04790 [Cs], April 7, 2021. http://arxiv.org/abs/2005.04790.
Suryawanshi, Shardul, Bharathi Raja Chakravarthi, Mihael Arcan, and Paul Buitelaar. “Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text.” In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 32–41. Marseille, France: European Language Resources Association (ELRA), 2020. https://aclanthology.org/2020.trac-1.6.