A project for extracting information from WhatsApp data download packages using an online platform (PORT)


WhatsApp Data Extractor

The WhatsApp data donation script is developed by the SoDa team in collaboration with the Human Data Science group and Eyra. This project is part of the Data Donation initiative and is implemented using Port, a frontend for data donation. The WhatsApp Data Extractor aims to give researchers access to information extracted from WhatsApp Data Download Packages (DDPs) for further analysis while protecting the privacy of the data donors. This study uses a deprecated version of Port, but major parts of the extraction script can be reused. Here you can find a list of other data donation studies that were completed using Port.

WhatsApp data donation process

Participants can voluntarily donate their WhatsApp data for research purposes:

  1. Request your WhatsApp DDP through the WhatsApp application (at group or account level).
  2. Download the data package to your local storage.
  3. Log in to the Port environment and select your data package. Data extraction and anonymization run locally in your browser, so at this point you have not shared any data yet.
  4. Review the extracted data and, if you agree, donate it.
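
This README does not describe how the anonymization in step 3 is carried out. As an illustration only, one common approach is to replace each distinct sender name with a stable placeholder, so that message patterns stay analysable without exposing real names. The helper name `pseudonymize` and the `person_N` labels below are hypothetical and not taken from this repository.

```python
from typing import Dict, Iterable, List


def pseudonymize(names: Iterable[str]) -> List[str]:
    """Replace each distinct name with a stable placeholder (hypothetical helper)."""
    mapping: Dict[str, str] = {}
    result: List[str] = []
    for name in names:
        # The first time a name is seen it gets the next free placeholder
        if name not in mapping:
            mapping[name] = f"person_{len(mapping) + 1}"
        result.append(mapping[name])
    return result


senders = ["Alice", "Bob", "Alice", "Carol"]
print(pseudonymize(senders))  # ['person_1', 'person_2', 'person_1', 'person_3']
```

Because the mapping lives only inside the function call, the link between real names and placeholders is discarded once the function returns.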

Data Extraction overview

This project consists of two main scripts: one extracts information from WhatsApp group chats, the other from WhatsApp account data. The extraction logic lives in the process function in data_extractor/[whatsapp_chat or whatsapp_account_info]/__init__.py, which follows the PORT template format. More information about collaboration with PORT can be found here.
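
To give a feel for what a process-style function can look like, here is a minimal sketch that counts messages per sender in a zipped chat export. It is not the repository's implementation: the signature `process(file_data)`, the returned list of dictionaries, and the Android-style line format matched by `LINE_RE` are all assumptions made for illustration.

```python
import io
import re
import zipfile
from collections import Counter

# Assumed export line format: "01/02/2023, 10:15 - Alice: Hi"
LINE_RE = re.compile(r"^\d{1,2}/\d{1,2}/\d{2,4}, \d{1,2}:\d{2} - ([^:]+): (.*)$")


def process(file_data):
    """Hypothetical sketch: count messages per sender in a zipped chat export."""
    counts = Counter()
    with zipfile.ZipFile(file_data) as archive:
        for name in archive.namelist():
            if not name.endswith(".txt"):
                continue  # skip media and other attachments
            text = archive.read(name).decode("utf-8", errors="replace")
            for line in text.splitlines():
                match = LINE_RE.match(line)
                if match:  # system notices and continuation lines do not match
                    counts[match.group(1)] += 1
    # Only aggregated counts are returned, never the message text.
    # Sender names are kept here solely to keep the sketch short.
    return [{"sender": s, "messages": n} for s, n in sorted(counts.items())]


# Build a tiny in-memory package to try the function out
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr(
        "chat.txt",
        "01/02/2023, 10:15 - Alice: Hi\n"
        "01/02/2023, 10:16 - Bob: Hello\n"
        "01/02/2023, 10:17 - Alice: Bye\n",
    )
buffer.seek(0)
print(process(buffer))
```

Returning aggregates rather than raw lines keeps the donated data small and easy for the participant to review before consenting.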

The script runs through an online platform called Port:

  • Group level

[Screenshots: group-level output]

  • Account level

[Screenshot: account-level output]

Usage

  • Install Poetry:
pip install poetry
  • Install the required Python packages:
poetry install

The behavior of the process function can be verified by running the tests, which are located in the tests folder. To run the tests:

poetry run pytest
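
For readers who want to add their own checks, the sketch below shows the general shape of a pytest-style test that builds a small data package in memory. It does not reproduce the repository's tests: `list_chat_files` is a hypothetical stand-in for the code under test, and `make_package` is a hypothetical helper.

```python
import io
import zipfile


def list_chat_files(file_data):
    """Hypothetical stand-in for the code under test: .txt files inside a zip."""
    if not zipfile.is_zipfile(file_data):
        return []  # not a valid package, nothing to extract
    with zipfile.ZipFile(file_data) as archive:
        return sorted(n for n in archive.namelist() if n.endswith(".txt"))


def make_package(files):
    """Build an in-memory zip from a {file name: content} mapping."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name, content in files.items():
            archive.writestr(name, content)
    buffer.seek(0)
    return buffer


def test_finds_chat_files():
    package = make_package({"chat.txt": "hello", "photo.jpg": "x"})
    assert list_chat_files(package) == ["chat.txt"]


def test_rejects_non_zip_input():
    assert list_chat_files(io.BytesIO(b"not a zip")) == []
```

Saved as a file whose name starts with test_ inside the tests folder, functions like these would be collected by pytest automatically. Building the package in memory avoids committing real chat data to the repository.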

License

The code in this project is released under MIT license.

Contact

WhatsApp Data Extractor is a project by the Human Data Science group. The technical implementation is provided by the ODISSEI Social Data Science (SoDa) team.

Do you have questions, suggestions, or remarks on the technical implementation? File an issue in the issue tracker or feel free to contact Parisa Zahedi or Shiva Nadi.

Project Link: https://github.com/sodascience/port-whatsapp-datadonation
