Skip to content

Latest commit

 

History

History
42 lines (30 loc) · 1.61 KB

README.md

File metadata and controls

42 lines (30 loc) · 1.61 KB

#MugLife

Which beverage reigns superior? An analysis of the great Tea vs Coffee debate ☕️

Project Overview

This project uses Natural Language Processing techniques to determine once and for all which beverage the world prefers.

The analysis is conducted on the top posts and comments from Reddit.

Set up

  1. Install Python and Conda
  2. Install all the required libraries by pip install -r requirements.txt
  3. Rename .env_template to .env and fill out the .env file params

Reddit data collection

  1. Register an app
  2. Go over PRAW

Google Reviews data collection

  1. Create a Google account
  2. Follow the instruction on the link

YouTube data collection

  1. Create Google account
  2. Follow the instructions on the link

Run

Reddit

  • Run mug_life.ipynb for data collection and analysis of reddit data using terminal or editor

Google review

  • Run google_reviews_data_collection.ipynb for data collection
  • Run cup_of_town.ipynb for data exploration of Google review data using terminal or editor

YouTube

  • Run data_collection_yt.ipynb for youtube data collection
  • Run data_analysis_yt.ipynb for data exploration of YouTube data using the terminal or editor

Contributors

🍃 Milindi Kodikara    ✨ Syeda Shabnam Khan    🎈 Mahawattage Perera

© 2024 Copyright for this project by its contributors.