colin4554/stocks

Stock News Scraper (and other related, useful functions)

A great way to collect data for investing, sentiment analysis, or other projects!

  • Scrape recent news articles for any ticker listed on finviz.com and store the title, full text, summary, keywords, and/or other data
  • Schedule your news scraping and use built-in functions to scrape only when necessary and avoid duplicate scraping
  • Get the current list of S&P 500 tickers with a simple scraping function
  • Receive email updates each time the scraper runs with informative logs
  • Deploy the repository + database either locally or on Google Cloud Platform (for free!)
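
As an illustration of the duplicate-avoidance idea in the list above, here is a minimal sketch. It is not the repository's actual code: the function name `filter_new_articles` and the choice of the article URL as the deduplication key are assumptions made for this example.

```python
from typing import Iterable, List, Set


def filter_new_articles(candidate_urls: Iterable[str], seen_urls: Set[str]) -> List[str]:
    """Return candidate URLs that are not already stored.

    Hypothetical helper: assumes each article is identified by its URL.
    Order is preserved and repeats within the candidates are dropped.
    """
    seen = set(seen_urls)  # copy so the caller's set is not mutated
    new_urls: List[str] = []
    for url in candidate_urls:
        if url not in seen:
            seen.add(url)
            new_urls.append(url)
    return new_urls


if __name__ == "__main__":
    already_scraped = {"https://example.com/b"}
    found = [
        "https://example.com/a",
        "https://example.com/b",
        "https://example.com/a",
    ]
    print(filter_new_articles(found, already_scraped))
```

Only the URLs returned by such a filter would need to be downloaded and parsed, which keeps scheduled runs from re-scraping the same articles.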

Running Scripts

  • Clone or download the repository into the desired directory
  • Run pip install -r requirements.txt
  • Set EMAIL_ADDRESS in email_update.py to the address that should receive log updates
  • If you want to run this package completely in the cloud, set up a free GCP f1-micro instance
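
The email log update could look roughly like the sketch below, which only builds the message with Python's standard `email.message.EmailMessage`. The helper name `build_log_email`, the subject line, and the placeholder address are assumptions for illustration; how email_update.py actually composes and sends its messages is not shown here.

```python
from email.message import EmailMessage
from typing import List

# Placeholder value; in the repository this setting lives in email_update.py.
EMAIL_ADDRESS = "you@example.com"


def build_log_email(log_lines: List[str], recipient: str = EMAIL_ADDRESS) -> EmailMessage:
    """Build (but do not send) a plain-text email containing the run's log lines.

    Hypothetical helper, not the repository's implementation.
    """
    msg = EmailMessage()
    msg["Subject"] = f"Scraper run finished ({len(log_lines)} log lines)"
    msg["To"] = recipient
    msg.set_content("\n".join(log_lines))
    return msg


if __name__ == "__main__":
    message = build_log_email(["Run started", "Scraped 12 new articles", "Run finished"])
    print(message)
```

A message built this way can be handed to `smtplib.SMTP.send_message` once a mail server and credentials are configured.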

Deploying database locally

  • Create a local PostgreSQL table
  • Set LOCAL_DATABASE_ID in schedule_scraper.py to the PostgreSQL table ID
  • Use local_stock_news_scraper.py
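
For the table-creation step, the sketch below builds one possible `CREATE TABLE` statement. The table name `stock_news` and the column set are assumptions drawn from the fields listed earlier (title, full text, summary, keywords); the schema that local_stock_news_scraper.py actually expects may differ, so check the script before creating the table.

```python
from typing import Dict

# Hypothetical column layout; every name and type here is an assumption.
DEFAULT_COLUMNS: Dict[str, str] = {
    "url": "TEXT PRIMARY KEY",  # unique key so the same article is stored once
    "ticker": "TEXT NOT NULL",
    "title": "TEXT",
    "full_text": "TEXT",
    "summary": "TEXT",
    "keywords": "TEXT",
}


def build_create_table_sql(table_name: str = "stock_news",
                           columns: Dict[str, str] = DEFAULT_COLUMNS) -> str:
    """Return a CREATE TABLE statement for the given (hypothetical) schema."""
    column_sql = ",\n    ".join(f"{name} {sql_type}" for name, sql_type in columns.items())
    return f"CREATE TABLE IF NOT EXISTS {table_name} (\n    {column_sql}\n);"


if __name__ == "__main__":
    print(build_create_table_sql())
```

The printed statement can be run in `psql` or any other PostgreSQL client.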

Deploying database on Google Cloud Platform (GCP) as BigQuery

  • Set up a Google Cloud Platform account
  • Create a BigQuery table
  • Download credentials for the BigQuery API
  • Set GCP_DATABASE_ID in schedule_scraper.py to the BigQuery table ID
  • Use gcp_stock_news_scraper.py
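
BigQuery tables are commonly addressed by a fully qualified ID of the form `project.dataset.table`. Assuming GCP_DATABASE_ID takes that form (the README does not say so explicitly), a small check like the sketch below can catch a malformed value before the scraper runs. The names `TableRef` and `parse_table_id` are hypothetical, and the check assumes the project ID itself contains no dots.

```python
from typing import NamedTuple


class TableRef(NamedTuple):
    project: str
    dataset: str
    table: str


def parse_table_id(table_id: str) -> TableRef:
    """Split a 'project.dataset.table' string into its three parts.

    Raises ValueError if the ID does not have exactly three non-empty parts.
    """
    parts = table_id.split(".")
    if len(parts) != 3 or not all(parts):
        raise ValueError(
            f"Expected 'project.dataset.table', got {table_id!r}"
        )
    return TableRef(*parts)


if __name__ == "__main__":
    # Placeholder ID for illustration only.
    print(parse_table_id("my-project.stocks.news"))
```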

Disclaimer: This project was undertaken for learning and personal use. Please be respectful of websites when scraping.

About

Stock data scrapers and other useful scripts
