If you've always wanted to dive into the riveting world of insider trading data but were too lazy to sift through thousands of pages by hand, buckle up, because you're in for a treat. This is your magic carpet through time: it gathers juicy tidbits of insider trading data from 2013 up to the current month. (If it could gather data from the future, I'd probably be on my yacht in the Caribbean by now.)
The code in this repository uses Python's requests and BeautifulSoup libraries to scrape data from openinsider. The results are neatly tucked away in a CSV file. I also added threading so it's as fast as a leopard.
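To give a feel for the threading and CSV steps, here is a minimal sketch, not the repository's actual code. The names `fetch_month` and `scrape_to_csv`, the column headers, and the fake row are all made up for illustration; `fetch_month` stands in for the real requests and BeautifulSoup work so the example runs offline.

```python
import csv
from concurrent.futures import ThreadPoolExecutor


def fetch_month(month):
    """Hypothetical stand-in for the requests + BeautifulSoup step.

    Returns a list of rows for one month. The row here is fake data.
    """
    return [[month, "EXAMPLE", "P - Purchase"]]


def scrape_to_csv(months, path, fetch=fetch_month, max_workers=8):
    """Fetch every month on a thread pool and write all rows to one CSV."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input months
        per_month = list(pool.map(fetch, months))

    # Flatten the per-month lists into one list of rows
    rows = [row for month_rows in per_month for row in month_rows]

    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["month", "ticker", "trade_type"])  # assumed columns
        writer.writerows(rows)
    return len(rows)
```

Calling `scrape_to_csv(["2013-01", "2013-02"], "out.csv")` writes a header plus one row per month. Threads suit this kind of job because the work is mostly waiting on the network.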
The script also comes with a built-in logger that logs events into a file because why not.
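If you're curious what file logging looks like with Python's standard `logging` module, here is a small sketch. The helper name `make_logger`, the logger name, and the message format are assumptions for illustration; the script's real logger may be set up differently.

```python
import logging


def make_logger(log_path, name="openinsider_sketch"):
    """Return a logger that appends timestamped records to log_path.

    Note: the file is fixed by the first call for a given name; later
    calls with the same name reuse the existing handler.
    """
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid duplicate handlers on repeated calls
        handler = logging.FileHandler(log_path, encoding="utf-8")
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

Usage would be along the lines of `make_logger("scraper.log").info("scraped %d rows", 42)`, which appends one timestamped line to the file.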
Simply build the image and run:
docker buildx build -t openinsider ./
mkdir data
docker run \
-v "${PWD}/data":/data \
-e OUTPUT_DIR="/data" \
-it openinsider
You can also build the daily image and tell it when to start scraping:
docker buildx build -t openinsider-daily -f Dockerfile.daily ./
mkdir data
docker run \
-v "${PWD}/data":/data \
-e OUTPUT_DIR="/data" \
-e START_DATE="2024-03-01" \
-it openinsider-daily
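For the curious, here is a rough sketch of how a script could pick up `OUTPUT_DIR` and `START_DATE` from the environment and walk forward from the start date. The function names `read_config` and `months_between`, the default values, and the month-by-month stepping are assumptions for illustration; how the daily image actually uses these variables is not shown here.

```python
import os
from datetime import date, datetime


def read_config(env=os.environ):
    """Read OUTPUT_DIR and START_DATE; both defaults are assumptions."""
    output_dir = env.get("OUTPUT_DIR", "data")
    raw_start = env.get("START_DATE", "2013-01-01")
    # START_DATE is expected as YYYY-MM-DD, e.g. "2024-03-01"
    start = datetime.strptime(raw_start, "%Y-%m-%d").date()
    return output_dir, start


def months_between(start, end):
    """Yield (year, month) pairs from start's month through end's month."""
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        yield year, month
        month += 1
        if month > 12:
            year, month = year + 1, 1
```

A caller could then loop over `months_between(start, date.today())` and scrape one month at a time. A malformed `START_DATE` raises `ValueError` from `strptime`, which is probably what you want: fail loudly before any scraping starts.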
Running the script is as easy as a walk in the park... on a sunny day... with your favorite ice cream in your hand. To run it without Docker, clone the repository and make sure the required libraries are installed:
pip install --upgrade pip
pip install requests beautifulsoup4
and then just run the Python script. Grab a cup of coffee, and watch it do the work.
While this tool is quite powerful, it comes with no guarantee of making you rich. It will just make you data rich, which isn't necessarily the same thing. Also, this tool does not promote insider trading. It's an insider-trading data scraper, not an insider trading-data scraper.
Enjoy the script and may the odds be ever in your favor!