Introduction
Features
Tech Stack
Run The Project Locally
Extract Data Only
Demo
This is an API that scrapes the newest unanswered question on Stack Overflow by tags. The API should work with any available tags on Stack Overflow.
With this API, user could check out the unanswered questions on Stack Overflow quickly without any hassle. There is a link for each question that leads straight to Stack Overflow.
The motivation behind this project is to practice Python, scraping the web and creating data pipeline skills.
- Can check the unanswered questions by tag
- Can extract data to .csv, .py and .js formats
- Python
- Jupyter Notebook
- Requests_html
- FastAPI
- Pandas
In the terminal execute:
Windows
./start.ps1
MacOS
./start.sh
Check the API out at
Check the app out at http://localhost:8000
The data will be extracted into different formats after the command and can be found in the "data" folder
Execute the below line in the terminal:
Windows
./extract.ps1
MacOS
./extract.sh
https://stackoverflow-api-py.herokuapp.com/