This repository houses a collection of more than 100 Python web scrapers. These scrapers are designed to run daily through GitHub Actions as part of a CI/CD pipeline. Their primary purpose is to ensure that the job listings on https://peviitor.ro/ remain current, providing users with accurate and relevant data.
- python as a programming language
- bs4(BeautifullSoup)
- requests
- job titles
- links
- locations
- job types
The accuracy of the scraped data can be verified on the dedicated verification website: https://scrapers.peviitor.ro/.
To utilize this project, follow these steps:
-
Download or clone this project to a local folder.
-
Install Python 3.x and the required libraries using the following command:
-
Run the
peviitor_scraper.py
file and follow the instructions in the command-line interface to input your search term and other settings. -
The search results will be saved in a JSON file, for example:
results.json
.
This project is open for contributions. If you wish to contribute, you can fork the project and submit a pull request with your changes.
This project is licensed under the MIT License - see the LICENSE file for details.