Skip to content

peviitor-ro/JobsScrapers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Job Scrapers for Peviitor.ro

This project aims to streamline the process of gathering job details, including job title, city, country, and link, and subsequently integrating them into the peviitor.ro. This initiative leverages the power of Python and incorporates essential libraries like bs4 (Beautiful Soup) and requests for web scraping.

Objectives

This project encompasses the following primary objectives:

  • Enhanced Accessibility: Facilitate easier access to job listings for a wider audience by extracting data from various company websites and seamlessly integrating it into peviitor.ro.

  • Automated Data Refresh: Integrate scrapers into GitHub Actions for scheduled, daily execution, ensuring that the job listings remain current and reliable.

Features

The project incorporates the following key features:

  • BeautifulSoup4 (bs4): A Python library for web scraping purposes, allowing for efficient extraction of data from HTML and XML files.
  • Requests: A popular Python library for making HTTP requests, essential for fetching web page content.
  • GitHub Actions: Enables automated workflows directly from GitHub repositories, in this context, ensuring regular updates of job listings.

Setup

To get started with the project, follow these steps:

  • Install and Configure Python 3: Ensure you have Python 3 installed and properly configured on your system.
  • Set Up Your IDE: Prepare your preferred Integrated Development Environment (IDE) for working on this project.
  • Import Cloned Repository as Project: Clone the repository and import it into your IDE as a new project.
  • Add Folder Path to PYTHONPATH: Include the path of the project folder in the PYTHONPATH variable in your environment settings.
  • Install Required Packages: Use the following command to install all the necessary packages:
pip install -r requirements.txt