This is a Scrapy Crawler that is used to extract relevant information from Job posting websites
Access Job Crawler WIKI from this link
Install QualiChain Job Crawler using docker by executing the following bash commands
cd config/
docker-compose up -d --build
Then trigger the spider that extract data from kariera.gr
by executing the docker command above
docker exec -it job_crawler scrapy crawl kariera -a job_positions="software engineer,developer"
Extract data from gr.indeed.com
using this command
docker exec -it job_crawler scrapy crawl indeed -a job_positions="software engineer,developer"
Extract data from skywalker.gr
using this command
docker exec -it job_crawler scrapy crawl skywalker -a job_positions="software engineer,developer"
Execute the following command
scrapy crawl kariera -a job_positions="software engineer,developer"
Export data in csv using command line
scrapy crawl kariera -o jobs.csv -t csv -a job_positions="software engineer,developer"