Skip to content

A simple task with Scrapy Python library to parse hh.ru

Notifications You must be signed in to change notification settings

ferokuk/Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Crawler

A simple task with Scrapy Python library to parse hh.ru resumes.

If you want to run script:

  1. clone repo

  2. run "pip install -r requirements.txt"

  3. run "scrapy crawl cooks_spider -O c.json" to get all visible cook resumes

  4. run "scrapy crawl machinist_spider -O m.json" to get all visible machinist resumes

  5. run "python json_fix.py" to get 2 result files with all vacancies

About

A simple task with Scrapy Python library to parse hh.ru

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages