This is the Ruby Capstone Project for Microverse, in which students must complete a real-world-like project within 72 hours according to the project specifications.
I challenged myself to build a 3-in-1 web scraper that lets users scrape all courses from udacity.com and all jobs from indeed.com and remote.io into CSV files.
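As a rough illustration of the "into CSV" step, here is a minimal sketch that uses only Ruby's standard `csv` library. The helper name `write_rows_to_csv` and the row fields are assumptions made for this example; they are not taken from the project's `lib/` files, where the rows would come from pages fetched with HTTParty and parsed with Nokogiri.

```ruby
require 'csv'

# Hypothetical helper (not from the project's source): writes an array of
# hashes to a CSV file. The header row is taken from the keys of the first hash.
def write_rows_to_csv(path, rows)
  return if rows.empty?

  CSV.open(path, 'w') do |csv|
    csv << rows.first.keys
    rows.each { |row| csv << row.values }
  end
end

# Example usage with made-up rows (real rows would be scraped):
#   rows = [{ 'title' => 'Example Course', 'level' => 'Beginner' }]
#   write_rows_to_csv('udacity_courses.csv', rows)
```

Taking the header from the first row keeps the sketch short, but it assumes every row has the same keys in the same order.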
├── README.md
├── bin
│   └── main.rb
└── lib
    └── scraper.rb
    └── udacity_scraper.rb
    └── indeed_scraper.rb
    └── remoteio_scraper.rb
└── rspec
    └── scraper_spec.rb
    └── indeed_spec.rb
    └── remoteio_spec.rb
    └── udacity_spec.rb
    └── spec_helper.rb
Feel free to check out this link for a 3-minute video walkthrough :)
- Git clone this repo and cd to the `web_scraper` directory.
- Run `bundle install` in the command line to install the Nokogiri and HTTParty gems.
- Run `bin/main.rb`.
- Input either 'udacity', 'indeed', or 'remote.io' and follow the respective prompts.
- Tada! 'udacity_courses.csv', 'indeed_jobs.csv', or 'remote_io.csv' will be created in the root directory respectively :)
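The link between the accepted inputs and the generated files can be sketched as a simple lookup. This is only an illustration of the behaviour described in the steps above; the constant `OUTPUT_FILES` and the helper `output_file_for` are hypothetical names, and the actual `bin/main.rb` may be organised differently.

```ruby
# Assumed mapping from the accepted inputs to the CSV files named in the
# steps above (the names come from this README, the structure is a guess).
OUTPUT_FILES = {
  'udacity' => 'udacity_courses.csv',
  'indeed' => 'indeed_jobs.csv',
  'remote.io' => 'remote_io.csv'
}.freeze

# Hypothetical helper: normalizes the user's input and returns the
# matching output filename, or nil when the site is not supported.
def output_file_for(input)
  OUTPUT_FILES[input.to_s.strip.downcase]
end
```

Returning `nil` for unknown input lets the caller decide whether to prompt again or exit.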
- Git clone this repo and cd to the `web_scraper` directory.
- Install RSpec with `gem install rspec`.
- Run `rspec` in the command line.
- You will see failures because the 3 scraped files haven't been created yet.
- To fix this, run `ruby bin/main.rb` three times, entering 'udacity', 'indeed', and 'remote.io' in turn.
- Run `rspec` in the command line again. The test cases will pass once each file has been created :)
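To see in advance which specs are likely to fail, one could check which of the three CSV files are still missing. The sketch below is an assumption about how such a check might look; `EXPECTED_CSVS` and `missing_csvs` are hypothetical names and are not part of the project's spec files.

```ruby
# File names as listed in this README; the helper itself is hypothetical.
EXPECTED_CSVS = %w[udacity_courses.csv indeed_jobs.csv remote_io.csv].freeze

# Returns the expected CSV files that do not yet exist under `root`.
def missing_csvs(root = Dir.pwd)
  EXPECTED_CSVS.reject { |name| File.exist?(File.join(root, name)) }
end

# Example usage from the repository root:
#   puts "Still missing: #{missing_csvs.join(', ')}" unless missing_csvs.empty?
```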
👤 Eri
- GitHub: @errea
- Twitter: @Erreakay
- LinkedIn: Eri Okereafor
Contributions, issues and feature requests are welcome!
Feel free to check the issues page.
Give a ⭐️ if you like this project!
- Microverse
- Nokogiri gem
- HTTParty Parser
- Udacity.com
- Indeed.com
- Remote.io
This project is MIT licensed.