Hold the Vision, Trust the Process.
... a technique used for extracting data from web/websites.
- python
- selenium
- PhantomJS
- beautifulsoap
- requests
- pandas
- tabulate
- Spyder IDE
- Ubuntu 16.4 LTS
- Setup your local environment: Cookbook
❗ I run on Mac OS/Ubuntu so you might have to slightly modify the code to make it work in your env.
-
Go through this for quick insights: Handbook
-
Get hands on: Kick-off
-
Examples:
4.1 Glassdoor_jobs
4.2 Pablo_quotes
This repository explains the rationale for web scraping in python. I have implemented few basic examples using selenium, have a dekko at it! This repo covers approximately 1% of the entire python web scraping. My motive is to get you familiar with the tools that python provides if you forsee your career as a Data Engineer. If you have any suggestions for more commands that should be on this page, let me know or consider submitting a pull request so others can benefit from your work. Thank you very much for reaching out! Please follow if you find it handy and hit ⭐ to get more kick-off repo updates.
📧 Drop In!! Seriously, it'd be great to discuss Technology.
Take risks in your life, If you win, you can lead! If you loose, you can guide! - Swami Vivekananda