GitHub - liv-yaa/googleImageScraper-x: A beautifulsoup powered image scraper from a Google search query

googleImageScraper

Description

A tool that scrapes images from Google Images for the label specified and the number of images to be scraped and dowloaded specified. It then stores them in a local folder with the name of the search query.

Arguments:

Label of the type of images to be scraped (example power lines, dogs, cats, etc.)
Number of images to be scraped and downloaded onto the local machine

Setup

Install requirements:

pip3 install -r requirements.txt
Run on command line with 2 additional arguments, 'query' and 'n' ex:

python3 image_scraper.py pug 4

Stack, libraries

Python
beautifulsoup
requests
os
sys

How it works

Using sys, command line args are parsed, if valid
A search term is generated for google using string formatting with the query name
Using Requests, the page is downloaded via a GET request, with a timer configured to halt the process if it takes too long
Using os, a new directory with the same name as the query is made in the current directory (if it does not already exist)
Using BeautifulSoup, a soup object is created (a list of all html-derived tags) Parsing the BeautifulSoup object allows us to derive just the 'img' tags and any metadata such as alt text
Using os, each image is saved in the named directory, along with its corresponding metadata, in a set of hashmaps including 'alt', 'src', 'size', 'id', 'height', and 'width'

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.gitignore		.gitignore
README.md		README.md
config.py		config.py
image_scraper.py		image_scraper.py
requirements.txt		requirements.txt
tests.py		tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

googleImageScraper

Description

Setup

Stack, libraries

How it works

About

Releases

Packages

Languages

liv-yaa/googleImageScraper-x

Folders and files

Latest commit

History

Repository files navigation

googleImageScraper

Description

Setup

Stack, libraries

How it works

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages