This Python project allows you to extract and download images embedded in HTML content from a specified Excel file. The HTML content is extracted from the second column of the Excel sheet, and all images referenced in the <img> tags are downloaded and saved locally as PNG files.

bhandeystruck/Shopify_Blog_Body_Image_Exporter

Features
HTML Parsing: Extracts <img> tags from HTML content using BeautifulSoup.
Image Downloading: Downloads images from the URLs provided in the src attributes of the <img> tags.
File Naming: Saves images using their original filenames derived from the URL.
Error Handling: Catches and reports errors during image download or save operations.
Excel Integration: Reads an Excel file (.xlsx), extracts the HTML content from the second column, and processes the HTML found in each row.
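
The HTML parsing and file naming features above can be illustrated with a minimal sketch. The function names `extract_image_urls` and `filename_from_url` are hypothetical and are not taken from the project's source. Replacing the original extension with `.png` is an assumption based on the statement that images are saved as PNG files.

```python
import os
from urllib.parse import unquote, urlparse

from bs4 import BeautifulSoup


def extract_image_urls(html: str) -> list[str]:
    """Return the src value of every <img> tag that has a non-empty src."""
    soup = BeautifulSoup(html, "html.parser")
    return [img["src"] for img in soup.find_all("img") if img.get("src")]


def filename_from_url(url: str) -> str:
    """Derive a local filename from the URL path, ignoring any query string."""
    path = unquote(urlparse(url).path)
    stem = os.path.splitext(os.path.basename(path))[0]
    # Assumption: keep the original name but use a .png extension.
    return f"{stem or 'image'}.png"
```

Parsing the URL before taking the basename keeps query strings such as `?v=1` out of the saved filename.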
Requirements
Python 3.x
Pandas: For reading Excel files (pandas needs an Excel engine such as openpyxl to read .xlsx).
BeautifulSoup4: For parsing HTML content.
Pillow (PIL): For saving images as PNG.
Requests: For downloading images from URLs.
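
As a rough sketch of how these libraries might fit together, the following reads the second column with pandas, downloads an image with Requests, and re-encodes it as PNG with Pillow. The function names, the 10-second timeout, and the choice to print failures rather than raise are assumptions for illustration, not the project's actual implementation.

```python
from io import BytesIO
from pathlib import Path

import pandas as pd
import requests
from PIL import Image


def read_html_cells(xlsx_path: str) -> list[str]:
    """Return the non-empty cells of the second column as strings."""
    df = pd.read_excel(xlsx_path)  # reading .xlsx needs an engine such as openpyxl
    return [str(cell) for cell in df.iloc[:, 1].dropna()]


def save_as_png(image_bytes: bytes, dest: Path) -> None:
    """Decode image bytes with Pillow and re-encode them as PNG."""
    with Image.open(BytesIO(image_bytes)) as img:
        img.save(dest, format="PNG")


def download_image(url: str, dest: Path, timeout: float = 10.0) -> bool:
    """Download one image and save it as PNG; report failures instead of raising."""
    try:
        response = requests.get(url, timeout=timeout)
        response.raise_for_status()
        save_as_png(response.content, dest)
        return True
    except (requests.RequestException, OSError) as exc:
        # Pillow raises OSError subclasses for unreadable or unwritable images.
        print(f"Failed to save {url}: {exc}")
        return False
```

Returning a boolean from `download_image` lets a caller continue with the remaining images after a failure, in line with the error handling described in the feature list.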
