-
Notifications
You must be signed in to change notification settings - Fork 0
This Python project allows you to extract and download images embedded in HTML content from a specified Excel file. The HTML content is extracted from the second column of the Excel sheet, and all images referenced in the <img> tags are downloaded and saved locally as PNG files.
bhandeystruck/Shopify_Blog_Body_Image_Exporter
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
This Python project allows you to extract and download images embedded in HTML content from a specified Excel file. The HTML content is extracted from the second column of the Excel sheet, and all images referenced in the <img> tags are downloaded and saved locally as PNG files. Features HTML Parsing: Extracts <img> tags from HTML content using BeautifulSoup. Image Downloading: Downloads images from the URLs provided in the src attributes of the <img> tags. File Naming: Saves images using their original filenames derived from the URL. Error Handling: Catches and reports errors during image download or save operations. Excel Integration: Reads an Excel file (.xlsx), extracts HTML content from the second column, and processes each HTML script. Requirements Python 3.x Pandas: For reading Excel files. BeautifulSoup4: For parsing HTML content. Pillow (PIL): For saving images as PNG. Requests: For downloading images from URLs.
About
This Python project allows you to extract and download images embedded in HTML content from a specified Excel file. The HTML content is extracted from the second column of the Excel sheet, and all images referenced in the <img> tags are downloaded and saved locally as PNG files.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published