🌎 🖥 Supercharge your scraper to extract quality page metadata by parsing JSON-LD data via Python's extruct library.
Updated Dec 4, 2024 - Python
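As a rough illustration of the idea in this description, here is a minimal sketch of pulling JSON-LD metadata out of a page. It is not taken from the repository. The helper `extract_json_ld`, the `_JsonLdParser` fallback class, and `SAMPLE_HTML` are hypothetical names introduced for this example. It calls `extruct.extract` with `syntaxes=["json-ld"]` when extruct is installed, and otherwise falls back to a simplified standard-library parser that only approximates what extruct does.

```python
import json
from html.parser import HTMLParser

try:
    import extruct  # third-party: pip install extruct
except ImportError:  # assumption: fall back to the stdlib if extruct is missing
    extruct = None

# Hypothetical sample page used only for this illustration.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Article",
 "headline": "Hello JSON-LD", "author": {"@type": "Person", "name": "Ada"}}
</script>
</head><body><p>Body text</p></body></html>
"""


class _JsonLdParser(HTMLParser):
    """Simplified fallback: collects <script type="application/ld+json"> payloads."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.items = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            data = json.loads("".join(self._buffer))
            # A script tag may hold one object or a list of objects.
            if isinstance(data, list):
                self.items.extend(data)
            else:
                self.items.append(data)


def extract_json_ld(html, base_url=None):
    """Return the list of JSON-LD objects embedded in an HTML string."""
    if extruct is not None:
        result = extruct.extract(html, base_url=base_url, syntaxes=["json-ld"])
        return result["json-ld"]
    parser = _JsonLdParser()
    parser.feed(html)
    return parser.items


if __name__ == "__main__":
    for item in extract_json_ld(SAMPLE_HTML):
        print(item.get("@type"), "-", item.get("headline"))
```

In a Scrapy spider, the same helper could presumably be called on `response.text` inside a parse callback, with `response.url` passed as `base_url`.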
A site crawler built with Scrapy that stores the scraped data in MongoDB.