A very simple Python package that scrapes the Vinted site to retrieve information about its items.
You can install Vinted Scraper using pip:
pip install vinted_scraper==2.4.0If you are on Python 3.6 you also have to install
dataclasses:pip install dataclasses
We move from requests to httpx to support Async API call. Now, you can await AsyncVintedScraper or AsyncVintedWrapper.
I haven't finish to update all the docs but you can check exampes to understand how they work.
To install the beta version with pip:
pip install vinted_scraper==3.0.0b1Compatible from python 3.7+
For more info about the Beta check the roadmap, and please if you find a bug open a issue!
The package offers the following functions:
search - (gets all the items present on the listing page)
Parameters
name type data type description params optional Dict Query parameters like the pagination and so on
item - (gets the information about an item, and its seller present on the item detail page)
It is currently only working only on the Beta version (see #78), but it frequently throws a 403 error (see #58).
Parameters
name type data type description id required str The unique identifier of the item to retrieve params optional Dict I don't know if they exist
curl - (Perform an HTTP GET request to the given endpoint)
It is currently only working only on the Beta version
Parameters
name type data type description endpoint required str The endpoint to make the request to params optional Dict Query parameters like the pagination and so on
To obtain the scraped data as a vinted_scraper.models.VintedItem, so you can:
import vinted_scraper.VintedScraper
def main():
scraper = VintedScraper("https://www.vinted.com") # init the scraper with the baseurl
params = {
"search_text": "board games"
# Add other query parameters like the pagination and so on
}
items = scraper.search(params) # get all the items
if __name__ == "__main__":
main()VintedScraper returns structured data that are parsed and converted into a vinted_scraper.models.VintedItem object.
If some attributes are None means that it wasn't found in the response, maybe because they are returned from other
API.
Also, I discard some attribute that I thought was useless but feel free to open an issue or a PR to add them.
If you want to manage the JSON response directly, you should use the VintedWrapper object instead of VintedScraper.
Here's the way of how to use it:
import vinted_scraper.VintedWrapper
def main():
wrapper = VintedWrapper("https://www.vinted.com") # init the scraper with the baseurl
params = {
"search_text": "board games"
# Add other query parameters like the pagination and so on
}
items = wrapper.search(params) # get all the items
if __name__ == "__main__":
main()This project is licensed under the MIT License - see the LICENSE file for details.