version: 0.0.1
Description of simple-web-scraper
To install, run:
syncano-cli add simple-web-scraper
Scrapes a webpage and returns the content matched by XPath or CSS selectors, in XML or JSON format.
name | type | description | example | long_description |
---|---|---|---|---|
url | string | Address of the webpage to scrape | https://funnycatsgallery.com/ | |
selectorType | string | Type of selector used for scraping | xpath | Content can be scraped using either XPath or CSS selectors. |
extract | string | Format in which the scraped data is returned | json | Data can be returned in either JSON or XML format. |
config | object | Maps each output field name to the selector for the content you want to scrape | { "config": { "title": "//*[@id='content_box']/article/header/h2/a/text()" } } | |
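
The four parameters above could be validated and packed into a GET query string roughly as follows. This is only a sketch: `build_scraper_query` is a hypothetical helper, not part of the socket, and sending `config` as a JSON-encoded string is an assumption, since this page does not say how the object is serialized. The endpoint URL is not given here, so the sketch stops at the query string.

```python
import json
from urllib.parse import urlencode

# Values taken from the parameter table above.
ALLOWED_SELECTOR_TYPES = {"xpath", "css"}
ALLOWED_EXTRACT_FORMATS = {"json", "xml"}


def build_scraper_query(url, selector_type, extract, config):
    """Hypothetical helper: validate the documented parameters and
    return them as a URL-encoded query string."""
    if selector_type not in ALLOWED_SELECTOR_TYPES:
        raise ValueError("selectorType must be 'xpath' or 'css'")
    if extract not in ALLOWED_EXTRACT_FORMATS:
        raise ValueError("extract must be 'json' or 'xml'")
    if not isinstance(config, dict) or not config:
        raise ValueError("config must be a non-empty object")

    params = {
        "url": url,
        "selectorType": selector_type,
        "extract": extract,
        # Assumption: the config object travels as a JSON string.
        "config": json.dumps(config),
    }
    return urlencode(params)


query = build_scraper_query(
    "https://funnycatsgallery.com/",
    "xpath",
    "json",
    {"title": "//*[@id='content_box']/article/header/h2/a/text()"},
)
```

Rejecting unknown `selectorType` and `extract` values before sending keeps obvious mistakes from reaching the socket at all.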
mimetype: application/json
{
"message": "Webpage Scraped.",
"statusCode": 200,
"data": { "title": [] }
}
{
"message": "Make sure to use GET request method for scraping webpage",
"statusCode": 400,
"data": {}
}
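
A caller might tell the two responses above apart by checking `statusCode`. The sketch below assumes the body arrives as a JSON string shaped like the samples. `parse_scraper_response` is a hypothetical name, and the XML output format is not handled.

```python
import json


def parse_scraper_response(body):
    """Hypothetical helper: return the scraped data from a JSON response
    body, or raise if the socket reported a failure."""
    payload = json.loads(body)
    status = payload.get("statusCode")
    if status != 200:
        # Surface the socket's own message, e.g. the GET-method hint.
        raise RuntimeError(f"Scrape failed ({status}): {payload.get('message')}")
    return payload.get("data", {})


success_body = '{"message": "Webpage Scraped.", "statusCode": 200, "data": {"title": []}}'
data = parse_scraper_response(success_body)
```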