A JavaScript web scraper that extracts species names from http://www.iucnredlist.org/search and saves them to a comma-separated text file.
The remote resource iucnredlist.org currently does not support Cross-Origin Resource Sharing (CORS), so a browser with default security settings blocks the scraper from reading its responses. Be sure to use a web browser in which this restriction can be disabled.
For development purposes, the recommended browser is Google Chrome started with the --disable-web-security flag. See the instructions below on how to disable Chrome security on Windows.
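To illustrate why the missing CORS support matters, here is a deliberately simplified sketch of the check a browser applies before letting page script read a cross-origin response. The function name `isReadableCrossOrigin` and the `localhost` origin are assumptions made for illustration; real browsers also handle preflight requests and credentials, which this sketch ignores.

```javascript
// Simplified model of the browser's CORS check (illustrative only, not project code).
// pageOrigin:      origin of the page running the script, e.g. "http://localhost:8080"
// resourceUrl:     URL being fetched
// responseHeaders: plain object with lower-cased response header names
function isReadableCrossOrigin(pageOrigin, resourceUrl, responseHeaders) {
  const resourceOrigin = new URL(resourceUrl).origin;

  // Same-origin requests are never subject to CORS.
  if (resourceOrigin === pageOrigin) return true;

  // Cross-origin: the server must opt in via Access-Control-Allow-Origin.
  const allow = responseHeaders['access-control-allow-origin'];
  return allow === '*' || allow === pageOrigin;
}

// A server that sends no Access-Control-Allow-Origin header is not readable
// from a page hosted on another origin (the localhost origin is hypothetical).
console.log(
  isReadableCrossOrigin('http://localhost:8080', 'http://www.iucnredlist.org/search', {})
);
```

Starting Chrome with web security disabled skips this check entirely, which is why it should only be done for development.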
- Host this project folder and/or its contents on an HTTP web server.
- Enter the starting (From) and ending (To) page indices of the range to scrape. The maximum page is currently set to 35.
- Press the Scrape button to start fetching data. Press the Stop button at any time to stop.
- Once scraping has finished or has been stopped, press the Download button to download the list of species names as a comma-separated text file.
- Press the Reset button to restart.
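The workflow above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: every function name (`pageRange`, `scrapePages`, `toCommaSeparated`, `downloadText`), the injected `fetchPage` callback, the assumption that pages start at 1, and the file name `species.txt` are hypothetical. Only the maximum page of 35 comes from the text.

```javascript
// Hypothetical helpers sketching the From/To -> Scrape/Stop -> Download flow.
const MAX_PAGE = 35; // maximum page stated above

// Validate the From/To inputs and return the page indices to scrape.
// Assumes page numbering starts at 1.
function pageRange(from, to, maxPage = MAX_PAGE) {
  if (!Number.isInteger(from) || !Number.isInteger(to)) {
    throw new RangeError('From and To must be integers');
  }
  if (from < 1 || to > maxPage || from > to) {
    throw new RangeError(`Expected 1 <= From <= To <= ${maxPage}`);
  }
  const pages = [];
  for (let page = from; page <= to; page += 1) pages.push(page);
  return pages;
}

// Scrape pages one at a time. `fetchPage(page)` is an injected async function
// that resolves to an array of species names; `shouldStop()` is polled before
// each page so a Stop button can end the loop early.
async function scrapePages(pages, fetchPage, shouldStop = () => false) {
  const names = [];
  for (const page of pages) {
    if (shouldStop()) break;
    names.push(...(await fetchPage(page)));
  }
  return names;
}

// Join the collected names into one comma-separated string,
// trimming whitespace and dropping empty entries.
function toCommaSeparated(names) {
  return names
    .map((name) => name.trim())
    .filter((name) => name.length > 0)
    .join(',');
}

// Browser-only: offer the text as a file download via a temporary link.
function downloadText(text, fileName = 'species.txt') {
  const url = URL.createObjectURL(new Blob([text], { type: 'text/plain' }));
  const link = document.createElement('a');
  link.href = url;
  link.download = fileName;
  link.click();
  URL.revokeObjectURL(url);
}
```

A Scrape button handler could then call `scrapePages(pageRange(from, to), fetchPage, () => stopRequested)`, with `stopRequested` being a flag set by the Stop button, and pass the result through `toCommaSeparated` to `downloadText`.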
Type the following in a Windows command line:
rem "rem" stands for remark (a comment)
rem Kill all Chrome processes
taskkill /F /IM chrome.exe
start chrome --disable-web-security --user-data-dir
# WARNING!
Once Chrome has been restarted with the security-disabling flags, it keeps running with security disabled until you close it and start it again in the normal manner. Continuing to use Chrome with security disabled is dangerous, so be sure to restore its original settings.
To restore normal security, type the following in the Windows command line:
rem "rem" stands for remark (a comment)
rem Kill all Chrome processes
taskkill /F /IM chrome.exe
Then you can start using Chrome normally again.
Date created: 20180518
Date Modified: 20180521