HTML retrieval over HTTP GET requests for scraping dynamic, client-side web pages, as an alternative to traditional static caching protocols.

EstebanMqz/Web-scraper

Formatted / indexed web scraper


Web-scraping tool that extracts metadata from web pages using HTTP GET requests.
It is intended for complete structural inspection of a page's attributes, at the granularity of its source code.
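
As a rough illustration of the HTTP GET step, the sketch below fetches a page's markup with curl. The function name `fetch_html` is hypothetical and is not taken from html-extractor.sh; the repository's script may work differently.

```shell
#!/usr/bin/env bash
# Hypothetical helper (assumption): not taken from html-extractor.sh.
# Sends a single HTTP GET request and prints the response body to stdout.
fetch_html() {
  local url="$1"
  # -f: fail on HTTP errors, -s: no progress meter,
  # -S: still print error messages, -L: follow redirects
  curl -fsSL "$url"
}

# Example (requires network access): count the lines of markup returned.
# fetch_html "https://estebanmqz.github.io/EstebanMqz/html/Resume.html" | wc -l
```

Note that a plain GET returns the markup as the server sends it; scripts embedded in the page are included as text but are not executed.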



How the technique differs from those provided by web-development tools:


Traditional browser methods (View Source, and Save As with "Complete HTML", "Single HTML", or "HTML only") generally provide unreliable or incomplete information about a website, particularly when it relies on dynamic, client-side scripts.

Usage:


Terminal (.sh):

    $ ./html-extractor.sh
    Enter a URL: https://estebanmqz.github.io/EstebanMqz/html/Resume.html
    Do you want to extract the raw code to a temporary file? (Y/N): Y
    Enter a filename to save the raw code: Resume
    Raw code extracted to Resume.html opening..
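
The prompt flow in this session could be sketched roughly as follows. This is a hypothetical re-creation based only on the terminal output shown; the function name `extract_html` and the use of curl are assumptions, and the step that opens the saved file is left out.

```shell
#!/usr/bin/env bash
# Hypothetical re-creation of the prompt flow (assumption);
# the real html-extractor.sh may be implemented differently.
extract_html() {
  local url answer name
  # Prompts go to stderr so that stdout carries only results.
  printf 'Enter a URL: ' >&2
  read -r url
  printf 'Do you want to extract the raw code to a temporary file? (Y/N): ' >&2
  read -r answer
  case "$answer" in
    [Yy]*)
      printf 'Enter a filename to save the raw code: ' >&2
      read -r name
      # Save the GET response body as <name>.html; stop if the request fails.
      curl -fsSL "$url" -o "${name}.html" || return 1
      echo "Raw code extracted to ${name}.html"
      ;;
    *)
      # Any other answer: print the markup to the terminal instead.
      curl -fsSL "$url"
      ;;
  esac
}

# To run the prompts interactively, call the function:
# extract_html
```

Reading the answers from standard input also allows the function to be driven non-interactively, for example `printf '%s\n' "$url" Y Resume | extract_html`.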

Note: Use this tool in compliance with open-source licenses and users' privacy rights,
and in accordance with international and local laws such as the GDPR.
