Collecting broken links of an entire domain #25

Open
BAG-OF-CHIPS-XX opened this issue Apr 12, 2018 · 1 comment
Comments

@BAG-OF-CHIPS-XX

Can someone please help me understand this module? When I type "pylinkvalidate.py -P http://www.example.com/" it only scrapes that one URL. Is this script capable of collecting the broken links of an entire domain? I am not the best with Python :/

@kowalcj0

Yes @cdangerdouglas, it can scan an entire site. Just have a look at the documentation and play with the parameter values. Here's a config that works for me:

pylinkvalidate.py \
    --progress \
    --timeout=35 \
    --depth=2 \
    --workers=10 \
    --types=a \
    --strict \
    --test-outside \
    --parser=lxml \
    --header="User-Agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; FSL 7.0.6.01001)" \
    --ignore="comma,separated,list,of,domains,or,prefixes,that,should,be,ignored,during,the,scan" \
    http://www.example.com
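
The flag that matters most for your question is --depth, which controls how many levels of links the crawler follows from the start URL; --workers sets the number of parallel threads, --types=a restricts the check to anchor links, and --test-outside also tests links that point outside the start domain (that's my reading of the flag names; pylinkvalidate.py --help has the authoritative descriptions). A minimal sketch under those assumptions, stripped down to the whole-site essentials:

# Follow links up to 2 levels deep from the start URL, check pages with
# 10 parallel workers, and print progress as each page is tested.
pylinkvalidate.py \
    --progress \
    --depth=2 \
    --workers=10 \
    http://www.example.com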
