forked from merwin-asm/OpenCrawler
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathopencrawler.1
75 lines (53 loc) · 1.1 KB
/
opencrawler.1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
.TH OpenCrawler 1
.SH NAME OpenCrawler v 1.0.0
.SH DESCRIPTION
.B OpenCrawler v 1.0.0
Is an program for crawling through websites.
.TP
An open source crawler/spider. LICENSE - MIT.
.SH COMMANDS
.TP
.BR help
Get info about the commands
.TP
.BR v
Get the version of open crawler
.TP
.BR crawl
Starts up the normal crawler
.TP
.BR forced_crawl <website>
Forcefully crawl a website/Make the crawler crawl the website
.TP
.BR crawled_status
Shows the amount of data in DB , etc
.TP
.BR configure
Write / reWrite The config file
.TP
.BR connection-tree <website> <no of layers>
Makes a tree of websites connected to it, layers by default is 2
.TP
.BR check_html <website>
Checks if a website respond with html content
.TP
.BR crawlable <website>
Checks if a website is allowed to be crawled
.TP
.BR dissallowed <website>
Lists the websites not allowed to be crawled
.TP
.BR re-install
Re installs the Open Crawler
.TP
.BR update
Updates the open crawler
.TP
.BR install-requirements
Installs requirements for open crawler
.TP
.BR search <search>
Search from the crawled data
.TP
.BR fix_db
Tools to fix the DB