Commit 71b67c1

Merge remote-tracking branch 'origin/master'
Merge branch
2 parents 7659981 + 3af6468 commit 71b67c1

2 files changed: 5 additions, 3 deletions

README.md

Lines changed: 5 additions & 3 deletions
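@@ -1,12 +1,14 @@
 # Project-01-TianyanSpider
 use the selenium and phantomjs tools implements crawl tianyan companies datas and store into mongodb.
-##### Hisory Version
+#### Hisory Version
 
 + 2018-08-07
 
 Implemented: basic company information, including website, registration date, registered address, legal representative, business scope, company size, patents, software copyrights, and bidding information.
 
 Limitations: crawling too fast or fetching too much data easily triggers Tianyancha's anti-crawling measures, most commonly CAPTCHA challenges and identity verification prompts.
 
-Sample data: ![2018-08-07-data-example]()
-
+Sample data:
+
+![2018-08-07-data-example1](./images/2018-08-08_092058.png)
+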

images/2018-08-08_092622.png

-125 KB
Binary file not shown.
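
The README's limitation note says that crawling too fast tends to trigger anti-crawling measures. One common mitigation is to enforce a randomized minimum gap between page loads. The sketch below is not taken from this repository: the `Throttle` class, its parameters, and the default delays are all hypothetical, and it only illustrates the idea using the Python standard library.

```python
import random
import time


class Throttle:
    """Hypothetical helper: keep a minimum, slightly randomized gap between calls."""

    def __init__(self, min_delay=2.0, jitter=1.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_delay = min_delay  # seconds that must pass between requests
        self.jitter = jitter        # extra random delay in [0, jitter] seconds
        self._clock = clock         # injectable so the class can be tested
        self._sleep = sleep
        self._last = None           # time of the previous request, if any

    def wait(self):
        """Sleep until the gap since the previous call is long enough.

        Returns the number of seconds actually slept.
        """
        now = self._clock()
        slept = 0.0
        if self._last is not None:
            target = self.min_delay + random.uniform(0.0, self.jitter)
            remaining = target - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                slept = remaining
        self._last = self._clock()
        return slept
```

A crawler would call `throttle.wait()` before each page load, for example before each Selenium `driver.get(url)`. The delay values are assumptions that would need tuning, and slowing down may reduce but does not guarantee avoiding CAPTCHA or identity checks.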
