weiboSearchCrawler

A distributed Sina Weibo Search spider based on Scrapy, Redis and MongoDB. And for the crawled page, extract user info, forward info and pictures and so on.

##Reference scrapy-redis

weibosearch

weibo_login

Installation

$ sudo apt-get install mongodb
$ sudo apt-get install redis-server
$ sudo apt-get install pymongo
$ sudo pip install -r requirements.txt

Usage

put your keywords in items.txt(just for test for me). Also, you can read keywords from mysql.
scrapy crawl weibosearch -a username=your_weibo_account -a password=your_weibo_password
you can test the process of parsing locally, see weibosearch/spiders/tests.py for more
add another spider with scrapy crawl weibosearch -a username=another_weibo_account -a password=another_weibo_password

=======

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
weiboSearchCrawler		weiboSearchCrawler
.gitignore		.gitignore
README.md		README.md
items.txt		items.txt
requirements.txt		requirements.txt
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

weiboSearchCrawler

Installation

Usage

weiboSearchCrawler

About

Releases

Packages

Languages

ustcck/weiboSearchCrawler

Folders and files

Latest commit

History

Repository files navigation

weiboSearchCrawler

Installation

Usage

weiboSearchCrawler

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages