A powerful Scrapy spider that makes an all-out effort to collect information from PTT and store it in a database.
- Python 3
- MongoDB
- Make sure `mongod` is running on your system.
- Run the spider with the `scrapy` command: `scrapy crawl ptt`
- (Optional) Export the collected data to JSON with `mongoexport`, e.g.: `mongoexport --db ptt --collection mobilecomm --out mobilecomm.json`
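
The steps above can also be chained from a small Python script. The following is only a sketch, not part of the project: the helper names (`mongod_is_listening`, `crawl_command`, `export_command`, `run_steps`) are hypothetical, and it assumes `mongod` listens on MongoDB's default port 27017 on localhost, that `scrapy` and `mongoexport` are on your `PATH`, and that the script is started from the Scrapy project directory. The database and collection names are taken from the export example above.

```python
import socket
import subprocess


def mongod_is_listening(host="localhost", port=27017, timeout=1.0):
    """Return True if something accepts TCP connections on host:port.

    Assumption: mongod uses its default port 27017. This only checks that
    the port is open, not that the listener is really MongoDB.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def crawl_command(spider="ptt"):
    """Build the argument list for `scrapy crawl <spider>`."""
    return ["scrapy", "crawl", spider]


def export_command(db="ptt", collection="mobilecomm", out=None):
    """Build the argument list for the mongoexport call shown above."""
    out = out or f"{collection}.json"
    return ["mongoexport", "--db", db, "--collection", collection, "--out", out]


def run_steps(export=True):
    """Check mongod, run the spider, then optionally export to JSON."""
    if not mongod_is_listening():
        raise RuntimeError("mongod does not seem to be listening on localhost:27017")
    # check=True raises CalledProcessError if a command exits with an error.
    subprocess.run(crawl_command(), check=True)
    if export:
        subprocess.run(export_command(), check=True)
```

Calling `run_steps()` performs all three steps in order; `run_steps(export=False)` skips the optional export. Passing the commands as argument lists rather than a single shell string avoids shell quoting issues if you later substitute other collection or file names.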