WCTT (Web Crawling system based on Tag path and Text appearnce frequency)
- Download and install Node.js, Python, JDK in this LINK.
- Download ZIP package and unzip.
- Execute Script file.Cancel changes
-
If your OS is Windows, execute install.bat.
-
If your OS is Linux, execute install.sh.
- Start Execution file.
-
If you OS is Windows, execute start.bat.
-
If you OS is Linux, execute start.sh.
-
Then, you can see the browser that you can enroll and control you work.
Node.js Forever Process Kill
netstat -ao | findstr 3001
taskkill /F /PID [PROCESS ID]
ZOOKEEPER
zookeeper.properties
dataDir=%USERPROFILE%\tmp\zookeeper
Kafka
server.properties
log.dirs=%USERPROFILE%\tmp\kafka-logs