You will need the following things properly installed on your computer.
git clone https://github.com/smorzhov/comment_classifier
Remember that Docker container has the Python version 2.7.12!
- Download and unrar (unzip) test and train data into
~src/data
directory. - Download pretrained word2vec model, glove_6B model, globe_840B model and FastText model. Unpack them into
~src/data/raw
directory. - If you are planning to use nvidia-docker, you need to build nvidia-docker image first. Otherwise, you can skip this step
Run container
nvidia-docker build -t sm_keras_tf:gpu .
nvidia-docker run -v $PWD/src:/comment_classifier -dt --name tcc sm_keras_tf:gpu /bin/bash
- Training
nvidia-docker exec tcc python train.py [-h]
You can add some custom stop words. They must be placed in ~src/data/stopwords.txt
file (one word per line).
You can create some files with useful information about training data
nvidia-docker exec tcc python info.py