
Contradictory, My Dear Watson

Contradictory, My Dear Watson is a Kaggle competition focused on Natural Language Inference (NLI). The goal is to predict whether one sentence entails, contradicts, or is unrelated to the other.
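In other words, each example is a premise/hypothesis pair labelled with one of three classes. A minimal illustration in Python; the integer mapping 0 = entailment, 1 = neutral, 2 = contradiction follows the competition's data description and is stated here as an assumption:

# Hypothetical premise/hypothesis pairs illustrating the three NLI classes.
# Assumed label mapping (per the competition's data description):
# 0 = entailment, 1 = neutral, 2 = contradiction.
examples = [
    ("He is sleeping.", "He is not awake.", 0),  # entailment
    ("He is sleeping.", "He likes tea.", 1),     # neutral (unrelated)
    ("He is sleeping.", "He is awake.", 2),      # contradiction
]
for premise, hypothesis, label in examples:
    print(f"{premise!r} / {hypothesis!r} -> {label}")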

Prerequisites

Native GPU support has not landed in docker-compose yet. For now, install the patched versions of docker-py and docker-compose mentioned here:

pip install --user git+https://github.com/docker/docker-py.git
pip install --user git+https://github.com/yoanisgil/compose.git@device-requests

Getting Started

  1. Build the Docker image:
$ COMPOSE_API_VERSION=auto docker-compose up --build -d
  2. The Docker container is already running after the build. Next time, the container can be started and stopped as follows:
# Start docker container
$ docker-compose start

# Stop docker container
$ docker-compose stop
  3. Download the data (a quick sanity check is sketched below):
$ kaggle competitions download -c contradictory-my-dear-watson -p data/raw/contradictory-my-dear-watson
$ unzip data/raw/contradictory-my-dear-watson/contradictory-my-dear-watson.zip -d data/raw/contradictory-my-dear-watson
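
After unzipping, the competition files (e.g. train.csv and test.csv) end up under data/raw/contradictory-my-dear-watson. Below is a quick sanity check of the training data with pandas; the column names used are taken from the competition's data description and are assumptions, not something verified against this repository:

# Quick look at the downloaded training data. Column names
# (premise, hypothesis, lang_abv, label) are assumed from the
# competition's data description.
import pandas as pd

train = pd.read_csv("data/raw/contradictory-my-dear-watson/train.csv")
print(train.shape)
print(train[["premise", "hypothesis", "lang_abv", "label"]].head())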

Training

To train a new model, create a run configuration and start training:

$ python -m contradictory_my_dear_watson train @runs/<run.conf>
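
The @runs/<run.conf> argument suggests that the CLI reads its flags from a file, in the style of argparse's fromfile_prefix_chars mechanism; whether this project uses exactly that mechanism, and which options it accepts, is an assumption here. A minimal, self-contained sketch of how such an @-prefixed file is expanded, using purely illustrative flag names (--epochs, --batch-size) rather than this project's actual options:

# Sketch of argparse's @file mechanism (assumed to be what the CLI relies on).
# The flags below are illustrative only, not the project's real options.
import argparse

parser = argparse.ArgumentParser(fromfile_prefix_chars="@")
parser.add_argument("--epochs", type=int, default=1)
parser.add_argument("--batch-size", type=int, default=32)

# Given a file runs/example.conf with one token per line, e.g.
#   --epochs
#   10
#   --batch-size
#   64
# the @-prefixed argument is expanded into those tokens:
args = parser.parse_args(["@runs/example.conf"])
print(args.epochs, args.batch_size)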

Evaluation

To evaluate a trained model on the test dataset, execute:

$ python -m contradictory_my_dear_watson evaluate @runs/<run.conf> --checkpoint-path models/<model.pt> --test-data-path data/<test.csv>

Results

Due to time constraints, I used only a baseline BiLSTM [1] and a multilingual BERT [2] model. The BiLSTM model was trained on English data only. I did not do any text preprocessing, pretraining, data augmentation, or hyperparameter optimization.

The BiLSTM model achieved 51.53% accuracy and the multilingual BERT model 67%. I know, not great, not terrible, but time ...

References

[1] Conneau, A., et al. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data. arXiv preprint arXiv:1705.02364 (2018).

[2] Devlin, J., et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (2019), Association for Computational Linguistics, pp. 4171–4186.