This repo contains the code for our paper:
Better Language Model with Hypernym Class Prediction
He Bai, Tong Wang, Alessandro Sordoni, Peng Shi
ACL 2022
If you are not using Docker, please make sure your PyTorch and CUDA versions match those in our Dockerfile,
and also install the Python packages listed in the Dockerfile.
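If you do use Docker, a minimal sketch of building and entering the container (the image tag hypernym-lm is our choice, not from the repo; GPU access requires the NVIDIA Container Toolkit):

```bash
# Build the image from the repo's Dockerfile (the tag is hypothetical)
docker build -t hypernym-lm .
# Start an interactive container with GPU access and the repo mounted
docker run --gpus all -it -v "$(pwd)":/workspace hypernym-lm bash
```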
Run ./get_data.sh
to download and unzip the WikiText-103 dataset automatically.
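For example:

```bash
# Download and extract the WikiText-103 dataset
chmod +x get_data.sh
./get_data.sh
```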
Download the arXiv dataset manually following this link.
arxiv_data.py
is a script for splitting the data.
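A sketch of the expected usage, assuming the script reads the manually downloaded arXiv data and writes out the splits (the flag names below are hypothetical; check the script's argument parser for the real interface):

```bash
# Hypothetical flags; see arxiv_data.py for the actual arguments.
python arxiv_data.py --input <path-to-downloaded-arxiv-data> --output data/arxiv
```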
We list all training commands in the train.sh
file.
For the large model, 8×32GB of GPU memory is required, or 4×40GB with gradient accumulation steps set to 2.
For the base and small models, 4×32GB GPUs are enough.
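For example, pick the command for your model size from train.sh and run it, adjusting the GPU-related options to your hardware first (the gradient-accumulation flag name is whatever train.sh defines, not something fixed here):

```bash
# Run the training commands listed in train.sh;
# on 4x40GB GPUs, set the gradient-accumulation option in the chosen command to 2.
bash train.sh
```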
The training commands above automatically evaluate the best validation model after training finishes. If you want to run the tests yourself, commenting out the "--do_train" argument skips the training stage and runs evaluation and testing directly.
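A minimal sketch of the two modes, assuming a command shape like the ones in train.sh (the script name main.py and the placeholder arguments are hypothetical; only --do_train comes from this README):

```bash
# Hypothetical command shape; the real script name and flags live in train.sh.
# Train, then automatically evaluate the best validation checkpoint:
python main.py --do_train <other-args-from-train.sh>
# Skip training; run evaluation and testing directly on the saved checkpoint:
python main.py <other-args-from-train.sh>
```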