Homework for NLP course at University of Maryland
The homeworks use the dataset of questions. You can use the Makefile to download the files:
make data/qanta.train.json
The above data can also be used for the project since the files contain the quizbowl dataset questions.
You can also download the supporting paragraphs or sentences corresponding to every sentence of a quizbowl question. This is particularly helpful for developing reading comprehension style systems on quizbowl.
For the page, paragraph index, sentence index, and correct answer span information:
make data/qanta.train.evidence.json
For the page, paragraph index, sentence index, and sentence text information:
make data/qanta.train.evidence.text.json
I assume that you have pytorch, sklearn, and nltk installed. It's easy to do this with Anaconda or pip.