For a description of the data set please see the corpus website.
If you want to reproduce the experiments, see the experiments folder README.
The software in this Git repository's master branch is available under the MIT license (see LICENSE.txt).
Note however that the data set itself, available from the corpus website, is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.