This repository contains the corpus described in:
- Rhetoric, Logic, and Dialectic: Advancing Theory-based Argument Quality Assessment in Natural Language Processing
  by Anne Lauscher, Lily Ng, Courtney Napoles, and Joel Tetreault
  Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020)
- Creating a Domain-diverse Corpus for Theory-based Argument Quality Assessment
  by Lily Ng, Anne Lauscher, Joel Tetreault, and Courtney Napoles
  Proceedings of the 7th Workshop on Argument Mining (ArgMining 2020)
If you use this corpus in your research, please include the following citation:
@inproceedings{lauscher-etal-2020-rhetoric,
title = "Rhetoric, Logic, and Dialectic: Advancing Theory-based Argument Quality Assessment in Natural Language Processing",
author = "Lauscher, Anne and Ng, Lily and Napoles, Courtney and Tetreault, Joel",
booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
month = dec,
year = "2020",
address = "Barcelona, Spain (Online)",
publisher = "International Committee on Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.coling-main.402",
pages = "4563--4574",
}
The GAQCorpus contains argument quality annotations of arguments selected from four underlying sources:
- L6 - Yahoo! Answers Comprehensive Questions and Answers version 1.0
- Internet Argument Corpus v2
- Yelp Open Dataset
- Cornell ChangeMyView Data v1.0
These datasets are all available free of charge, provided you request them from the original sources and agree to their respective license terms. Once you have gained access to the first three corpora listed above, please forward the confirmations to Courtney Napoles (courtney.napoles@grammarly.com), along with your affiliation and a short description of how you will use the data, and we will provide access to the GAQCorpus. Please let us know if you have any questions.
Author contact information:
anne@informatik.uni-mannheim.de
courtney.napoles@grammarly.com
jtetreault@dataminr.com
lily.ng@grammarly.com