This repo contains some test for a Text Classification based on models trained with the Weka Explorer. The code however is does not work properly at the moment.
Uses Data by: LI, Xin; ROTH, Dan. Learning question classifiers. In: Proceedings of the 19th international conference on Computational linguistics-Volume 1. Association for Computational Linguistics, 2002. S. 1-7. Link: http://cogcomp.org/Data/QA/QC/
Few comments on how to setup this repo:
- Install Maven (https://maven.apache.org/)
- Install the snowball-stemmers dependency into local maven Repository:
2.1. Download newest snowball-stemmers weka package or install using the jar in the lib/ folder
2.2.
mvn install:install-file -Dfile=snowball-stemmers.jar -DgroupId=org.tartarus.snowball -DartifactId=snowball-stemmers -Dversion=1.0.2 -Dpackaging=jar
- Build the workspace dependencies
- Run de.qaass.classifier.TextClassTest