Extract lists and tables from Wikipedia and add their information to DBpedia.
Make sure you have Java 1.8, Scala, and sbt installed.
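A quick way to check the prerequisites is sketched below. It assumes the tools are exposed on your `PATH` under the command names `java`, `scala`, and `sbt`; the helper name `missing_tools` is made up for this example and is not part of the repository.

```shell
# Hypothetical helper: print every given command that is not found on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

missing=$(missing_tools java scala sbt)
if [ -n "$missing" ]; then
  # Left unquoted on purpose so the names are printed on one line.
  echo "Missing prerequisites:" $missing
else
  echo "All prerequisites found; check that 'java -version' reports 1.8."
fi
```

The check only confirms that the commands exist; it does not verify the Java version for you.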
- Clone the repository.
- Install the Jena CLI:
  - On OS X you can run `brew install jena`.
  - On other platforms you need to install it as described here.
- Then run `scripts/loadDumps.sh`. Optionally, you can update the preloaded typeCounts with `scripts/typeCount.sh`.
- Download or create a wiki-markup XML dump. Downloads from Special:Export work just fine.
- Convert it to a JSON dump with `scripts/convert.sh`.
- Copy `src/main/resources/application.conf-default` to `src/main/resources/application.conf`. In the copy, change the input filename to match your generated dump file; you can also adjust the parameters of the algorithm.
- To start the application, run `sbt run` and choose `GenerateTypes` as the main class.
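The scripted part of the steps above can be sketched as a small shell script. This is only a rough outline under several assumptions: it is run from the repository root, Jena and the other prerequisites are already installed, and the scripts are called without arguments (any arguments they may expect are not covered here). The `run` and `setup` helpers and the `DRY_RUN` variable are invented for this sketch. By default the commands are only printed, so you can review them first.

```shell
# Hypothetical wrapper: print each command by default;
# set DRY_RUN=0 to actually execute it.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '+ %s\n' "$*"
  else
    "$@"
  fi
}

setup() {
  conf_dir=src/main/resources

  run scripts/loadDumps.sh || return 1
  # Optional: refresh the preloaded typeCounts.
  # run scripts/typeCount.sh || return 1

  # Manual step: download or create the wiki-markup XML dump before converting.
  run scripts/convert.sh || return 1

  # Put the default configuration in place; edit the copy afterwards.
  run cp "$conf_dir/application.conf-default" "$conf_dir/application.conf" || return 1
}

setup
```

After editing `application.conf`, start the application with `sbt run` and choose `GenerateTypes` when sbt asks for the main class.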