Container-ized GeoTopicParser-Enabled Apache Tika Server with Lucene Geo-Gazetteer.
Note that the Alpha Version is not working
I recommend this method -- building takes a bit.
Pull the latest stable version of Apache Tika, version 1.25:
docker pull fryto/gtp-tika:latest
or
docker pull fryto/gtp-tika:1.25
Pull the alpha version of Apache Tika, version 2.0.0:
docker pull fryto/gtp-tika:alpha
or
docker pull fryto/gtp-tika:2.0
Run with just the Tika Server accessible by host (using latest image as example):
docker run -d -p 127.0.0.1:9998:9998 fryto/gtp-tika:latest
Run with both the Tika Server and Lucene Geo-Gazetteer Server accessible by host (using latest image as example):
docker run -d -p 127.0.0.1:9998:9998 -p 127.0.0.1:8765:8765 fryto/gtp-tika:latest
Build the latest stable version (1.25) of the Apache Tika Server:
docker build latest/ --tag gtp-tika
Within the latest stable version build, there is an optional build argument "tika_version" that defaults to "1.25". This can be changed to specify the older versions of the Apache Tika Server.
docker build latest/ --build-arg "tika_version=1.22" --tag gtp-tika
Build the alpha version (2.0.0) of the Apache Tika Server:
docker build alpha/ --tag gtp-tika
Run with just the Tika Server accessible by host:
docker run -d -p 127.0.0.1:9998:9998 gtp-tika
Run with both the Tika Server and Lucene Geo-Gazetteer Server accessible by host:
docker run -d -p 127.0.0.1:9998:9998 -p 127.0.0.1:8765:8765 gtp-tika
Test that the GeoTopicParser is working as expected by parsing the provided "polar.geot" file (from the GeoTopicParser-Utils repo):
curl -T polar.geot -H "Content-Disposition: attachment; filename=polar.geot" http://localhost:9998/rmeta
Expected output:
[
{
"Content-Type":"application/geotopic",
"Geographic_LATITUDE":"39.76",
"Geographic_LONGITUDE":"-98.5",
"Geographic_NAME":"United States",
"Optional_LATITUDE1":"27.33931",
"Optional_LONGITUDE1":"-108.60288",
"Optional_NAME1":"China",
"X-Parsed-By":[
"org.apache.tika.parser.DefaultParser",
"org.apache.tika.parser.geo.topic.GeoParser"
],
"X-TIKA:parse_time_millis":"1634",
"resourceName":"polar.geot"
}
]