A doc searcher of the documents on the local host that is based on: Tika, ElasticSearch and Kibana
- Install Elasticsearch with Dockeredit
Testing:curl -X GET "localhost:9200/_cat/nodes?v&pretty"
Java REST Client - Installing Elasticsearch
See the gradle taskinstall
under elasticsearch-server
UI dashboard for Elastic Search
- Install Kibana with Dockeredit
- Insalling Kibana
See the gradle taskinstall
under kibana-server
See TikaOCR
See Recursive Metadata and Conten
This is a Spring Boot application the main tasks of that are:
- scanning every 1 minute the files in the special folder
- extracting a content of the files via Tika+OCR server API
- storing the metadata and the content of the files in the Elasticsearch
./gradlew clean build
./gradlew clean build; docker-compose up --force-recreate --build
- Prune unused Docker objects:
docker system prune -f
- To clear containers:
docker rm -f $(docker ps -a -q)
- To clear images:
docker rmi -f $(docker images -a -q)
- To clear volumes:
docker volume rm $(docker volume ls -q)
- To clear networks:
docker network rm $(docker network ls | tail -n+2 | awk '{if($2 !~ /bridge|none|host/){ print $1 }}')