Heroku app using Tesseract OCR written in Python and based on the Flask web microframework. Only English and Finnish language are supported.
In addition to the default heroku-buildpack-python, this app also requires the custom heroku-buildpack-tesseract.
Heroku natively supports multiple buildpacks per app.
Before deploying the app, you should setup the base buildpack and add the custom buildpack to the app configuration:
heroku buildpacks:set heroku/python
heroku buildpacks:add https://github.com/matteotiziano/heroku-buildpack-tesseract
Let us assume you have deployed this app in Heroku and you called it your-heroku-app
.
The app provides a test client https://your-heroku-app.herokuapp.com/test
and a REST method https://your-heroku-app.herokuapp.com/process
: the client provides the input file (image or PDF), whereas the REST method returns a JSON object containing the OCR of the input.
This app is available at https://secret-harbor.herokuapp.com/test.
MIT Licence. Copyright (c) 2015 Matteo Maggioni