Skip to content

ahmedisam99/pdf-ocr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Optical Character Recognition

Dependencies (third-party)

Overview

An electron based windows applicatoin for the conversion of scanned pdf files and/or (weird) arabic pdf files (in which search feature doesn't work) into a searchable pdf files using google's tesseract optical character recognition tool.

Fails :(

  • GraphicsMagic, ImageMagick, and Tesseract-OCR are all pre-built with the app exept for Ghostscript which is a dependency of GraphicsMagic which i could not re-configure to consider the new path of the binaries of Ghostscript. So it shall be installed seperatly.

  • It bored me to read the docs of how to write an nsis script (.nsh/.nsi) in order to automatically update %PATH% environment variable. So it shall be updated manually (the path can be copied into clipboard from within the help submenu).

About

PDF Optical Character Recognition

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published