IndexPDF is a CLI script that indexes the unique words in a PDF document and the pages on which they appear. Finding no similar Python script that did not rely extensively on PyPDF2, I wrote this one after discovering that Adobe charges a gargantuan sum for this simple functionality.


HypoChloremic/IndexPDF


IndexerRetro

IndexerRetro is a small script that indexes PDF files. I wrote it after having to index a PDF myself and finding that Adobe charges $300 per year for the same service.

To run:

python indexerRetro.py --parent [Abs path to parent dir of pdf] --file [pdf name] --output [file name]
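The three flags in the command above could be parsed roughly as follows. This is a minimal sketch using the standard-library `argparse` module, not the code in `indexerRetro.py`; the function names `parse_args` and `pdf_path` are hypothetical, and treating all three flags as required is an assumption.

```python
import argparse
import os


def parse_args(argv=None):
    """Parse the --parent, --file and --output flags (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        description="Index the unique words of a PDF by page."
    )
    # Assumption: every flag is mandatory; the real script may define defaults.
    parser.add_argument("--parent", required=True,
                        help="absolute path to the directory containing the PDF")
    parser.add_argument("--file", required=True, help="name of the PDF file")
    parser.add_argument("--output", required=True,
                        help="name of the text file to write")
    return parser.parse_args(argv)


def pdf_path(args):
    """Join directory and file name with the separator of the current OS."""
    return os.path.join(args.parent, args.file)
```

Building the path with `os.path.join` rather than string concatenation is one way to keep the same command working on Windows and on other systems.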

The script then produces a text file listing every unique word in the PDF document, together with the pages on which each word appears.
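The word-to-page mapping can be illustrated with a short sketch. It assumes the text of each page has already been extracted into a list of strings (the extraction step is not shown), and the names `build_index` and `format_index`, the lowercasing of words, and the `word: pages` line format are all assumptions made for illustration rather than the script's actual behaviour.

```python
import re
from collections import defaultdict


def build_index(pages):
    """Map each unique lowercase word to the sorted 1-based pages it occurs on.

    `pages` is a list of strings, one per page (assumed to be extracted already).
    """
    index = defaultdict(set)
    for page_number, text in enumerate(pages, start=1):
        # [^\W\d_]+ matches runs of letters only, so digits and punctuation are skipped.
        for word in re.findall(r"[^\W\d_]+", text.lower()):
            index[word].add(page_number)
    # Sort words alphabetically and page numbers numerically.
    return {word: sorted(nums) for word, nums in sorted(index.items())}


def format_index(index):
    """Render one 'word: p1, p2' line per word (hypothetical output format)."""
    return "\n".join(
        f"{word}: {', '.join(map(str, nums))}" for word, nums in index.items()
    )
```

Collecting page numbers in a set means a word repeated on the same page is recorded only once for that page.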

  • Adapted to Windows
  • Adapted to everything else
2018 Ali Rassolie
