Skip to content

carterprince/pdf2txt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdf2txt

A command-line utility to extract text from PDF files using the Poppler library. It is significantly faster than pdftotext, for which it serves as a minimal drop-in replacement (albeit with fewer options).

Dependencies

  • GCC
  • Poppler as well as its glib bindings (poppler-glib)

Installation

To install the program, run:

sudo make install

This will install the executable to /usr/local/bin (or the path specified in the Makefile).

Usage

To extract text from a PDF file, run:

pdf2txt [--stdout] <path_to_pdf>

By default, the program will create a text file with the same name as the PDF file in the current directory. If you pass the --stdout option, the extracted text will be printed to the standard output instead.

License

This project is released under the GNU General Public License v3.0.

About

A C rewrite of the python program pdftotex

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published