Skip to content

ONLYOFFICE-QA/onlyoffice_pdf_parser

Repository files navigation

Onlyoffice PDF Parser

It is gem for parsing pdf files.

Installation

This gem requires pdfinfo app, part of poppler-utils.
Also imagemagick required.

  1. Install system dependencies:

    • Debian-Based Linux:

      sudo apt-get install imagemagick \
                           libmagickwand-dev \
                           poppler-utils
    • Fedora-Based Linux:

      sudo dnf install ImageMagick \
                       ImageMagick-devel \
                       poppler-utils
  2. Install gem by command:

    gem install onlyoffice_pdf_parser

Example

require 'onlyoffice_pdf_parser'

OnlyofficePdfParser::PdfParser.parse('Text.pdf')