Add options for txt, pdf, and hocr output #13

Open
ElectricRCAircraftGuy opened this issue Mar 15, 2021 · 0 comments
ElectricRCAircraftGuy commented Mar 15, 2021

Sample run that outputs all 3 formats:

tesseract Selection_1.png out txt pdf hocr

produces:

out.hocr
out.pdf
out.txt

Make the options to pdf2searchablepdf work like this:

  • --txt - also output .txt
  • --no-pdf - do not output .pdf. PDF output is ALWAYS ON by default, so there is no separate --pdf option; --no-pdf is the only way to turn it off.
  • --hocr - also output .hocr

So:

  1. txt ONLY would be --txt --no-pdf
  2. txt and pdf would be --txt
  3. hocr ONLY would be --hocr --no-pdf
  4. hocr AND txt but NOT pdf would be --hocr --txt --no-pdf
  5. all 3: hocr, txt, and pdf, would be --hocr --txt
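
The flag combinations above can be sketched roughly as follows. This is only an illustration in Python using `argparse`, not the actual pdf2searchablepdf implementation; the function names `build_parser`, `requested_outputs`, and `tesseract_command` are hypothetical. It assumes the selected formats are simply passed to tesseract as trailing arguments, as in the sample run above.

```python
import argparse


def build_parser():
    """Hypothetical parser for the proposed flags (not the real tool's code)."""
    parser = argparse.ArgumentParser(prog="pdf2searchablepdf")
    parser.add_argument("--txt", action="store_true", help="also output .txt")
    parser.add_argument("--hocr", action="store_true", help="also output .hocr")
    # PDF output is on by default; --no-pdf switches it off.
    parser.add_argument("--no-pdf", dest="pdf", action="store_false",
                        help="do not output .pdf")
    return parser


def requested_outputs(argv):
    """Return the output formats to request, in the order used in the sample run."""
    args = build_parser().parse_args(argv)
    outputs = []
    if args.txt:
        outputs.append("txt")
    if args.pdf:
        outputs.append("pdf")
    if args.hocr:
        outputs.append("hocr")
    return outputs


def tesseract_command(image, out_base, argv):
    """Build the argument list: tesseract <image> <out_base> <formats...>."""
    return ["tesseract", image, out_base] + requested_outputs(argv)


if __name__ == "__main__":
    # Case 5 from the list above: all 3 outputs.
    print(tesseract_command("Selection_1.png", "out", ["--hocr", "--txt"]))
```

In this sketch `--no-pdf` on its own yields an empty format list. The issue does not say what should happen in that case, so it is left undecided here.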
ElectricRCAircraftGuy added a commit that referenced this issue Apr 6, 2021
I ran:

    tesseract tests/imgs/Selection_1.png tests/imgs/out txt pdf hocr

Relates to issue #13.

	new file:   tests/imgs/README.md
	new file:   tests/imgs/out.hocr
	new file:   tests/imgs/out.pdf
	new file:   tests/imgs/out.txt