Skip to content

Commit

Permalink
Add OCR models Fraktur and Latin for Tesseract
Browse files Browse the repository at this point in the history
Both are not language specific, but support historic and current scripts
used by many European languages.

Signed-off-by: Stefan Weil <sw@weilnetz.de>
  • Loading branch information
stweil committed Aug 14, 2023
1 parent 5379cd1 commit 52742e9
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 0 deletions.
6 changes: 6 additions & 0 deletions public/langs.json
Original file line number Diff line number Diff line change
@@ -1,4 +1,10 @@
{
"Fraktur": {
"tesseract": "Fraktur"
},
"Latin": {
"tesseract": "Latin"
},
"af": {
"tesseract": "afr",
"google": "af"
Expand Down
2 changes: 2 additions & 0 deletions src/Engine/EngineBase.php
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,8 @@ abstract class EngineBase {

/** @var string[] Additional localized names for non-standard language codes. */
public const LANG_NAMES = [
'Fraktur' => 'Fraktur script',
'Latin' => 'Latin script',
'az-cyrl' => 'Azərbaycan (qədim yazı)',
'bali' => 'Balinese palm-leaf manuscripts 16th century',
'ben-print' => 'Bengali Printed Books +150 New',
Expand Down

0 comments on commit 52742e9

Please sign in to comment.