kannada-ocr-test-images-with-ground-truth

This Kannada OCR benchmarking dataset contains 250 images, chosen to cover a range of recognition challenges. Some pages contain italic and bold characters. Some contain Halegannada (Old Kannada) poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they belong to. Some pages have interspersed English words; others have tables with a lot of numeric data. In addition, there are old pages containing either many broken characters or many words in which two or more characters are merged into a single connected component.

Folders and files:
Results
groundtruth
images
.gitignore
README.md
runTesseract.sh
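
Since the dataset pairs each image with ground truth, a common way to score an OCR engine on it is the character error rate (CER): the edit distance between the recognized text and the ground truth, divided by the ground-truth length. The sketch below is only an illustration, not the scoring method used by this repository. The function names are hypothetical, the distance is counted over Unicode code points after NFC normalization (an assumption; counting Kannada grapheme clusters would give different numbers), and how the strings are loaded from the `groundtruth` and `Results` folders is left out because the file naming is not described here.

```python
import unicodedata


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance over Unicode code points (two-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion from a
                            curr[j - 1] + 1,      # insertion into a
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """CER = edit distance / reference length, both strings NFC-normalized first."""
    ref = unicodedata.normalize("NFC", reference)
    hyp = unicodedata.normalize("NFC", hypothesis)
    if not ref:
        # Assumption: an empty reference scores 0 if the output is also empty, else 1.
        return 0.0 if not hyp else 1.0
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical example: the recognizer dropped the virama and one consonant.
    print(character_error_rate("ಕನ್ನಡ", "ಕನಡ"))
```

Note that CER can exceed 1.0 when the OCR output is much longer than the ground truth, so it is an error rate rather than a bounded accuracy score.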

MILE-IISc/Kannada-OCR-test-images-with-ground-truth