    Repositories list

    • OCR web-application
      TypeScript
      0460 Updated Feb 22, 2023
    • m2repo

      Public
      0000 Updated Jun 23, 2022
    • wikiclean

      Public
      A Java Wikipedia markup to plain text converter
      Java
      20000 Updated Nov 30, 2021
    • TypeScript
      Apache License 2.0
      0000 Updated Jun 12, 2021
    • Java
      1000 Updated Jun 12, 2021
    • C++
      3220 Updated Jun 8, 2021
    • CRNN

      Public
      Convolutional recurrent neural network for scene text recognition or OCR in Keras
      Python
      MIT License
      34000 Updated May 21, 2021
    • This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants t…
      Shell
      2300 Updated May 6, 2021
    • ScanLibrary is an Android document-scanning library built on top of OpenCV. Using the app, you can select the exact edges of a document, crop it to the four selected edges, and change the perspective transformation of the cropped image.
      C++
      MIT License
      472000 Updated Mar 9, 2021
    • Java
      0000 Updated Feb 3, 2021
    • A browser plugin for Google Chrome that instantly transliterates a website written in any Indic script into Kannada. The plugin exploits the parallelism between Unicode blocks and also uses a rule-based approach to transliterate web pages into Kannada. This enables a polyglot user to read online documents in other Indic scripts through the Kannada script. Curr…
      JavaScript
      Apache License 2.0
      0000 Updated Dec 17, 2020
    • OCR dataset of scanned pages of Tulu books along with ground truth text
      Apache License 2.0
      0200 Updated Feb 4, 2019
    • OCR dataset of scanned images of Sanskrit text printed using Kannada script along with ground truth text
      Apache License 2.0
      0200 Updated Feb 4, 2019
    • OCR dataset of Konkani documents printed using Kannada script along with ground truth text
      Apache License 2.0
      0100 Updated Feb 4, 2019
    • Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text
      Shell
      Apache License 2.0
      0200 Updated Dec 30, 2018
    • Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text
      Shell
      Apache License 2.0
      0200 Updated Oct 5, 2018