Skip to content

GbotHQ/ocr-dataset-rendering

Repository files navigation

Handwritten OCR dataset rendering

Generate and render annotated images with text on a virtual paper with folds and other types of damage using PIL and Blender 3D.

Installation

  1. Clone the repository:
git clone --recursive https://github.com/GbotHQ/ocr-dataset-rendering.git
  1. Install the required packages:
cd ocr-dataset-rendering
pip install -r requirements.txt
  1. Download Blender:
cd src/Blender_3D_document_rendering_pipeline
bash download_blender_binary.sh
cd ../

Usage

example usage:

rm -r ../output
python main.py --n_samples 2 --blender_path "Blender_3D_document_rendering_pipeline/blender-3.4.0-linux-x64/blender" --output_dir ../output --device cpu --resolution_x 512 --resolution_y 512 --compression_level 9
python main.py --help

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages