Skip to content

Latest commit

 

History

History
37 lines (27 loc) · 802 Bytes

README.md

File metadata and controls

37 lines (27 loc) · 802 Bytes

Handwritten OCR dataset rendering

Generate and render annotated images with text on a virtual paper with folds and other types of damage using PIL and Blender 3D.

Installation

  1. Clone the repository:
git clone --recursive https://github.com/GbotHQ/ocr-dataset-rendering.git
  1. Install the required packages:
cd ocr-dataset-rendering
pip install -r requirements.txt
  1. Download Blender:
cd src/Blender_3D_document_rendering_pipeline
bash download_blender_binary.sh
cd ../

Usage

example usage:

rm -r ../output
python main.py --n_samples 2 --blender_path "Blender_3D_document_rendering_pipeline/blender-3.4.0-linux-x64/blender" --output_dir ../output --device cpu --resolution_x 512 --resolution_y 512 --compression_level 9
python main.py --help