Handwritten OCR dataset rendering

Generate and render annotated images with text on a virtual paper with folds and other types of damage using PIL and Blender 3D.

Installation

Clone the repository:

git clone --recursive https://github.com/GbotHQ/ocr-dataset-rendering.git

Install the required packages:

cd ocr-dataset-rendering
pip install -r requirements.txt

Download Blender:

cd src/Blender_3D_document_rendering_pipeline
bash download_blender_binary.sh
cd ../

Usage

example usage:

rm -r ../output
python main.py --n_samples 2 --blender_path "Blender_3D_document_rendering_pipeline/blender-3.4.0-linux-x64/blender" --output_dir ../output --device cpu --resolution_x 512 --resolution_y 512 --compression_level 9

python main.py --help

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Handwritten OCR dataset rendering

Installation

Usage

Files

README.md

Latest commit

History

README.md

File metadata and controls

Handwritten OCR dataset rendering

Installation

Usage