This project uses a multimodal large language model to perform Optical Character Recognition (OCR) on images of forms and extract the information into a structured JSON format.
Follow these steps to set up the project environment.
```
git clone https://github.com/maxchanhi/OCR-form-to-json.git
cd OCR-form-to-json
```

This project uses uv for package management.
First, install uv:

```
pip install uv
```

Then, create a virtual environment and install the project dependencies:

```
uv venv
uv pip sync pyproject.toml
```

You need to have Ollama installed to run the multimodal model.
- Install Ollama: Follow the instructions on the Ollama website.
- Pull the model: Once Ollama is installed and running, pull the `qwen2.5vl` model:

```
ollama pull qwen2.5vl
```
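Once the model is pulled, a script can talk to it through Ollama's local HTTP API. A minimal sketch of building such a request is below — this is illustrative, not code from the repository; it assumes Ollama's `/api/generate` endpoint, which accepts base64-encoded images in an `images` field for multimodal models:

```python
import base64


def build_ollama_request(image_bytes: bytes, prompt: str) -> dict:
    """Build a payload for Ollama's /api/generate endpoint.

    Multimodal models such as qwen2.5vl accept base64-encoded images
    in the "images" field alongside the text prompt.
    """
    image_b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": "qwen2.5vl",
        "prompt": prompt,
        "images": [image_b64],
        "stream": False,
        # Ask Ollama to constrain the reply to valid JSON.
        "format": "json",
    }


# Usage: read an image from disk, then POST the payload to
# http://localhost:11434/api/generate with any HTTP client.
payload = build_ollama_request(b"\x89PNG\r\n", "Extract the form fields as JSON.")
```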
To run the OCR process, execute the fill.py script from the root of the project:
```
uv run fill.py
```

The script will:
- Process all images in the `img/` directory.
- Use the `qwen2.5vl` model to extract information based on the template in `json_template/`.
- Save the extracted JSON data into the `result/` directory.
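Conceptually, the last two steps amount to merging the model's JSON reply into a form template. A minimal sketch of that merge, assuming the template is a flat JSON object (the field names below are hypothetical, not taken from `json_template/`):

```python
import json


def fill_template(template: dict, model_reply: str) -> dict:
    """Merge a model's JSON reply into a form template.

    Keys present in the template take the model's value when supplied;
    keys the model invents are ignored, and fields the model missed
    keep the template's default.
    """
    extracted = json.loads(model_reply)
    return {key: extracted.get(key, default) for key, default in template.items()}


# Hypothetical template mirroring a file in json_template/:
template = {"name": None, "date": None, "signature": None}
reply = '{"name": "Ada Lovelace", "date": "1843-09-05", "extra": "ignored"}'
result = fill_template(template, reply)
# result keeps only the template's keys, with "extra" discarded
```

Restricting the output to the template's keys keeps the saved JSON files in `result/` uniform even when the model returns stray fields.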