How can I run the Orientation model solely to determine the page rotation angle? #1797
-
Hi Team, FYI: I'm using docTR (0.10.0) with the following configuration to pass an image and retrieve the page orientation. I'm satisfied with the results from this configuration.
Also, is there any guide available for training a custom orientation model? I checked this page, but it doesn't clearly explain how to properly prepare the dataset for training, e.g. what should our images look like? Thanks!!
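(On the dataset question: as far as I know docTR doesn't mandate a specific layout, but a common way to build a page-orientation classification set is to start from upright pages and generate the four rotated variants yourself. Below is a minimal numpy sketch; the function name and the 4-angle class set `{0, 90, 180, 270}` are my assumptions, not an official docTR format — check the classification training scripts in the docTR repo for the exact expected layout.)

```python
import numpy as np

ANGLES = (0, 90, 180, 270)  # assumed orientation classes, one label per rotated copy

def make_rotation_samples(page: np.ndarray):
    """Yield (rotated_page, angle_label) pairs for one upright page image."""
    for angle in ANGLES:
        # np.rot90 rotates counter-clockwise in 90-degree steps
        yield np.rot90(page, k=angle // 90), angle

# Usage: feed each (image, label) pair to your classification training pipeline
page = np.zeros((4, 6))  # stand-in for an upright page image
samples = list(make_rotation_samples(page))
```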
-
Hi @sanjay-nit 👋,

That's right, our orientation prediction depends on the detection model. Here is a short snippet showing how you could use it without the recognition part:

```python
import requests
import numpy as np

from doctr.io import DocumentFile
from doctr.models import detection_predictor, page_orientation_predictor
from doctr.models._utils import estimate_orientation

url = "https://www.francetvinfo.fr/pictures/uGwaNE-aJq7zHLhZJdzdCd9nyjE/1200x900/2021/03/16/phpCDwGn0.jpg"

det_predictor = detection_predictor(
    arch="fast_base",
    pretrained=True,
    assume_straight_pages=False,
)  # .cuda().half()  # Uncomment if running on GPU
page_orient_predictor = page_orientation_predictor(pretrained=True)  # .cuda().half()  # Uncomment if running on GPU

# Tune the detection postprocessor thresholds
det_predictor.model.postprocessor.bin_thresh = 0.3
det_predictor.model.postprocessor.box_thresh = 0.65

docs = DocumentFile.from_images([requests.get(url).content])

# Run detection and keep the raw output probability maps
loc_preds, out_maps = det_predictor(docs, return_maps=True)

# Binarize the output maps into uint8 segmentation masks
seg_maps = [
    np.where(out_map > det_predictor.model.postprocessor.bin_thresh, 255, 0).astype(np.uint8)
    for out_map in out_maps
]

# Coarse page orientation with confidence, one entry per page
_, classes, probs = page_orient_predictor(docs)
page_orientations = list(zip(classes, probs))

# Refine the angle estimate from the segmentation masks
origin_pages_orientations = [
    estimate_orientation(seg_map, general_orientation)
    for seg_map, general_orientation in zip(seg_maps, page_orientations)
]

orientations = [
    {"value": orientation, "confidence": prob}
    for orientation, prob in zip(origin_pages_orientations, probs)
]
print(orientations)
```
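As an aside, the binarization step in the snippet can be tried in isolation with a toy probability map (a self-contained numpy sketch; the values are made up, not real detector output):

```python
import numpy as np

# Toy probability map standing in for one of the detector's out_maps
out_map = np.array([[0.1, 0.4], [0.7, 0.2]])
bin_thresh = 0.3  # same threshold set on the detection postprocessor above

# Pixels above the threshold become 255, the rest 0
seg_map = np.where(out_map > bin_thresh, 255, 0).astype(np.uint8)
print(seg_map.tolist())  # [[0, 255], [255, 0]]
```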