ocrd_utils.coordinates_for_segment: clip to parent?

We have the utility function `coordinates_for_segment` for conversion of polygons from some derived image and its coordinate transform to absolute coordinates of the original image.

This is all the function does, but of course the __expectation__ is more: Whereever that polygon was in the derived image, give me coordinates which I can annotate as final result of segmentation! (And unfortunately, that is also what it is mostly used for.)

However, that would entail also doing the inverse of `image_from_polygon`, i.e. clipping (or rather masking) the polygon to its _parent element_. This usually means ensuring the child element's polygon stays within the parent element's polygon. Except for the top-level parent (`Page` when there is no `Border`), where this means clipping coordinates to a range between zero and image size.

Thus, not having that functionality in core means segmenters will tend to produce inconsistent coordinates (failing the PAGE validator), and sometimes (e.g. when the page was rotated) even negative coordinates (failing XML validators and other PAGE libraries).

Unfortunately, we already made the first argument the segment's __polygon__ instead of its __element__ (as in `coordinates_of_segment` – an ugly asymmetry which itself originated from PAGE's `Border` idiosyncrasy). So we cannot silently enrich `coordinates_for_segment` by looking into the segment's `parent_object_` and clipping the polygon respectively. 

Or can we? How about making it polymorphic and checking the type of the first argument, as an opt-in for implicit conversion consistent with parent? To me that seems much better than providing a new function and trying to find processors which need it. We could even transform the function to be exclusively for the new usage by starting to deprecate the old usage at a later point and hunting it down.

@kba @wrznr if you give me your thumbs up I will kick this off with a PR.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ocrd_utils.coordinates_for_segment: clip to parent? #489

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

ocrd_utils.coordinates_for_segment: clip to parent? #489

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions