add OCR Decoding support - WIP #39

N950 · 2024-06-10T14:28:48Z

Adding Nodes/Loss/Metric for OCR recognition/decoding
Conformity to contribution guidelines will be fixed before closing

* Added ResNet variants * ResNet50 example * Simplified example * fixed resnet config * [Automated] Updated coverage badge --------- Co-authored-by: Martin Kozlovsky <martin.kozlovsky@luxonis.com> Co-authored-by: GitHub Actions <actions@github.com>

* fixed incorrect class property call * fixed exporter uploading * uploadCheckpoint uploads on every checkpoint epoch * fix temp files names * updated callback readme * pre-commit run

* option to source custom code in CLI * removed empty dicts * [Automated] Updated coverage badge --------- Co-authored-by: GitHub Actions <actions@github.com>

* option to source custom code in CLI * removed empty dicts * fixed issue with removed tensor metadata in match case statements

* forbid extra fields in config * fixed configs

* automatic inference of attach index based on type signature * added inference for input and x names

* fixed link in docs * fixed repvgg backbone * fixed efficientnet

* upload logs to mlflow * added mlflwo instance * multithread log upload * fixed upload logs * fixed log file path * removed exceptions * logging exceptions * fixed typo * reverted exception * moved line * replaced warning with error log * Update trainer.py

* add archiver CLI * add archiver callback * add max_det parameter to EfficientBBoxHead * add enum to categorize tasks for the implemented heads * add archiver tests * adjust Archiver to new nn archive format * pre-comit formatting * add LDF creation and adjust to new nn archive format * update requirements.txt * add opencv-python to requirements.txt * add support for ImplicitKeypointBBoxHead * remove support for ObjectDetectionSSD * Update requirements.txt * Added mlflow and removed opencv * [Automated] Updated coverage badge * add support for SegmentationHead and BiSeNetHead * base archiver tests on model from luxonis-train instead of torchvision * adjust head parameters to changes in NN Archive * adjust keypoint detection head parameters to changes in NN Archive * bugfix - make sure self.max_det is used in nms * add max_det parameter to ImplicitKeypointBBoxHead * adjust task categorization for ImplicitKeypointBBoxHead * fixing Windows PermissionError occuring on file deletion * fixing Windows PermissionError occuring on file deletion due to unreleased logging handlers * add method to remove file handlers keeping the log file open * add a logging statement at the end of archiving * add optuna_integration to requirements.txt * add hard-coded solution to determining is_softmax parameter * added help --------- Co-authored-by: Martin Kozlovský <martin.kozlovsky@luxonis.com> Co-authored-by: GitHub Actions <actions@github.com>

* extend NN Archive generation test coverage to cover all implemented heads * [Automated] Updated coverage badge --------- Co-authored-by: GitHub Actions <actions@github.com>

* uploading all checkpoints * fix names * removed comment

* handling SIGTERM signal * resume argument takes path

* handling SIGTERM signal * resume argument takes path * basic task group labels support * updated requirements * fixed tests * fixed loader test * Update luxonis_train/models/luxonis_model.py Co-authored-by: conorsim <60359299+conorsim@users.noreply.github.com> --------- Co-authored-by: conorsim <60359299+conorsim@users.noreply.github.com>

* option to set torch matmul precision for tensor cores * updated readme

* fixed reset not being called * added metric resets * removed inheritance * proper oks reset * removed unnecessary resets * added annotations

* Added seed to config for reproducibility * fixed seg drawing when using torch deterministic backend * added deterministic order of creating nodes * removed seed from example config * added reproducability to inspect * formatting --------- Co-authored-by: DrejcPesjak <dp8949@student.uni-lj.si> Co-authored-by: Martin Kozlovský <martin.kozlovsky@luxonis.com>

* support for custom loaders and datasets * updated configs * custom loaders in inspect command * updated inspect for multi-task labels * removed custom loader from test config * deleted comment * deleted comment * removed custom dataset * removed comment * skipping archiver test untill fixed in luxonis-ml * [Automated] Updated coverage badge --------- Co-authored-by: GitHub Actions <actions@github.com>

Co-authored-by: GitHub Actions <actions@github.com>

github-actions · 2024-06-10T14:46:44Z

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines	Covered	Coverage	Threshold	Status
4978	3753	75%	0%	🟢

New Files

File	Coverage	Status
luxonis_train/attached_modules/losses/ocr_decoding_loss.py	29%	🟢
luxonis_train/attached_modules/metrics/ocr_accuracy.py	27%	🟢
luxonis_train/nodes/ocr_decoder.py	32%	🟢
TOTAL	29%	🟢

Modified Files

File	Coverage	Status
luxonis_train/attached_modules/losses/init.py	100%	🟢
luxonis_train/attached_modules/metrics/init.py	100%	🟢
luxonis_train/nodes/init.py	100%	🟢
TOTAL	100%	🟢

updated for commit: da0106e by action🐍

kozlov721 · 2024-09-10T16:42:05Z

luxonis_train/attached_modules/losses/ocr_decoding_loss.py

+from .base_loss import BaseLoss
+
+
+class FocalCTC(BaseLoss[Tensor, Tensor]):


Add class attribute supported_labels = [LabelType.TEXT] to all the losses and metrics

Also the metrics and losses will probably require custom prepare method that takes the packed output of the head and labels and prepares argument to be passed to forward or update.

kozlov721 · 2024-09-10T16:44:54Z

luxonis_train/nodes/ocr_decoder.py

+from luxonis_train.utils.types import LabelType
+
+
+class ResBlock(nn.Module):


I believe both ResBlock nad DownSample should already be implemented in luxonis_train.nodes.blocks or in one of the backbones

kozlov721 · 2024-09-10T16:46:38Z

luxonis_train/nodes/ocr_decoder.py

+        return out
+
+
+class OCRDecoderBackbone(BaseNode):


Instead of _task_type argument, add a class attribute tasks: list[LabelType] = [LabelType.TEXT]

DrejcPesjak and others added 27 commits February 20, 2024 03:44

Resnet Variants (#9)

7c16e47

* Added ResNet variants * ResNet50 example * Simplified example * fixed resnet config * [Automated] Updated coverage badge --------- Co-authored-by: Martin Kozlovsky <martin.kozlovsky@luxonis.com> Co-authored-by: GitHub Actions <actions@github.com>

MLFlow Upload Fix (#10)

8e35f25

* fixed incorrect class property call * fixed exporter uploading * uploadCheckpoint uploads on every checkpoint epoch * fix temp files names * updated callback readme * pre-commit run

CLI Source Option (#11)

15bd923

* option to source custom code in CLI * removed empty dicts * [Automated] Updated coverage badge --------- Co-authored-by: GitHub Actions <actions@github.com>

Fix Removed Tensor Metadata (#12)

2797278

* option to source custom code in CLI * removed empty dicts * fixed issue with removed tensor metadata in match case statements

Forbid Extra Fields (#13)

2c62a08

* forbid extra fields in config * fixed configs

Automatic Inference of attach_index (#14)

0b51fa0

* automatic inference of attach index based on type signature * added inference for input and x names

Backbone Fix (#15)

bd67595

* fixed link in docs * fixed repvgg backbone * fixed efficientnet

Extend NN Archive Generation Test Coverage (#18)

b3b4e32

* extend NN Archive generation test coverage to cover all implemented heads * [Automated] Updated coverage badge --------- Co-authored-by: GitHub Actions <actions@github.com>

Upload All Checkpoints (#19)

351e0c5

* uploading all checkpoints * fix names * removed comment

LuxonisML v0.1.0 (#20)

9c4cadb

SIGTERM Handling (#21)

f425fdb

* handling SIGTERM signal * resume argument takes path

Tensor Core Float16 Precision (#24)

d1d71f0

* option to set torch matmul precision for tensor cores * updated readme

Metrics - Fixed Missing Reset (#25)

0830043

* fixed reset not being called * added metric resets * removed inheritance * proper oks reset * removed unnecessary resets * added annotations

enums handling (#31)

b6b4688

GPUStatsMonitor (#29)

72afb72

Co-authored-by: GitHub Actions <actions@github.com>

More Efficient Keypoint Export (#28)

5893c3e

Added active param to augmentations (#32)

4110f78

Fix Archiver Pre-Processing (#34)

36a92a6

EfficientRep Variants (#33)

1d9998b

Support for LuxonisML - Annotation Refactor (#37)

c2e98b7

Co-authored-by: GitHub Actions <actions@github.com>

Changed Imports in Config (#38)

abe7d3d

Co-authored-by: GitHub Actions <actions@github.com>

adding OCR Decoding support

da0106e

N950 requested review from kozlov721, tersekmatija and conorsim June 10, 2024 14:28

[Automated] Updated coverage badge

b716f50

kozlov721 reviewed Sep 10, 2024

View reviewed changes

kozlov721 deleted the branch main October 9, 2024 10:35

kozlov721 closed this Oct 9, 2024

kozlov721 reopened this Oct 9, 2024

kozlov721 changed the base branch from dev to main October 9, 2024 10:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add OCR Decoding support - WIP #39

add OCR Decoding support - WIP #39

N950 commented Jun 10, 2024

github-actions bot commented Jun 10, 2024

kozlov721 Sep 10, 2024 •

edited

Loading

kozlov721 Sep 10, 2024

kozlov721 Sep 10, 2024 •

edited

Loading

kozlov721 Sep 10, 2024

		from .base_loss import BaseLoss


		class FocalCTC(BaseLoss[Tensor, Tensor]):

		from luxonis_train.utils.types import LabelType


		class ResBlock(nn.Module):

add OCR Decoding support - WIP #39

Are you sure you want to change the base?

add OCR Decoding support - WIP #39

Conversation

N950 commented Jun 10, 2024

github-actions bot commented Jun 10, 2024

☂️ Python Coverage

Overall Coverage

New Files

Modified Files

kozlov721 Sep 10, 2024 • edited Loading

Choose a reason for hiding this comment

kozlov721 Sep 10, 2024

Choose a reason for hiding this comment

kozlov721 Sep 10, 2024 • edited Loading

Choose a reason for hiding this comment

kozlov721 Sep 10, 2024

Choose a reason for hiding this comment

kozlov721 Sep 10, 2024 •

edited

Loading

kozlov721 Sep 10, 2024 •

edited

Loading