Loghi-HTR V2 #25

TimKoornstra · 2024-03-15T13:42:17Z

V2

This combined pull request encompasses a range of enhancements and refactors across various aspects of the project. Below are the key changes and improvements grouped by their original pull request numbers.

Visualize files revamp #11

Refactored old visualize files & made them independent of data loaders / utils files, dark/light-mode support.
Time-step prediction visualizer takes top-3 most probable characters for a time-step, generates a CTC encoding table, pre-processes "invisible" or unknown characters.
Filter activations visualizer: plots conv layer activations on top of input sample image, plots conv layer activations when provided with a random noise image, accounts for models of different channels / sample image shapes.
PDF combiner that creates a single sheet of the created visualizers.
vis_arg_parser: used for processing user arguments in the terminal similar to how loghi-htr processes arguments.
README instructions.

Main refactor & organize files into subfolders #13

Extensive refactoring of main.py, organizing functions into separate files within subfolders inside the src directory.
Overhauled logging for better clarity and structured logging.
Improved GPU handling by shifting selection from environment variable manipulation to direct TensorFlow handling.
Adjusted TensorFlow logging to clean up output.
Deprecation notices for specific arguments effective May 2024.
Updates in ResidualBlock to incorporate ReLU activation by default.
This update constitutes a major change in the codebase.

Improved LR schedule #14

Added a custom LoghiLearningRateSchedule class for more flexible learning rate adjustments.
Integrated learning rate logging in each step within CustomLoghiCallback.
Added unit tests for the LoghiLearningRateSchedule class and incorporated these tests into GitHub Actions.

Improved args, and added config file #15

Implemented the ability to run Loghi using a --config_file argument.
Completely overhauled config.json file structure for better organization.
Added a Config class in src/setup/config.py for enhanced management of configuration settings.
API updated to support both old and new config.json structures for backward compatibility.

API v2 #17

Significant updates and refactorings across the model loading process, code refactoring for efficiency, and new environment variables.
Gunicorn integration and endpoint monitoring for health checks.
Security enhancements and separate decoding process for better GPU utilization.
Logging improvements and output format changes for predictions.

Minor improvements #18

General code cleanup and custom callback enhancement.
Introduced RMSProp as an additional optimizer option.
Adjustments in CTC Loss functionality and relocation for better organization.

V1 DataGenerator and Data Augment Revamp on GPU (no unittest updates) #21

Refactored data augmentations to be part of the final Sequential model for GPU support.
Custom Keras Layer classes for each type of augmentation, easily extendable for future augments.
Data augmentation visualizations with --visualize_augments.

QOL changes and code simplifications #22

Simplifications across the code enhancing readability and maintainability.
Functional improvements in greedy decoding confidence score and speed enhancements in test and validation modes.

Improve code quality #23

Replaced f-strings in logging with lazy %.
Removed unused variables and added encoding to all with open() statements.

Data loader upgrade #24

Refactored DataLoader and DataGenerator classes, renaming and improving code documentation and clarity.
Fixed an augmentation bug and split functions into smaller subroutines for better readability.
Add ability to use "sample weights" in training data. They should be supplied in the second column of the txt file.

Other changes include deprecated arguments in favor of new ones, updated requirements.txt, and added a recommended model to the model library.

Oh yeah, it's all coming together

Next: another revision and split functions into files

…oved parts of visualize prep to separate vis_utils.py. Added error handling and docstrings

Things WILL break

…alents, docstrings, normalisation of variable names

…rements.txt, pruned vis_arg_parser, reintroduced main() functions, PdfMaker changes to structure

Main refactor & organize files into subfolders

…/loghi-htr into visualize-files-revamp

Data loader upgrade

TimKoornstra and others added 30 commits November 24, 2023 09:40

Add normalization + move deprecating args to dedicated section

401ddda

Additional validation revision

a53efa0

Add WBS

93ba893

Validation revision iteration

7bbf80b

Oh yeah, it's all coming together

Removed unused/redundant files

8a4d3fd

Prettify outputs and print 95% confidence intervals at the end

5c95375

Add normalization to validation confidence score

db7c575

Finish inference

a902cec

Next: another revision and split functions into files

Initial split of main.py into different files

76981b7

Fix output layer bug when creating new VGSL model

0bd2098

Remove unused files

afad57c

Refactor + splits into multiple functions, separate PdfMaker class, m…

b44b839

…oved parts of visualize prep to separate vis_utils.py. Added error handling and docstrings

Readme updates

e67252d

Move files into folders and merge some files

ff7d99b

Things WILL break

Update README

25b8eb5

Add docstrings and type hints to new functions

2a2b552

Filter some TF logging

d384ba0

Add deterministic mode, even stricter TF logging

4e9bad3

Shear TFA dependency replacements with cv2 warpAffine and numpy equiv…

2db3192

…alents, docstrings, normalisation of variable names

Move channels and height args to deprecation zone

e34878a

Refactors after 2nd round of feedback, more docstrings, updated requi…

6df5f28

…rements.txt, pruned vis_arg_parser, reintroduced main() functions, PdfMaker changes to structure

Fix multi-gpu and "all" gpu settings

cd47a0f

Resolve merge conflicts

9bc1226

Add git url to config

52a0ebb

Merge pull request #13 from rvankoert/main-refactor

e08e64d

Main refactor & organize files into subfolders

Update README.md

bdc8ba2

Removed redundant code, added type hints, other small clean-ups

da03ac6

Removed tfa dependency in requirements

685badc

Resolve merge conflicts

eb1e91c

Merge branch 'visualize-files-revamp' of https://github.com/rvankoert…

edc739c

…/loghi-htr into visualize-files-revamp

TimKoornstra and others added 28 commits March 7, 2024 13:44

Fix decoding bug when decoded_array is empty

abc6e74

Fix test and inference modes

f7bea35

Split _create_data into subroutines

7e0f0e4

Upgrade faulty line logging

5802bc9

Add docstrings and type hints

eef1b0c

Rename DataLoader -> DataCreator, DataGenerator -> DataLoader

49e5734

Rename DataCreator to DataManager

bc3266b

Rename test file

23fcebc

Update DataLoader docstrings and type hints

74d356b

Fix old args from config not working bug

b6e363d

Fix augmentations

9f6152d

Fix ElasticTransform bug, manager repeat

60f2190

Minor fixes

fc64ff0

Update DataLoader unittests

5caae32

Update DataManager tests

4ed7449

Improve pylint score

0079214

Fix incorrect length train batches bug + some extra logging

e2baf48

Merge branch 'v2' into data-loader-upgrade

aef7be9

Merge pull request #24 from rvankoert/data-loader-upgrade

398c694

Data loader upgrade

Remove unnecessary logging, add recommended model to VGSL library

85a12d3

Save all model layers as trainable by default

63043ee

Bump requirements

6fcae63

Update configs

f19e107

Deprecate --existing_model, make model library more lenient

151861a

Update README.md

5b7769a

Merge branch 'master' into v2

d4f455d

Make confidence score more like v1

c00808b

Merge branch 'master' into v2

cd626df

TimKoornstra merged commit 59af90a into master Mar 25, 2024
5 checks passed

TimKoornstra deleted the v2 branch March 25, 2024 09:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loghi-HTR V2 #25

Loghi-HTR V2 #25

TimKoornstra commented Mar 15, 2024 •

edited

Loading

Loghi-HTR V2 #25

Loghi-HTR V2 #25

Conversation

TimKoornstra commented Mar 15, 2024 • edited Loading

V2

Visualize files revamp #11

Main refactor & organize files into subfolders #13

Improved LR schedule #14

Improved args, and added config file #15

API v2 #17

Minor improvements #18

V1 DataGenerator and Data Augment Revamp on GPU (no unittest updates) #21

QOL changes and code simplifications #22

Improve code quality #23

Data loader upgrade #24

TimKoornstra commented Mar 15, 2024 •

edited

Loading