Replies: 19 comments 1 reply
-
You should be able to set image_size to a custom resolution (I haven't tried this on all models/configs). Something like:
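For illustration, in an anomalib-style config.yaml the entry might look like the snippet below. The exact key names and nesting are an assumption here and depend on the anomalib version you are running, so check your own config against the docs:

```yaml
dataset:
  # [height, width]; a single int means a square resolution
  image_size: [256, 512]
```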
-
As @j99ca said, you can set both width and height dimensions in the config. You could also try the tiling functionality (it only works for some models) to tile the image.
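A hypothetical tiling section for an anomalib-style config.yaml might look like this. The key names (tiling, apply, tile_size, stride) are an assumption based on older anomalib configs and may differ in your version:

```yaml
dataset:
  image_size: [256, 256]
  tiling:
    apply: true
    tile_size: 64
    stride: 64   # stride == tile_size gives non-overlapping tiles
```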
-
Thanks for your suggestions @j99ca and @blaz-r. I'll work it out and keep this thread posted on the results.
-
@blaz-r, I enabled tiling in the config. Even after enabling it, the image gets resized during inference to the image_size set in the config. My question is: if the tiler splits the image into patches before feeding them to the model, what's the significance of image_size in the config? Kindly clarify. Here's my config:
-
Tiler is implemented as part of the model. This means the image is first resized to image_size, and only then split into tiles inside the model.
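The resize-then-tile order can be sketched in plain NumPy. This is a simplified stand-in for anomalib's internal Tiler, not its actual implementation; it just shows why image_size still matters even when tiling is enabled:

```python
import numpy as np

def tile(image: np.ndarray, tile_size: int) -> np.ndarray:
    """Split an (H, W, C) image into non-overlapping (tile_size, tile_size, C) patches."""
    h, w, c = image.shape
    assert h % tile_size == 0 and w % tile_size == 0, "image must divide evenly into tiles"
    patches = image.reshape(h // tile_size, tile_size, w // tile_size, tile_size, c)
    patches = patches.swapaxes(1, 2).reshape(-1, tile_size, tile_size, c)
    return patches

# The input is first resized to image_size (say 256x256) before the model
# sees it; only then does the tiler split it into patches.
resized = np.zeros((256, 256, 3), dtype=np.uint8)  # stands in for the resized input
patches = tile(resized, tile_size=64)
print(patches.shape)  # (16, 64, 64, 3)
```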
-
Thanks for the reply @blaz-r. In that case, how do we deal with images of different aspect ratios? Setting a fixed resolution as the image_size might deform the images. Any suggestions on this, please?
-
In that case, I'm not sure. Given the way most of the models work, I'd say you'd want a fixed aspect ratio for them to work best. If some other task has a different aspect ratio, it would probably be best to crop the image to the same ratio, or, if that's not possible, train a separate model. If the difference in ratio is small, then it shouldn't be much of a problem to use the same resolution.
-
Thanks for your suggestion @blaz-r. I'm trying to do the tiling beforehand to maintain the aspect ratio and feed the patches to the model instead of the whole image. I will share the results in this thread. Once again, thanks for your inputs @blaz-r and @j99ca.
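One way to pre-tile without distorting the content is to pad the image up to a multiple of the tile size and then cut fixed-size patches. This is an illustrative sketch of that idea, not anomalib code:

```python
import numpy as np

def pretile(image: np.ndarray, tile_size: int = 256) -> list:
    """Pad an (H, W, C) image so both sides divide tile_size, then cut square patches.
    Padding (instead of resizing) preserves the aspect ratio of the content."""
    h, w, _ = image.shape
    pad_h = (-h) % tile_size
    pad_w = (-w) % tile_size
    # Replicate edge pixels into the padded border rather than filling with black.
    padded = np.pad(image, ((0, pad_h), (0, pad_w), (0, 0)), mode="edge")
    ph, pw, _ = padded.shape
    patches = []
    for y in range(0, ph, tile_size):
        for x in range(0, pw, tile_size):
            patches.append(padded[y:y + tile_size, x:x + tile_size])
    return patches

image = np.zeros((300, 500, 3), dtype=np.uint8)
patches = pretile(image, tile_size=256)
print(len(patches))  # padded to 512x512 -> 2 rows x 2 cols = 4 patches
```

Each patch can then be fed to the model as an independent sample, so image_size in the config can simply match tile_size.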
-
As mentioned, I did the tiling beforehand and fed the patches to the network. It works fine and is able to detect defects when both the good and bad samples are uniform. However, in my case, I will encounter patches with different patterns (say, text with different characters and orientations, and barcodes of different sizes and orientations). The way I thought through it is to train a separate model for each kind of patch. Can the anomalib library be useful for such cases? I've attached samples for reference. @blaz-r, kindly share your thoughts on this.
-
What I observed from the MVTec dataset is that the good samples all belong to the ideal condition of one single product, without much variation in patterns. In my case, the good samples might contain different textures/patterns, such as text and labels, as shared above. Can anomalib be used for such scenarios? Kindly share your expertise on this @blaz-r @j99ca.
-
Hello. Currently the tiled ensemble, where each patch has a separate model, is still in development.
-
Thanks for the reply @blaz-r. I don't want to do OCR, as the text on the patch is not important. All I need is to be able to detect any defects on patches with different kinds of text. Thoughts?
-
My idea was that if you can read the text, you can check it in text form to see if it's okay. If the content is not important, then I assume you can use anomalib models, but some models work a bit better if you can align the text.
-
@blaz-r, once again thanks for the reply. Not all patches will have text in them; some patches, as I shared above, will have printed diagrams. I've shared a sample here.
-
It should still work, but I think it will work best if the same model always sees the same patch, and that patch is also aligned.
-
Hi @blaz-r, I tried to train a padim model for this use case. The model trains fine when the dataset is small (~1000 good images and ~100 bad images). However, if I increase the dataset size, the training gets killed. I found a similar issue: #630. So I'm planning to train a model other than padim. Since there are many models to try out, which one do you think will be a good fit to detect anomalies in the samples I shared above? Thanks!
-
Hi. I think for such large datasets you should use models that don't rely on memory banks. So I'd say you should avoid padim and patchcore (I think cfa could also be problematic, as well as dfm and dfkde, but I can't say for sure). You could try cflow, reverse distillation, efficient_ad and stfpm. It should be okay as long as the model doesn't use a memory bank, but still try different architectures, as some might work better than others.
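In the older config.yaml-driven workflow, switching the architecture is mostly a matter of changing the model name and using the matching config file. The key layout below is an assumption based on older anomalib configs and may not match newer versions:

```yaml
model:
  name: stfpm   # memory-bank-free alternatives: cflow, reverse_distillation, efficient_ad
```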
-
Thanks for the suggestion @blaz-r. I'll post the results on this thread.
-
Hi @blaz-r, you said to use different models if the dataset is quite large, so padim is not the best option. But what if I have a large dataset and I'm interested in doing real-time inference? Are models like efficient_ad a good option? I ask because padim is relatively fast at inference, since it uses the CNN only to extract image features. Thanks!
-
Hi, the images in my dataset come from line-scan cameras and have a very high aspect ratio, e.g. 200x5000 (w x h). If I set image_size to 256 in config.yaml, the image gets deformed. How do I maintain the aspect ratio for these kinds of images?
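One generic option (not anomalib-specific) is a letterbox resize: scale the long side down to image_size and pad the short side, which keeps the aspect ratio at the cost of a lot of unused padding. For ratios as extreme as 200x5000, pre-tiling along the long axis, as discussed earlier in the thread, usually preserves far more detail. A minimal NumPy sketch of the letterbox idea, using nearest-neighbour sampling to avoid extra dependencies:

```python
import numpy as np

def letterbox(image: np.ndarray, size: int = 256) -> np.ndarray:
    """Resize an (H, W, C) image so its longer side equals `size`, keeping the
    aspect ratio, then pad the shorter side to produce a square canvas."""
    h, w, c = image.shape
    scale = size / max(h, w)
    new_h = max(1, round(h * scale))
    new_w = max(1, round(w * scale))
    # Nearest-neighbour resize by index sampling.
    ys = np.clip((np.arange(new_h) / scale).astype(int), 0, h - 1)
    xs = np.clip((np.arange(new_w) / scale).astype(int), 0, w - 1)
    resized = image[ys][:, xs]
    canvas = np.zeros((size, size, c), dtype=image.dtype)
    canvas[:new_h, :new_w] = resized
    return canvas

# A 200x5000 (w x h) line-scan frame: height 5000, width 200.
frame = np.ones((5000, 200, 3), dtype=np.uint8)
square = letterbox(frame, size=256)
print(square.shape)  # (256, 256, 3)
```

Note that here the 200-pixel side shrinks to roughly 10 pixels, which is why tiling the tall image into square patches first is usually the better fit for line-scan data.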