Training from custom dataloaders / torch dataset #2261
Unanswered
luizfelippesr asked this question in Q&A
Replies: 3 comments · 1 reply
- This might be related to #2254.
- Related: #2034
- This is quite unusual, and I'm not sure what would be causing this. Are you running out of RAM or GPU RAM?
Original question (luizfelippesr):
Anomalib works natively with custom Lightning DataModule objects, with the possibility of creating them from files on the local file system (the FolderDataset). Is there a recommended way of using a bespoke torch dataset/dataloader instead?
I have a particular use case where part of the data can be stored on other devices and retrieved on-the-fly (and possibly cached) by an adaptor written as a torch dataset. Also, I often prefer pre-splitting my data into train/val/test sets and storing them separately, to avoid the possibility of later contamination (and to allow things like extending only the training set with a few extra examples).
Since the training engine uses Lightning, it should be possible to simply use
engine.fit(..., train_dataloaders=my_dataloader)
However, this appears to give different results. For example, if I take the example run [code block not shown in this extract], it finishes quickly on my system (using anomalib 1.1.0). If I modify it in the following way, which should be equivalent [code block not shown in this extract], I quickly run out of memory.
Any ideas?
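
For reference, here is a minimal sketch of what such a bespoke-dataloader setup could look like. It is an assumption-based illustration, not a documented anomalib recipe: the dict keys ("image", "label") are assumed to mirror what anomalib's own 1.x datasets (e.g. FolderDataset) return for training, and fetch_image, RemoteTorchDataset, and the sample ids are hypothetical stand-ins for the on-the-fly adaptor described in the question.

```python
# Sketch only: a bespoke torch Dataset handed straight to Engine.fit via
# train_dataloaders. Verify the expected batch keys against your anomalib version.
import numpy as np
import torch
import torch.nn.functional as F
from torch.utils.data import DataLoader, Dataset

from anomalib.engine import Engine
from anomalib.models import Padim


def fetch_image(sample_id):
    """Hypothetical adaptor: fetch one image from a remote device (and cache it).

    Replaced here by a random array so the sketch is self-contained.
    """
    rng = np.random.default_rng(sample_id)
    return rng.integers(0, 255, size=(256, 256, 3), dtype=np.uint8)


class RemoteTorchDataset(Dataset):
    """Bespoke torch dataset returning dict samples, as anomalib models consume them."""

    def __init__(self, sample_ids, image_size=(256, 256)):
        self.sample_ids = list(sample_ids)  # e.g. keys/paths on the remote store
        self.image_size = image_size

    def __len__(self):
        return len(self.sample_ids)

    def __getitem__(self, index):
        image = fetch_image(self.sample_ids[index])                    # HxWx3 uint8
        image = torch.from_numpy(image).permute(2, 0, 1).float() / 255.0
        image = F.interpolate(
            image.unsqueeze(0), size=self.image_size, mode="bilinear", align_corners=False
        ).squeeze(0)
        # The training split contains only normal samples, hence label 0.
        return {"image": image, "label": torch.tensor(0)}


if __name__ == "__main__":
    train_loader = DataLoader(
        RemoteTorchDataset(sample_ids=range(64)),
        batch_size=8,
        shuffle=True,
        num_workers=2,
    )

    model = Padim()
    engine = Engine()

    # Lightning-style call: hand the dataloader straight to fit()
    # instead of building a DataModule.
    engine.fit(model=model, train_dataloaders=train_loader)
```

One caveat with a bare dataloader like this: anomalib's DataModules also handle model-specific transforms and supply the validation images used by the normalization/thresholding callbacks, so fit() with only train_dataloaders may not behave identically to the DataModule-based examples. The exact batch keys and transform handling are worth checking against the anomalib version in use.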