Skip to content

Arctique #30

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 10 commits into from
Jan 20, 2025
Merged

Arctique #30

merged 10 commits into from
Jan 20, 2025

Conversation

ClaudiaWinklmayr
Copy link
Contributor

What is the purpose of this PR?

To add the Arctique dataset to the collection of datasets.

How did you implement your changes?

Torch-dataset that accesses the Arctique data.

How did you test your changes?

pytest-tests or all functions of the dataset-class.

Remaining issues

@FabianRei FabianRei self-assigned this Jan 20, 2025
Copy link
Contributor

@FabianRei FabianRei left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. We might want to init the dataset with a path to the full dataset (no train/test/val split, as our goal lmdb file should encompass full arctique with train/test/val split then set via a .csv file which should look like this:

sample_name,train_test_val_split
reg001_A.hdf,test
reg001_B.hdf,test
reg002_A.hdf,train
reg002_B.hdf,train
reg003_A.hdf,valid
reg003_B.hdf,valid
reg004_A.hdf,train
reg004_B.hdf,train
reg005_A.hdf,test
reg005_B.hdf,test
reg006_A.hdf,train
reg006_B.hdf,train

"""
Initializes the ArctiqueDataset with the given local path.
The three PanNuke dataset is located on the /fast file system on the MDC cluster under the path
'/fast/AG_Kainmueller/synth_unc_models/data/v_review_sample1500_NOISY'.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good doc


print("LOCAL PATH", local_path)

self.images_folder = os.path.join(local_path, f'train/images/')
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe, local path is 'full' instead of 'train' and e.g. images_folder is than only 'images/'?


sample_ID = self.sample_IDs[idx]

data = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good

@ClaudiaWinklmayr ClaudiaWinklmayr merged commit 9fe04cd into main Jan 20, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants