Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

This repository has been archived by the owner on May 6, 2023. It is now read-only.

AFAgarap / pt-datasets Public archive

Notifications You must be signed in to change notification settings
Fork 3
Star 6

Code
Issues 3
Pull requests 1
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: AFAgarap/pt-datasets

Releases · AFAgarap/pt-datasets

Add random oversampling

19 Feb 12:02

AFAgarap

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23

Expired

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Add random oversampling

Add option to use a simple random over sampling instead of using SMOTE alone.

Assets 2

Loading

All reactions

Rename datasets directory

18 Feb 11:24

AFAgarap

Compare

Choose a tag to compare

Loading

Rename datasets directory

Use datasets directory instead of torch_datasets.

Assets 2

Loading

All reactions

Fix transformation pipeline

14 Feb 15:04

AFAgarap

Compare

Choose a tag to compare

Loading

Fix transformation pipeline

Fix the transformation pipeline for the following datasets:

SVHN
KMNIST
COVID19 (binary and non-binary)

Assets 2

Loading

All reactions

Fix augmentation pipeline

13 Feb 04:14

AFAgarap

Compare

Choose a tag to compare

Loading

Fix augmentation pipeline

Rearranged the augmentation transform pipeline.

Assets 2

Loading

All reactions

Fix data augmentation

13 Feb 04:04

AFAgarap

Compare

Choose a tag to compare

Loading

Fix data augmentation

Augment the training set of *MNIST datasets only.

Assets 2

Loading

All reactions

Add preprocessing batch size for COVID19 dataset

09 Feb 07:21

AFAgarap

Compare

Choose a tag to compare

Loading

Add preprocessing batch size for COVID19 dataset

When loading the COVID19 datasets, we can now specify the batch size to use when we preprocess it. This is to avoid the memory exhaustion due to huge tensors.

Assets 2

Loading

All reactions

Add oversampling function

04 Feb 15:07

AFAgarap

Compare

Choose a tag to compare

Loading

Add oversampling function

Oversample minority class using SMOTE

Assets 2

Loading

All reactions

Load preprocessed COVID19 dataset

01 Feb 05:02

AFAgarap

Compare

Choose a tag to compare

Loading

Load preprocessed COVID19 dataset

Features

Preprocess dataset if it does not exist yet.
Load the preprocessed dataset using the same dataset classes for COVID19 dataset.
Specify image size for preprocessing.

Assets 2

Loading

All reactions

Resolve dependency issues

30 Jan 02:42

AFAgarap

Compare

Choose a tag to compare

Loading

Resolve dependency issues

Fix setup issues with tsnecuda.
Add cmake and opency-python to list of dependencies.

Assets 2

Loading

All reactions

Add support for preprocessed COVID19 datasets

28 Jan 04:06

AFAgarap

Compare

Choose a tag to compare

Loading

Add support for preprocessed COVID19 datasets

Features

Normalize WDBC features.
Add preprocessor module for COVID19 datasets, which can be used for resizing the dataset images and exporting them together with the labels to a .pt file.
Add class for preprocessed COVID19 datasets.

Bug fixes

Convert WDBC features data type to float32.
Pack the test features and test labels for WDBC since what were being packed before was a tuple of test labels.
Convert COVID19 datasets labels to int64.

Assets 2

Loading

All reactions

Previous 1 2 3 4 5 6 Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.