Open
Conversation
support binary/multi-class. included tests (100% cov)
documentation - more informative docstrings - explanation of when/when not to use rebalanced LOGO _iter_test_masks/get_n_splits - implementation now almost identical to LeaveOneGroupOut - uses check_array() for validation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Adds
RebalancedLeaveOneGroupOut, a leave-one-group-out cross-validator that rebalances the training set so every fold has the same class balance, matching the behavior of the other RebalancedCV splitters when splitting by groups.Implementation: For each fold, one group is left out as the test set and the rest form the training set. The training set is then subsampled so that every fold has the same number of samples per class using the minimum/smallest per-class count across folds (we compute the smallest number of elements a given class will have across folds by subtracting the max number of elements of that class across all groups from the number of elements of that class).
Support binary/multi-class labels
Passes all tests locally (100% cov)