-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
0.3.5 #345
Merged
Merged
0.3.5 #345
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Collaborator
sigmafelix
commented
Jul 16, 2024
- Base learner targets (draft; should be changed in accordance with Big Data Considerations #325)
- Download targets
- Post-checkout hooks
- Utility functions for the new targets
- attach_xy: Retaining all columns - generate_cv_index: Retaining all columns
- path setting is changed to if-else - pipeline error failing mode to "abridge"
- removing irrelevant arguments passed to terra::rast
- Branching base learner fitting
- shared interface for branching cv set generation
- Pipeline base learner: 3 CV strategies and hyperparameter tuning targets were added - fit_base_* function get `return_best` and `tune_bayes_iter` for `workflow` compatibility - prepare_cvindex assigns fold ids using function names when spatialsample functions are used - restore_rset_full: as speed-up and disk saving measures - rset objects are generated based on essential coordinates only; this function restores full data for subsequent steps - TODO: size issues persist in hyperparamter tuning and identifying the best model as the entire workflow should be saved. rsample always saves training/test data. - TODO: duplicates in features; identify where the duplicates come
- Data size reduction for memory / storage management: added trim_resamples argument in fit_base_* - make_subdata: bootstrapping (currently 30%) - restore_fit_best: restore full data with CV rsample rset objects, extract the best tuned results, then fit the data - Dealing with nested list in tibbles from tidymodels workflow/hyperparameter tuning - TODO/Q: Do we save fitted model object or just keep predictions?
- set_args_download and feature_raw_download are written - targets_download.R is revised to reflect the structure of the two functions
- LICENSE file gets 554
- set_args_download update - _targets.R update to generate arglist_download
- README.md update - setup_hook.sh is capable of immediately activating permission change
- targets_download.R: duplicate target names - Roxygen2 documentation typo fix in fit_base_lightgbm
@sigmafelix It looks like |
@kyle-messier I merged |
kyle-messier
approved these changes
Jul 16, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.