SLEP016: parameter spaces on estimators #62

jnothman · 2021-11-30T11:07:40Z

No description provided.

amueller

That was quick! Some nitpicks and one question about allowing __. I think there are two important limitations: dependencies within an estimator and sharing parameters across estimators.
Basically this model assumes parameters and estimator grids are one-to-one.

slep014/proposal.rst

jnothman

Thanks a lot for that speedy and detailed first pass @amueller

slep014/proposal.rst

Co-authored-by: Andreas Mueller <t3kcit@gmail.com>

jnothman · 2022-03-18T00:53:00Z

Ready for review!

adrinjalali

I think I prefer the Searchgrid design, but I'm taking your word on people preferring this design to that one.

slep016/proposal.rst

jnothman · 2022-03-19T10:50:30Z

Thanks for the prompt review @adrinjalali. At this point I'm not sure what to update based on your comments, but I am curious to understand better what you find attractive about the Searchgrid API relative to this.

jnothman · 2022-03-23T05:03:12Z

Note to self: an alternative here might be a way to alias deep parameters at the metaestimator level...

jnothman · 2022-03-27T13:04:49Z

Thanks for the review @amueller. I hope I find clarity of mind and time soon to make edits.

Overall, do you feel like we should have separate set_grid and set_distrib, or a combined set_param_space whose rvs will be rejected by a grid search?

One thought that comes to mind is how to make this something that usefully integrates with external hyperparameter optimisation tools. I mean it may be their responsibility to do so, but I wonder how we assist in their support.

aidiss · 2023-07-21T13:05:52Z

I would like to propose an alternative approach.
The core idea is to use a mapping between an object and its space, then iterate through the given pipeline to build the parameter grid.

    pipe = make_pipeline(DecisionTree())
    object_param_grid = {DecisionTree: {"max_depth": [1, 2]}}
    create_param_grid(pipe, object_param_grid) == {'decisiontree__max_depth': [1, 2]}

This is useful when the pipeline is not complex and there are no instances that use different params.

It is possible to change this a bit and use an instance_to_space mapping.

    tree = DecisionTree()
    pipe = make_pipeline(tree)
    object_param_grid = {tree: {"max_depth": [1, 2]}}
    create_param_grid(pipe, object_param_grid) == {'decisiontree__max_depth': [1, 2]}

This would add more granularity, but would require instantiating transformers/estimators before generating the grid.

Here is a source that achieves object_to_paramgrid mapping.

from sklearn.compose import ColumnTransformer
from sklearn.pipeline import FeatureUnion, Pipeline

def _get_steps(object_):
    """Retrieve steps/transformers from Pipeline, FeatureUnion or ColumnTransformer objects."""
    object_to_step_name = {
        Pipeline: "steps",
        FeatureUnion: "transformer_list",
        ColumnTransformer: "transformers",
    }

    if step_name := object_to_step_name.get(object_.__class__):
        return list(getattr(object_, step_name))
    # Anything else is treated as a leaf with no children.
    return []

def resolve(parent_object, object_param_grid, path="", param_grid=None):
    if param_grid is None:
        param_grid = {}

    # Each entry is (name, estimator) or, for ColumnTransformer, (name, estimator, columns).
    for child_name, child_object, *_ in _get_steps(parent_object):
        # Join only the non-empty parts, so top-level names get no leading "__".
        full_path = "__".join(part for part in (path, child_name) if part)

        if object_params := object_param_grid.get(child_object.__class__):
            flattened_param_grid = {f"{full_path}__{k}": v for k, v in object_params.items()}
            param_grid = flattened_param_grid | param_grid

        # Recurse with the "__"-joined path so nested parameter names stay valid.
        param_grid = resolve(child_object, object_param_grid, path=full_path, param_grid=param_grid)
    return param_grid
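
The instance-keyed variant described above (keying the mapping by estimator instances rather than by classes) might look roughly like the sketch below. It is an illustration only: `Step`, `Composite` and `resolve_by_instance` are hypothetical stand-ins, chosen so that the block runs without scikit-learn, and they are not part of the code posted in this comment.

```python
class Step:
    """Hypothetical stand-in for a leaf estimator (no scikit-learn needed)."""

    def __init__(self, **params):
        self.params = params


class Composite:
    """Hypothetical stand-in for Pipeline: holds (name, estimator) pairs."""

    def __init__(self, steps):
        self.steps = steps


def resolve_by_instance(parent, instance_to_space, path=""):
    """Build a flat ``name__param`` grid from a mapping keyed by instance."""
    param_grid = {}
    for name, child in getattr(parent, "steps", []):
        full_path = f"{path}__{name}" if path else name
        # Identity lookup: only this exact object receives the space.
        for key, space in instance_to_space.items():
            if key is child:
                for param, values in space.items():
                    param_grid[f"{full_path}__{param}"] = values
        # Recurse so that nested composites are covered too.
        param_grid.update(resolve_by_instance(child, instance_to_space, full_path))
    return param_grid


shallow = Step(max_depth=1)
deep = Step(max_depth=10)
pipe = Composite([("outer", Composite([("a", shallow), ("b", deep)]))])

# Two instances of the same class, but only ``deep`` gets a search space.
grid = resolve_by_instance(pipe, {deep: {"max_depth": [8, 16]}})
```

Because the lookup is by identity, two steps of the same class can carry different spaces, which is the extra granularity mentioned above.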

jnothman · 2023-12-27T03:51:12Z

I like several aspects of that proposal, @aidiss, though I'm not sure I'd worry about supporting classes in the first version. Specifically:

- The entire parameter grid can be defined explicitly in one place in user code, or can be assembled from multiple places quite easily.
- The effect of this change could be localised to *SearchCV implementations, with such a dict being a drop-in replacement for the current param_grid or param_distributions. Therefore the change would not require a SLEP.
- It may get rid of the need for a specialised design to handle RVs vs grids.

One thing I'm not sure about is how to manage errors. If, for instance, the user were to clone the estimator, we'd suddenly have no grid for it if it were defined in terms of instances, and that would not be obvious to the user. On the other hand, I think this is a flaw in all or several of the proposed designs.
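
The failure mode described here can be reproduced in a few lines. This is a minimal sketch under stated assumptions: `Tree` is a hypothetical stand-in for an estimator, and `copy.deepcopy` stands in for `sklearn.base.clone`, since both return a new object rather than the original.

```python
import copy


class Tree:
    """Hypothetical stand-in for an estimator."""

    def __init__(self, max_depth=None):
        self.max_depth = max_depth


tree = Tree()
instance_to_space = {tree: {"max_depth": [1, 2]}}

# copy.deepcopy stands in for sklearn.base.clone: both return a new object.
cloned = copy.deepcopy(tree)

original_space = instance_to_space.get(tree)    # found: keyed by this object
cloned_space = instance_to_space.get(cloned)    # None: the copy is a different key
```

No error is raised for the copy; the space silently disappears, which is why this would not be obvious to the user.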

I'll otherwise need to think about whether there are aspects of searchgrid capability that could not be reproduced with this approach.

jnothman · 2023-12-29T00:41:58Z

I've recalled, @aidiss, that your suggestion is pretty much the same thing that I proposed in scikit-learn/scikit-learn#21784, although I admit that the .set interface is not particularly pleasant.

jnothman · 2023-12-29T01:01:50Z

> I think I prefer the Searchgrid design, but I'm taking your word on people preferring this design to that one.

I wonder if you meant the GridFactory design, @adrinjalali? Searchgrid secretly sets attributes on estimators, so it's not a great functional/OOP design.

jnothman · 2023-12-29T01:21:16Z

I'd be keen to merge this and then have the team decide if it's the right proposal, or if we should go down one of the SLEP-free paths (e.g. GridFactory, or a lighter API variant of it like above).

We have a lot of the implementation already, and mostly need to come to a decision.

@adrinjalali interested in reviewing for merge?

adrinjalali

Apart from 2 reservations, I quite like this now.

adrinjalali · 2024-02-01T15:51:42Z

slep016/proposal.rst

+These could be combined into a single method, such that
+:class:`~sklearn.model_selection.GridSearchCV` rejects a call to `fit` where `rvs`
+appear. This would make it harder to predefine search spaces that could be used
+for either exhaustive or randomised searches, which may be a use case in Auto-ML.


In those cases, they'd need to have separate calls to set_search_grid and set_search_rvs anyway (unless I'm misunderstanding something). So in terms of practicality, they can still create two instances of the pipeline, one setting RVs and one setting the grid, but via a single set_search_space method.

What I'm trying to communicate here is that if we had a single set_search_space, an AutoML library that is set up to call it would have to choose between setting RVs and grids. But this presumes a lot of things about how an AutoML library using this API might look.

Not sure if I should change anything here.

Inserted that clarification into the text.
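
To make the "single method, grid search rejects RVs" option concrete, here is a rough sketch of the check an exhaustive search would need. It is not the SLEP's implementation: `check_grid_only` and `UniformInt` are hypothetical names, and the test for a distribution (anything exposing an `rvs` method) is an assumption that follows the convention scikit-learn's randomised search already uses for `param_distributions`.

```python
import random


class UniformInt:
    """Hypothetical stand-in for a scipy.stats frozen distribution."""

    def __init__(self, low, high):
        self.low, self.high = low, high

    def rvs(self, random_state=None):
        return random.Random(random_state).randint(self.low, self.high)


def check_grid_only(space):
    """Raise if any parameter is a distribution rather than a finite list."""
    offending = [name for name, values in space.items() if hasattr(values, "rvs")]
    if offending:
        raise ValueError(f"Exhaustive search cannot enumerate: {sorted(offending)}")
    return space


grid_space = {"max_depth": [1, 2, 3]}
mixed_space = {"max_depth": [1, 2, 3], "min_samples_leaf": UniformInt(1, 20)}

check_grid_only(grid_space)  # passes silently
try:
    check_grid_only(mixed_space)
    rejected = False
except ValueError:
    rejected = True  # a combined space containing RVs is refused
```

With this design, a caller that wants one predefined space for both exhaustive and randomised search has to avoid RVs entirely, which is the trade-off discussed in this thread.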

adrinjalali · 2024-02-01T15:52:48Z

slep016/proposal.rst

+Another possible alternative is to have `set_search_grid` update rather than
+replace the existing search space, to allow for incremental construction. This is
+likely to confuse users more than help.


isn't this your proposal now?
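
The difference between the two semantics in question can be seen in a toy example. This sketch takes no position on which one the SLEP adopts: `ToyEstimator` and both method names are hypothetical and exist only to contrast replacing with updating.

```python
class ToyEstimator:
    """Hypothetical estimator; not the SLEP's implementation."""

    def __init__(self):
        self._search_grid = {}

    def set_search_grid_replace(self, **grid):
        # Replace semantics: each call discards what was set before.
        self._search_grid = dict(grid)
        return self

    def set_search_grid_update(self, **grid):
        # Update semantics: each call merges into the existing space.
        self._search_grid.update(grid)
        return self


replaced = (
    ToyEstimator()
    .set_search_grid_replace(C=[1, 10])
    .set_search_grid_replace(gamma=[0.1])
)
updated = (
    ToyEstimator()
    .set_search_grid_update(C=[1, 10])
    .set_search_grid_update(gamma=[0.1])
)
```

After the two chained calls, `replaced` searches only over `gamma`, while `updated` searches over both `C` and `gamma`.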

adrinjalali · 2024-02-01T16:40:21Z

slep016/proposal.rst

+    ).set_search_grid(reduce_dim=[
+        PCA(iterated_power=7).set_search_grid(n_components=N_FEATURES_OPTIONS),
+        SelectKBest().set_search_grid(k=N_FEATURES_OPTIONS),
+    ])


One issue here is that the user still needs to know that the first step is called "reduce_dim", which is a caveat you mentioned in the motivation section. I'm not sure how to improve this.

I think this is an improvement over what we have, but it still feels quite verbose maybe? I'm not sure.

The way to get around it is with more of a factory API for things like Pipeline, whether it's with a strict factory pattern:

    pipeline = (
        PipelineFactory()
        .add(
            grid=[
                PCA(iterated_power=7).set_search_grid(n_components=N_FEATURES_OPTIONS),
                SelectKBest().set_search_grid(k=N_FEATURES_OPTIONS),
            ]
        )
        .add(estimator=svc, name="classify")
    ).build()

or with a mutable Pipeline

    pipeline = Pipeline()
    pipeline.add(
        grid=[
            PCA(iterated_power=7).set_search_grid(n_components=N_FEATURES_OPTIONS),
            SelectKBest().set_search_grid(k=N_FEATURES_OPTIONS),
        ]
    )
    pipeline.add(svc, name="classify")

but these solutions are clearly orthogonal to the current proposal

Thinking of a factory reminds me of the work @koaning is doing in the playground project.

Was wondering what you'd think about this proposal related to your experiments for creating pipelines and how this could potentially fit there @koaning

I think you're referring to playtime? If so, that approach revolves around operators, so you might do stuff like:

pipeline = feats("age", "fare", "sibsp", "parch") + onehot("sex", "pclass")

That's meant to keep things super simple for folks with a modelling background but who are light on programming. I am consciously avoiding hyperparameters in my experiment there in an attempt to focus more on the data.

What folks are discussing here seems to be different. It seems to be a less "string"-y way to declare what hyperparams you would need to set in a gridsearch. So these two pieces of work seem to address different concerns.

That said, the string-y way of declaring things has never really bothered me that much. I usually found it much harder to deal with components that depend on each other. Things like "I want to impute values in my pipeline if the final estimator is a logistic regressor but I want no imputing when the final estimator is a histogram boosted model". Would this be something we could tackle here as well? I may be able to come up with better examples that relate directly to hyperparameters instead of including/excluding estimators, but this aspect of dependencies within a pipeline has always been the one thing I found hard to tackle with sklearn automatically.

Maybe something with PCA and SelectKBest?

make_pipeline(PCA(n1), SelectKBest(n2))

Suppose that PCA has components going from n1=1..10 and SelectKBest also has n2=1..10. Then you are going to hit an issue when there is only 1 PCA component but SelectKBest wants to select 10. So this might be a "nice" example of a dependency based on hyperparams.
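
One way to express that dependency today is to enumerate the combinations by hand and keep only the valid ones. The sketch below uses only the standard library; the key names assume the step names that `make_pipeline` would generate for `PCA` and `SelectKBest`, and the constraint `n2 <= n1` is the one described above.

```python
from itertools import product

n_components_options = range(1, 11)  # PCA(n_components=n1)
k_options = range(1, 11)             # SelectKBest(k=n2)

# SelectKBest cannot select more features than PCA produced, so keep n2 <= n1.
valid = [
    {"pca__n_components": [n1], "selectkbest__k": [n2]}
    for n1, n2 in product(n_components_options, k_options)
    if n2 <= n1
]

print(len(valid))  # 55 of the 100 combinations in the full grid
```

Each entry is a single-candidate grid, so the list has the shape of the list-of-dicts form that `param_grid` accepts. It works, but the constraint lives outside the estimators, which is the kind of dependency that is hard to declare automatically.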

adrinjalali

Two things that came to my mind this time:

- What happens to classes such as LassoCV? Do they take the grid from themselves?
- We have a bunch of classes which do a CV search, but not over parameters of the child class, like TunedThresholdClassifierCV. We should find a way to make it very clear and discoverable to users that they do NOT search over a potentially provided parameter grid.

I'm still not convinced by having both .._grid and .._rvs method pairs, but I wouldn't want that to be a blocker here to move forward.

I'd be happy to merge this soon-ish, request for comment, and get to a vote after.

adrinjalali · 2024-05-23T17:27:24Z

slep016/proposal.rst

+    ).set_search_grid(reduce_dim=[
+        PCA(iterated_power=7).set_search_grid(n_components=N_FEATURES_OPTIONS),
+        SelectKBest().set_search_grid(k=N_FEATURES_OPTIONS),
+    ])


Thinking of a factory reminds me of the work @koaning is doing in the playground project.

Was wondering what you'd think about this proposal related to your experiments for creating pipelines and how this could potentially fit there @koaning

adrinjalali · 2024-05-23T17:30:27Z

slep016/proposal.rst

+space to remain constant regardless of whether ``reduce_dim`` is a feature
+selector or a PCA.
+
+This SLEP proposes to add a methods to estimators that allow the user


We're adding 4 methods here, aren't we? But I'm not sure if we want to introduce them here or after the example below. I'm generally okay with the text as is. Maybe we just want to say here that we add 4 methods and leave the details for later.

adrinjalali · 2024-05-23T17:44:33Z

slep016/proposal.rst

+        Note that this parameter space has no effect when the estimator's own
+        ``fit`` method is called, but can be used by model selection utilities.


Since I was thinking about HalvingSearchCV with @glemaitre, there's something tricky I notice here:

HalvingSearchCV's resource parameter can be one of the estimator's parameters itself, which is on top of the param_grid parameter. It's clear how this proposal interacts with param_grid, but not clear to me how it would interact with resource there. And makes me think there might be some cases we're missing here?

adrinjalali · 2024-05-23T17:48:31Z

slep016/proposal.rst

+we might need to store the candidate grids and distributions in a known instance
+attribute, or use a combination of `get_grid`, `get_distribution`, `get_params`
+and `set_search_grid`, `set_search_rvs` etc. to perform `clone`.


In the meantime, we have the configurable clone and also this for metadata routing:

    try:
        new_object._metadata_request = copy.deepcopy(estimator._metadata_request)
    except AttributeError:
        pass

So I think we can remove this paragraph since it's a non-issue at this point?

adrinjalali · 2024-05-23T17:51:57Z

slep016/proposal.rst

+``cv_results_`` is similarly affected by large changes to its keys when small
+changes are made to the composite model structure. Future work could provide
+tools to make ``cv_results_`` more accessible and invariant to model structure.


Since this new way of providing the grid requires an explicit act by the user, namely changing the value of param_grid passed to search objects, I think we can change the structure of cv_results_ for those cases at the same time?

jnothman added 2 commits November 30, 2021 22:06

Partial draft of SLEP014: parameter spaces on estimators

c619fdc

typos

79b67ff

amueller reviewed Nov 30, 2021

jnothman commented Dec 1, 2021

jnothman and others added 5 commits December 1, 2021 13:46

Apply suggestions from code review

85e064b

Co-authored-by: Andreas Mueller <t3kcit@gmail.com>

Add some discussion points from Andy's code review

046f108

Reintroduce edits after poor merge

239ff8a

Add docstring for set_grid

8541950

Attempt to complete Implementation section

05be687

jnothman changed the title ~~Partial draft of SLEP014: parameter spaces on estimators~~ Partial draft of SLEP016: parameter spaces on estimators Feb 2, 2022

jnothman added 2 commits February 2, 2022 15:05

Correct SLEP number to 016

f39e095

Complete draft of the SLEP

e6c61c4

adrinjalali reviewed Mar 18, 2022

jnothman changed the title ~~Partial draft of SLEP016: parameter spaces on estimators~~ SLEP016: parameter spaces on estimators Mar 19, 2022

Joel Nothman and others added 2 commits December 29, 2023 11:47

Merge remote-tracking branch 'upstream/main' into slep014-search-spaces

7f3df10

Comment on aidss approach

45adda4

jnothman added 2 commits December 29, 2023 12:22

address review comments

748fa31

address reviews

0c946c7

jnothman requested review from adrinjalali, amueller and glemaitre December 29, 2023 01:31

adrinjalali reviewed Feb 1, 2024

Add clarifcation

bb56429

jnothman requested a review from adrinjalali March 20, 2024 22:32

adrinjalali reviewed May 23, 2024
