[GSOC] `hyperopt` suggestion service logic update #2412

shashank-iitbhu · 2024-08-21T22:01:36Z

What this PR does / why we need it:

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #2374

Checklist:

Docs included if any changes are user facing

tenzen-y · 2024-08-22T18:39:38Z

/area gsoc

andreyvelich

Thank you for this @shashank-iitbhu!
I left a few comments

.github/workflows/e2e-test-pytorch-mnist.yaml

examples/v1beta1/hp-tuning/hyperopt-distribution.yaml

andreyvelich · 2024-09-02T21:53:07Z

pkg/apis/manager/v1beta1/api.proto

-    NORMAL = 2;
-    LOG_NORMAL = 3;
-    DISTRIBUTION_UNKNOWN = 4;
+    DISTRIBUTION_UNKNOWN = 0;


Please keep the same name as for parameter_type

Suggested change

DISTRIBUTION_UNKNOWN = 0;

UNKNOWN_DISTRIBUTION = 0;

Suggested change

DISTRIBUTION_UNKNOWN = 0;

DISTRIBUTION_UNSPECIFIED = 0;

I would like to select the UNSPECIFIED suffix here.
Please see: https://google.aip.dev/126

Makes sense. @tenzen-y should we rename the other gRPC parameters to UNSPECIFIED?

Changing a released gRPC API means losing backward compatibility.
So I would like to keep the existing names for the already released Protocol Buffers API. @andreyvelich WDYT?

Since these gRPC APIs are not exposed to end users, do you still think that we should not change the existing APIs?
A change only affects users who build their own Suggestion service.

> Since these gRPC APIs are not exposed to end users, do you still think that we should not change the existing APIs?

Almost correct. Additionally, users who keep using removed Suggestion Services, such as the Chocolate Suggestion, face the same problem.

So, can we collect feedback on the dedicated issue outside of here?

Sure, let's follow up on this in the issue, and rename it after a few months if we don't get any feedback.
@shashank-iitbhu Could you please create an issue to track it?

> Sure, let's follow up on this in the issue, and rename it after a few months if we don't get any feedback. @shashank-iitbhu Could you please create an issue to track it?

Sure, I will create a separate issue to track the renaming of other gRPC parameters to UNSPECIFIED.
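For readers following the naming thread above, here is a minimal Python sketch of why the zero value matters. In proto3, an enum field that is not set reads as the value numbered 0, which is why AIP-126 asks for an `_UNSPECIFIED = 0` entry. The `Distribution` class and the `resolve` helper are hypothetical stand-ins for illustration, not the generated Katib code, and the numbering of the non-zero values is an assumption.

```python
from enum import IntEnum


class Distribution(IntEnum):
    """Hypothetical Python mirror of the proto enum discussed in this thread."""

    DISTRIBUTION_UNSPECIFIED = 0  # zero value: what an unset proto3 enum field reads as
    UNIFORM = 1  # assumed numbering, for illustration only
    LOG_UNIFORM = 2
    NORMAL = 3
    LOG_NORMAL = 4


def resolve(raw_value: int) -> Distribution:
    """Map a wire value to the enum, treating unrecognized numbers as unspecified."""
    try:
        return Distribution(raw_value)
    except ValueError:
        return Distribution.DISTRIBUTION_UNSPECIFIED
```

With this layout, a message that never sets the field resolves to `DISTRIBUTION_UNSPECIFIED` instead of silently meaning a real distribution such as uniform.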

pkg/controller.v1beta1/suggestion/suggestionclient/suggestionclient.go

test/unit/v1beta1/suggestion/test_hyperopt_service.py

andreyvelich · 2024-09-02T21:57:32Z

pkg/suggestion/v1beta1/hyperopt/base_service.py

-                )
-            elif param.type == DOUBLE:
-                hyperopt_search_space[param.name] = hyperopt.hp.uniform(
+                hyperopt_search_space[param.name] = hyperopt.hp.uniformint(


If the parameter is int, why can't we support other distributions like lognormal?

Distributions like uniform, quniform, loguniform, and normal return float values. They are designed to sample from a range that can take any real (float) value, which might not make sense when we are looking for an integer.
That said, we can definitely add support for these distributions when the parameter is int as well. Should we do this?

@tenzen-y @kubeflow/wg-training-leads @shashank-iitbhu Should we round this float value to int if the user wants to use this distribution with the int parameter type?

> @tenzen-y @kubeflow/wg-training-leads @shashank-iitbhu Should we round this float value to int if the user wants to use this distribution with the int parameter type?

SGTM.
Users can specify the double parameter type if they want more exact values.
However, it would be better to document this restriction for the int parameter type.

pkg/suggestion/v1beta1/hyperopt/base_service.py

shashank-iitbhu · 2024-09-19T20:24:59Z

@tenzen-y I have added two new parameters, weight_decay and dropout_rate, to the Hyperopt example and passed them to mnist.py, but I haven't used them in the Net class yet in the train and test functions. If you check the logs for this e2e test, the maximum value of the loss metrics is an enormously large number. I can't figure out what I'm missing. Also tested this locally.

examples/v1beta1/hp-tuning/hyperopt-distribution.yaml

shashank-iitbhu · 2024-09-22T15:07:33Z

@tenzen-y

katib/pkg/suggestion/v1beta1/hyperopt/base_service.py

Lines 265 to 266 in 867c40a

    
    if param.type == INTEGER:
        assignments.append(Assignment(param.name, int(vals[param.name][0])))

Here, the float values sampled from the distribution are converted to int for the INT parameter type.
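One detail worth keeping in mind when reading the quoted lines: Python's `int()` truncates toward zero, while `round()` rounds to the nearest integer (ties go to the even neighbor). The sketch below only illustrates that difference with plain floats; the helper names are hypothetical and are not part of `base_service.py`.

```python
def to_int_truncate(sample: float) -> int:
    """Convert the way the quoted lines do: drop the fractional part."""
    return int(sample)


def to_int_round(sample: float) -> int:
    """Alternative conversion: round to the nearest integer (half to even)."""
    return round(sample)


# A sample just below 48 lands on different integers depending on the method.
print(to_int_truncate(47.9), to_int_round(47.9))  # 47 48
```

Truncation shifts every sample down by up to one unit, which may or may not matter for coarse parameters such as a batch size.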

tenzen-y · 2024-09-22T15:12:27Z

pkg/suggestion/v1beta1/hyperopt/base_service.py

+                    log_min = math.log(float(param.min))
+                    log_max = math.log(float(param.max))


Shouldn't we use fixed values when min and max are scalars, the same as Nevergrad does?

Yes, we can, but in Nevergrad that is an edge case for when min and max are not defined:

    elif isinstance(param, (p.Log, p.Scalar)):
        if (param.bounds[0][0] is None) or (param.bounds[1][0] is None):
            if isinstance(param, p.Scalar) and not param.integer:
                return hp.lognormal(label=param_name, mu=0, sigma=1)

For example,

    - name: batch_size
      parameterType: int
      feasibleSpace:
        min: "32"
        max: "64"
        distribution: "logNormal"

The above parameter will be sampled from a log-normal distribution with mu=3.8123 and sigma=0.3465, which are calculated by putting min=32 and max=64 into our code. E(X) is the mean of that distribution, which is 48 in our case.
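The quoted numbers can be approximately reproduced with a short sketch. It assumes that mu is the midpoint of log(min) and log(max) and that sigma is half of the log range; this assumption matches the values in the comment, but the formula in the merged code may differ. The function names are hypothetical. The mean uses the standard log-normal identity E(X) = exp(mu + sigma^2 / 2).

```python
import math
from typing import Tuple


def lognormal_params(min_value: float, max_value: float) -> Tuple[float, float]:
    """Derive (mu, sigma) from the bounds, under the assumption stated above."""
    log_min = math.log(min_value)
    log_max = math.log(max_value)
    mu = (log_min + log_max) / 2  # midpoint in log space
    sigma = (log_max - log_min) / 2  # half of the log range (assumed)
    return mu, sigma


def lognormal_mean(mu: float, sigma: float) -> float:
    """Mean of a log-normal distribution: exp(mu + sigma^2 / 2)."""
    return math.exp(mu + sigma**2 / 2)


mu, sigma = lognormal_params(32, 64)
mean = lognormal_mean(mu, sigma)
print(f"mu={mu:.4f} sigma={sigma:.4f} mean={mean:.2f}")
```

Under this assumption the mean comes out slightly above 48, consistent with the E(X) value mentioned in the comment.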

That makes sense.
In that case, could you also handle the cases where min or max is not specified, as Nevergrad does?

https://github.com/facebookresearch/nevergrad/blob/a2006e50b068fe598e0f3d7dab9c9bcf6cf97e00/nevergrad/optimization/externalbo.py#L61-L64
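A minimal sketch of the requested fallback, assuming the same default that the Nevergrad snippet quoted earlier uses (mu=0, sigma=1) whenever a bound is missing. The function name and the bounded-case formula are assumptions for illustration; this is not the code that was merged.

```python
import math
from typing import Tuple


def lognormal_params_or_default(min_value: str, max_value: str) -> Tuple[float, float]:
    """Return (mu, sigma) for a log-normal parameter.

    Bounds arrive as strings, as in the feasibleSpace examples in this thread.
    If either bound is empty, fall back to mu=0, sigma=1 (assumed default).
    """
    if not min_value or not max_value:
        return 0.0, 1.0
    log_min = math.log(float(min_value))
    log_max = math.log(float(max_value))
    # Assumed derivation: midpoint and half-range in log space.
    return (log_min + log_max) / 2, (log_max - log_min) / 2


print(lognormal_params_or_default("32", ""))  # (0.0, 1.0)
```

Handling the empty-string case before calling `float()` avoids a `ValueError` on a missing bound.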

@shashank-iitbhu This is still pending.

katib/pkg/webhook/v1beta1/experiment/validator/validator.go

Lines 287 to 290 in 867c40a

    if param.FeasibleSpace.Max == "" && param.FeasibleSpace.Min == "" {
        allErrs = append(allErrs, field.Required(parametersPath.Index(i).Child("feasibleSpace").Child("max"),
            fmt.Sprintf("feasibleSpace.max or feasibleSpace.min must be specified for parameterType: %v", param.ParameterType)))
    }

The webhook validator requires feasibleSpace.max or feasibleSpace.min to be specified.

But when either min or max is empty, this validation does not reject the request, right?
So, shouldn't we implement the special case in the Suggestion Service?

Yes, the validation webhook does not reject the request when either min or max is empty. But I created an example where:

    - name: batch_size
      parameterType: int
      feasibleSpace:
        min: "32"
        distribution: "logNormal"

With this, the experiment is created, but the suggestion service does not sample any values, so the trials do not run, even though I handled this case (when either min or max is not specified) in pkg/suggestion/v1beta1/hyperopt/base_service.py.
Do we need to check the experiment_defaults.go file?
https://github.com/kubeflow/katib/blob/867c40a1b0669446c774cd6e770a5b7bbf1eb2f1/pkg/apis/controller/experiments/v1beta1/experiment_defaults.go
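To make the validator behavior discussed in this thread concrete, the condition from the quoted Go lines can be mirrored as a small predicate. This is only an illustrative translation of that one `if` condition into Python; the function name is hypothetical.

```python
def webhook_rejects(min_value: str, max_value: str) -> bool:
    """Mirror of the quoted check: reject only when BOTH bounds are empty."""
    return max_value == "" and min_value == ""


# Only the last combination is rejected; a single missing bound passes through.
for min_value, max_value in [("32", "64"), ("32", ""), ("", "64"), ("", "")]:
    print(repr(min_value), repr(max_value), webhook_rejects(min_value, max_value))
```

Because the condition uses a logical AND, a request with only `min: "32"` is accepted, so the Suggestion service still has to cope with one missing bound.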

test/unit/v1beta1/suggestion/test_hyperopt_service.py

.github/workflows/e2e-test-pytorch-mnist.yaml

examples/v1beta1/hp-tuning/hyperopt-distribution.yaml

pkg/suggestion/v1beta1/hyperopt/base_service.py


andreyvelich · 2025-01-30T19:31:21Z

Thanks for this great contribution @shashank-iitbhu!
/lgtm
/approve

google-oss-prow · 2025-01-30T19:31:30Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andreyvelich

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [andreyvelich]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

* resolved merge conflicts
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* fix
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* DISTRIBUTION_UNKNOWN enum set to 0 in gRPC api
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* convert parameter method fix
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
  validation fix add e2e tests for hyperopt added e2e test to workflow
* convert feasibleSpace func updated
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* renamed DISTRIBUTION_UNKNOWN to DISTRIBUTION_UNSPECIFIED
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* fix
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added more test cases for hyperopt distributions
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added support for NORMAL and LOG_NORMAL in hyperopt suggestion service
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added e2e tests for NORMAL and LOG_NORMAL
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
  sigma calculation fixed fix parse new arguments to mnist.py
* hyperopt-suggestion example update
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* updated logic for log distributions
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* updated logic for log distributions
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* e2e test fixed
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added support for parameter distributions for Parameter type INT
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* unit test fixed
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* Update pkg/suggestion/v1beta1/hyperopt/base_service.py
  Co-authored-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* comment fixed
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added unit tests for INT parameter type
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* completed param unit test cases
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* handled default case for normal distributions when min or max are not specified
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* fixed validation logic for min and max
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* removed unnecessary test params
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* fixes
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* added comments
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* fix
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* set default distribution as uniform
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* line omit
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
* removed empty spaces from yaml files
  Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

---------

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>
Co-authored-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
Signed-off-by: Gary Miguel <garymm@garymm.org>

google-oss-prow bot requested review from anencore94, gaocegege and johnugeorge August 21, 2024 22:01

google-oss-prow bot added the size/M label Aug 21, 2024

shashank-iitbhu mentioned this pull request Aug 21, 2024

[GSOC] Project 8: Support various Parameter Distribution in Katib #2374

Closed

12 tasks

google-oss-prow bot added area/gsoc size/L and removed size/M labels Aug 22, 2024

andreyvelich reviewed Sep 2, 2024

shashank-iitbhu force-pushed the feat/hyperopt-suggestion-service-update branch 2 times, most recently from fddb763 to 282f81d Compare September 10, 2024 16:33

shashank-iitbhu commented Sep 11, 2024

pkg/suggestion/v1beta1/hyperopt/base_service.py

shashank-iitbhu requested a review from tenzen-y September 17, 2024 12:16

tenzen-y reviewed Sep 19, 2024

pkg/suggestion/v1beta1/hyperopt/base_service.py

google-oss-prow bot added size/XL size/L and removed size/L size/XL labels Sep 22, 2024

shashank-iitbhu requested a review from tenzen-y September 22, 2024 14:46

shashank-iitbhu commented Sep 22, 2024

examples/v1beta1/hp-tuning/hyperopt-distribution.yaml (outdated)

tenzen-y reviewed Sep 22, 2024

test/unit/v1beta1/suggestion/test_hyperopt_service.py

tenzen-y reviewed Sep 22, 2024

google-oss-prow bot added size/XL and removed size/L labels Sep 22, 2024

shashank-iitbhu force-pushed the feat/hyperopt-suggestion-service-update branch from a1156fc to 658daaf Compare September 23, 2024 04:44

andreyvelich reviewed Sep 24, 2024

shashank-iitbhu and others added 22 commits January 30, 2025 23:14

added more test cases for hyperopt distributions

0a87259

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

added support for NORMAL and LOG_NORMAL in hyperopt suggestion service

325cd3e

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

added e2e tests for NORMAL and LOG_NORMAL

0c15b3a

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in> sigma calculation fixed fix parse new arguments to mnist.py

hyperopt-suggestion example update

8dfacf9

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

updated logic for log distributions

73add59

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

updated logic for log distributions

77b5df0

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

e2e test fixed

a5acb75

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

added support for parameter distributions for Parameter type INT

e6f82a0

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

unit test fixed

df01c1c

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

Update pkg/suggestion/v1beta1/hyperopt/base_service.py

ad0281f

Co-authored-by: Yuki Iwai <yuki.iwai.tz@gmail.com> Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

comment fixed

6e6722e

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

added unit tests for INT parameter type

7aedeed

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

completed param unit test cases

627d32a

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

handled default case for normal distributions when min or max are not…

b0aed17

… specified Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

fixed validation logic for min and max

03ee74e

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

removed unnecessary test params

b91b0ac

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

fixes

9e6a398

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

added comments

630141c

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

fix

6b64073

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

set default distribution as uniform

8c465c1

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

line omit

7506e55

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

removed empty spaces from yaml files

4fc897d

Signed-off-by: Shashank Mittal <shashank.mittal.mec22@itbhu.ac.in>

shashank-iitbhu force-pushed the feat/hyperopt-suggestion-service-update branch from 494f41e to 4fc897d Compare January 30, 2025 17:45

google-oss-prow bot assigned andreyvelich Jan 30, 2025

google-oss-prow bot added the lgtm label Jan 30, 2025

google-oss-prow bot added the approved label Jan 30, 2025

google-oss-prow bot merged commit bf03463 into kubeflow:master Jan 30, 2025
66 checks passed
