Log-reward uniformisation, logging and new hyperparams for temperature, Z-training, validation. #54

Merged
julienroyd merged 55 commits into trunk from julien-pr1
Mar 22, 2023

julienroyd
Contributor

Several changes (most of them minor):

  • Added option ‘constant’ for ‘temperature_sample_dist’
  • New hyperparameters:
    • objectives (variable list of objectives to optimise)
    • num_thermometer_dim (for beta encoding)
    • Z_learning_rate
    • valid_random_action_prob (now performs greedy_validation by default)
    • n_valid_prefs (number of validation preference vectors used to span the simplex)
    • n_valid_repeats_per_pref (number of molecules to sample for each preference vector at validation)
  • Added logging
    • Now saves molecules generated during validation
    • Saves hyperparameters in stdout and on disk
    • Saves ‘git_hash’ and ‘hostname’ in hyperparameters
    • Logs to stdout during training
  • Made all algorithms consistent with log-rewards
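
To illustrate two of the items above, here is a minimal sketch of a thermometer encoding for the temperature (the vector length plays the role of num_thermometer_dim) and of applying the temperature in log space. The function names `thermometer` and `tempered_log_reward`, the `[vmin, vmax]` range, and the clamp-and-scale scheme are assumptions made for illustration; they are not taken from the code in this PR.

```python
import math
from typing import List


def thermometer(value: float, n_bins: int, vmin: float = 0.0, vmax: float = 1.0) -> List[float]:
    """Encode a scalar as a length-n_bins vector with entries in [0, 1].

    Hypothetical helper: bins are evenly spaced on [vmin, vmax]; each entry
    says how far `value` has risen past that bin, capped at one bin width.
    """
    if n_bins < 2:
        raise ValueError("n_bins must be at least 2")
    gap = (vmax - vmin) / (n_bins - 1)
    bins = [vmin + i * gap for i in range(n_bins)]
    # Clamp (value - bin) to [0, gap], then rescale to [0, 1].
    return [min(max(value - b, 0.0), gap) / gap for b in bins]


def tempered_log_reward(log_reward: float, beta: float) -> float:
    """Return log(R ** beta) computed directly in log space as beta * log(R)."""
    return beta * log_reward


if __name__ == "__main__":
    beta = 0.5  # e.g. a fixed value, as with a 'constant' temperature distribution
    print(thermometer(beta, n_bins=5))
    print(tempered_log_reward(math.log(2.0), beta))
```

Working with log-rewards avoids exponentiating a small reward by a large temperature, which is one plausible reason to keep every algorithm on the same log-reward convention.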

bengioe and others added 30 commits February 18, 2023 20:07
…spond to self.temperature_dist_params=<fixed_value>
  * objectives: the list of objectives to optimise among {'seh', 'qed', 'sa', 'mw'}
  * num_thermometer_dim: the length of the thermometer vector for encoding temperature
…passing full thermometer instead of empty one)
…on_prob to be an argument of graph_sampler.sample_from_model()
  * n_valid_prefs
  * n_valid_repeats_per_pref
….setup() to be caught by Determined agents as well
…ling task.cond_info_to_reward() for tasks that wouldn't have been updated to using logrewards rather than rewards)
…sks:

  * SEHMOOTask now inherits from SEHTask
  * Now SEHMOOTask only concatenates the preferences to what was already in cond_info (done by parent classes)
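
The validation setup described in this PR (n_valid_prefs preference vectors, each repeated n_valid_repeats_per_pref times, with preferences concatenated to whatever is already in cond_info) can be sketched roughly as follows. This is only an illustration under stated assumptions: `sample_simplex_prefs` and `build_validation_cond_info` are hypothetical names, the preferences are drawn uniformly from the simplex by normalising exponential draws rather than by whatever scheme the repository uses, and the conditioning encoding is represented as a plain list of floats.

```python
import random
from typing import List, Sequence


def sample_simplex_prefs(n_prefs: int, n_objectives: int, seed: int = 0) -> List[List[float]]:
    """Draw n_prefs preference vectors uniformly from the probability simplex.

    Assumption: normalised Exponential(1) draws, i.e. a flat Dirichlet sample.
    """
    rng = random.Random(seed)
    prefs = []
    for _ in range(n_prefs):
        draws = [rng.expovariate(1.0) for _ in range(n_objectives)]
        total = sum(draws)
        prefs.append([d / total for d in draws])
    return prefs


def build_validation_cond_info(
    base_encoding: Sequence[float],
    prefs: Sequence[Sequence[float]],
    n_repeats: int,
) -> List[List[float]]:
    """Append each preference vector to the existing encoding, n_repeats times."""
    rows = []
    for pref in prefs:
        for _ in range(n_repeats):
            # The parent encoding comes first; preferences are only concatenated.
            rows.append(list(base_encoding) + list(pref))
    return rows


if __name__ == "__main__":
    prefs = sample_simplex_prefs(n_prefs=3, n_objectives=4)
    rows = build_validation_cond_info([1.0, 0.0], prefs, n_repeats=2)
    print(len(rows), len(rows[0]))
```

Each row corresponds to one molecule to sample at validation, so the total number of validation samples is n_valid_prefs times n_valid_repeats_per_pref.
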
@julienroyd julienroyd requested a review from bengioe as a code owner March 10, 2023 15:20
@julienroyd julienroyd force-pushed the julien-pr1 branch 2 times, most recently from 9b8f7ee to 48198cf Compare March 15, 2023 15:50
@bengioe
Collaborator

bengioe commented Mar 20, 2023

#50 and #52 were merged, would you mind resolving conflicts and rebasing?

@julienroyd
Contributor Author

@bengioe I merged the trunk here, we can fast-forward now. I am currently running an experiment just to validate that we still get the same results and then we should be good to close. I'll let you know when the results are in.

@julienroyd
Contributor Author

I am done with the validation run; the results look good (see this link).

@julienroyd
Contributor Author

Closing.

@julienroyd julienroyd merged commit 838bbd6 into trunk Mar 22, 2023
@julienroyd julienroyd deleted the julien-pr1 branch March 22, 2023 14:52