New options for preference tuning: rpo alpha, logprobs normalization, reference-free, simpo gamma #841

Triggered via pull request June 13, 2025 08:40
Status Success
Total duration 27s

check_code_quality.yml

on: pull_request
pre-commit
22s