-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: Remove training harness wrappers (learn!
, evaluate!
) from Lighthouse.jl?
#81
Comments
I agree! I think one annoying thing about this harness is if you want access to some variable inside for current_epoch in 1:epoch_limit
train!(model, get_train_batches(), logger)
...other stuff...
post_epoch_callback(current_epoch)
end so if you want to know the current epoch inside One fix to that particular problem would be to also pass in the I think the main benefit of a harness is that it's easy to get started; it tells you what things you need (although TBH the current docs are kinda confusing on this point) and then it assembles them for you to have a basic training harness to get started. IMO that benefit could be obtained by having an example (that runs in CI so it doesn't bitrot) that shows you a basic harness. E.g. we could start just by moving this code from the package to an example file (with the key point that we can freely change the example code without breaking anything downstream). Then when you start a new modelling project, you can copy-paste the example to your codebase, and then start tweaking it. FWIW Flux itself has had this discussion since they provide a training loop, e.g. https://discourse.julialang.org/t/is-it-a-good-time-for-a-pytorch-developer-to-move-to-julia-if-so-flux-knet/38453/47. They currently do provide one but I've often seen people say that's mostly for examples and you should copy the loop and modify it yourself once you need to. IMO in our case the maintenance costs outweigh the benefits.
That's true, though it's meant to be if you use LighthouseFlux, it just implements |
I generally agree that the training harness is too rigid to be really useful (in a recent modeling effort we considered building around it but it was gonna be hard to integrate into the distributed harness we were setting up).
That would be well served by a doctest IMO, which is/can be run in CI and also is surfaced in the docs so it's clearly an example! |
Hm, not really sure about this-- it could be like a several-hundred-line doctest if it's really defining I think there could be overlap w https://github.com/beacon-biosignals/LegolasFlux.jl/blob/main/examples/digits.jl, which is run in LegolasFlux tests to test model serialization on a more complicated example. I wonder if there could be like an "example model" that uses & checks that everything works together (Lighthouse, logging, serialization, etc) which could serve as a starting point for new models as well as be open-source and could help with reproducing issues etc. |
I took a recent time-boxed* attempt at deprecating
EvaluationRow
(#71), and it was Not A Fun Time™. I'm not sure what the correct thing to do here is, and I think that it is very possible that we should remove some (all?) of the training harness aspects of Lighthouse.jl altogether---or at least, we should figure out the minimal set of them that users actually depend on.We've spent a certain amount of time maintaining the existing harness behavior, but afaik our internal usage of the harness itself is basically non-existent, as teams have combined the Lighthouse components in their own ways to support their specific use-cases. (Specifics to be left to a Beacon slack thread!) Externally, I would be surprised if anyone was depending on them (although if someone is, please say the word!!). Additionally, https://github.com/beacon-biosignals/LighthouseFlux.jl/blob/main/src/LighthouseFlux.jl does not use
learn!
orevaluate!
.*technically it was also [hurtling-through-]space-boxed as well, huzzah for transcontinental flight....
The particular hole I fell down was trying to cleanly swap out the use of
evaluation_metrics_row
(and its correspondinglog_evaluation_row!
) insideevaluate!
.Lighthouse.jl/src/learn.jl
Line 233 in 0540cdd
Why? Well, the existing usage conflates tradeoff metrics and hardened metrics, and we shouldn't be computing (and logging) both there without the caller specifying things like how the per-class binarization should happen (to create the hardened metrics), and whether they want to additionally compute multirater label metrics (i.e., if
votes
exist), and whether they additionally want to compute additional metrics that (until now) have only been computed for binary classification problems.At the point that we expose separate interfaces for the various combinations of inputs that a user might want (mulitclass v binary, single rater v multirater, binarization choice, pre-computed
predicted_hard_labels
for multiclass input), we will have created a lot more code to maintain for a lot less benefit. At the end of the day, asking the caller to implement their ownevaluate!(model::AbstractClassifier, ...)
would be easier....which seems like a viable option. Except what inputs would a template
evaluate!
function take? What combination of the existing? Because the choice depends on the type of metrics being computed in the first place.
If various projects were depending on this
evaluate!
function, it seems like choosing some minimal required arguments (?) and passing the others as kwargs from the
learn!
function might do it. Buuuut I'm not sure how many folks use this function in the first place!Which brings me to: what is the correct thing to do here? And does anyone use the
learn!
andevaluate!
functions, or could/should we remove them except as examples (in the docs) for how to implement a custom loop for one's own model?To quote @ericphanson, when talking about this issue,
I agree---and so I think losing the
evaluate!
andlearn!
calls as they currently exist would allow us to focus on those primitives and sink less time into maintaining code that isn't used. And if more flexible (or smaller scoped) harnesses make their way back into Lighthouse.jl, I think that would be cool also, but we should let that happen from the ground up once we create training loops that actively work for us and could be shared.The text was updated successfully, but these errors were encountered: