mini-batch api coverage #818
ColdTeapot273K
started this conversation in
General
Replies: 1 comment 4 replies
-
Hey there @ColdTeapot273K! That's music to my ears. As you say, there are some essential pieces of logic that indeed should support mini-batches. Feel welcome to open a PR, we can iron out the details together. @raphaelsty did quite a bit of work in this direction too. At some point it might be worth considering properly benchmarking these mini-batch implementations to compare them with their …
-
Hey, I've drafted implementations of the mini-batch API (like `learn_many`) for several `river` abstractions such as `column selector` and `transformer union`, and I'm iterating on one for `one hot encoder`. Together with the abstractions already covered (`linear models` for regression/classification, `standard scaler` and `pipeline`), this should make an essential set of mini-batch tools. I can show it here or submit a PR later if that's of interest.

The mini-batch API has given me a speedup of over two orders of magnitude in training, from 30+ hours to 3 minutes, on a fairly large dataset.
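
For readers unfamiliar with the idea, here is a minimal sketch of why a `learn_many`-style method can replace many `learn_one`-style calls without changing the result. `RunningMean` is a hypothetical toy class written for this illustration. It is not river's implementation, and only the method names are borrowed from the post. It folds a whole batch into the running statistic in one update, which is where the savings would come from once the batch arithmetic is vectorised.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class RunningMean:
    """Toy running mean (hypothetical; not river's implementation)."""

    n: int = 0
    mean: float = 0.0

    def learn_one(self, x: float) -> None:
        # Incremental update for a single observation.
        self.n += 1
        self.mean += (x - self.mean) / self.n

    def learn_many(self, xs: Sequence[float]) -> None:
        # Fold a whole batch in with one update instead of len(xs) updates.
        if not xs:
            return
        batch_n = len(xs)
        batch_mean = sum(xs) / batch_n
        total = self.n + batch_n
        self.mean += (batch_mean - self.mean) * batch_n / total
        self.n = total


data = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0]

# One observation at a time.
one_by_one = RunningMean()
for x in data:
    one_by_one.learn_one(x)

# The same data in mini-batches of three.
batched = RunningMean()
for start in range(0, len(data), 3):
    batched.learn_many(data[start:start + 3])
```

Both objects end up with the same count and mean, so the batched path is a drop-in replacement here. Pure Python like this will not show a large speedup by itself; the gain would come from running the batch step on arrays or data frames.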