You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#1560 temporarily disables rematerialization by default to mitigate #1337
What happens is that it tries to fuse all nonlinearities, which works if you compute all pre-nonlinearity-activations at once and the put them into the fusion.
So I think we fundamentally need a memory-aware way of moving operations around for things like rematerialization.
One thing could be to not do the full graph approach but try some local surgery.
#1560 temporarily disables rematerialization by default to mitigate #1337
What happens is that it tries to fuse all nonlinearities, which works if you compute all pre-nonlinearity-activations at once and the put them into the fusion.
So I think we fundamentally need a memory-aware way of moving operations around for things like rematerialization.
One thing could be to not do the full graph approach but try some local surgery.
cc @riccardofelluga
The text was updated successfully, but these errors were encountered: