Batch size during init #18
Replies: 1 comment
Hi @shashank2000, for initialization the batch size does not matter, since the dimensions of the weights do not depend on it. With `dense = nn.Dense(features=4)` you are telling Flax that your layer should have 4 output neurons. Flax still needs to know how many input neurons you have in order to determine the dimensions of the weights. Calling `params = dense.init(key, jnp.ones((1, 3)))` tells Flax that there will be 3 input neurons. `params = dense.init(key, jnp.ones((10, 3)))` will produce the same parameters, as long as the number of features is the same. Hope that helps!
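To illustrate why the batch dimension drops out, here is a minimal NumPy sketch (not actual Flax code) of a dense layer: the kernel shape is `(in_features, out_features)`, so it is fixed by the feature dimension of the example input, never by its batch size.

```python
import numpy as np

def dense_init(rng, in_features, out_features):
    # The kernel shape depends only on (in_features, out_features);
    # the batch size of the example input plays no role.
    kernel = rng.normal(size=(in_features, out_features))
    bias = np.zeros(out_features)
    return kernel, bias

def dense_apply(params, x):
    kernel, bias = params
    # Matrix multiply broadcasts over the leading batch dimension.
    return x @ kernel + bias

rng = np.random.default_rng(0)
params = dense_init(rng, in_features=3, out_features=4)

# "Init" with batch size 1, then apply to a batch of 10 --
# the same parameters work for any batch size.
y1 = dense_apply(params, np.ones((1, 3)))
y10 = dense_apply(params, np.ones((10, 3)))
print(params[0].shape)        # (3, 4)
print(y1.shape, y10.shape)    # (1, 4) (10, 4)
```

The same reasoning carries over to Flax: `init` only uses the example input to trace shapes, so `jnp.ones((1, 3))` and `jnp.ones((10, 3))` yield identical parameter shapes.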
Hey @matthias-wright
Quick question: I was wondering why the ResNet model was initialized with a batch size of 1. More generally, if I were to use one of the pretrained models here and then add a linear layer, what would the initialization process look like?
I'm assuming I'd define a new linen module consisting of, say, a ResNet and a `Dense` layer. Then I would initialize that module with some `x` with batch size 1? Would this be true even if the batch size in my downstream model is >1? I'm a little confused because in the train loop in the same ResNet file above, batches do seem to be passed through the forward pass of the model with `apply`, but the `init` call for the same model uses an `x` whose first dim is just 1.
Let me know if that wasn't clear, and thanks very much in advance.