UNet deepest layer number of features #1760
-
Hi, I was looking at the UNet architecture again, and I noticed that there is an extra convolution (stride 1x1x1) in the deepest layer. Could you explain the reasoning behind this? Thank you.

model = UNet(
Replies: 5 comments
-
Hi @ericspod, could you please share some info here? Thanks.
-
The final value in the `channels` argument defines the size of the bottom layer of the UNet structure, which doesn't involve any down/upsampling, hence the stride of 1. A UNet typically has a bottom layer of convolutions representing information that is dense in the channel dimension with downsampled spatial dimensions; this is the latent space the upsampling branch of the network starts from to decode the output.
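As a concrete illustration (using hypothetical `channels` and `strides` values, not necessarily those from the original post), the `channels` tuple has one more entry than `strides`, so the last feature count has no stride paired with it and the bottom layer falls back to a stride of 1:

```python
# Sketch of how MONAI-style UNet arguments map onto layers.
# Hypothetical values: only the relationship between the two tuples matters.
channels = (16, 32, 64, 128, 256)  # feature counts per layer, top to bottom
strides = (2, 2, 2, 2)             # one fewer entry than channels

layers = []
for i, ch in enumerate(channels):
    # Every layer but the last downsamples; the bottom layer uses stride 1.
    stride = strides[i] if i < len(strides) else 1
    layers.append((ch, stride))

print(layers)  # the final (256, 1) entry is the bottom, stride-1 convolution
```

The bottom entry is the "extra convolution" from the question: it changes the channel count (128 to 256 here) without resampling the spatial dimensions.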
-
Okay, so it goes from 128 with a normal convolution to 256. Where does the SkipConnection input (384) come from? I was under the impression that after the last down-sampling, i.e. in the bottom layer, it would go directly to up-sampling, without the stride-1 convolution.
-
That follows the bottom layer and is the first layer of the upsampling branch. The UNet is structured as nested layers, where each layer includes the downsample path, a skip connection, the next layer down, and the upsample branch concatenating the data from the skip connection with the output of the next layer down.
-
Thank you for answering my questions. I think I understand it now.