fix usage of gpu #11


alishahali1382
Contributor

When calling `model.to(device)`, a new instance of the model is created, loaded onto the device, and returned. But since the return value is ignored here and the previous model object is still used, all operations run on the previous device (here, the CPU).
Therefore the GPU is not used by the current code.

@amirhossein-razlighi
Collaborator

Please look at the example below:

```python
model = nn.Linear(1, 1)
print(model.weight.device)
> cpu
model.cuda()
print(model.weight.device)
> cuda:0

x = torch.randn(1)
print(x.device)
> cpu
x.cuda()  # has no effect on x: returns a new tensor, which is discarded here
print(x.device)
> cpu
```

So, as you can see, unlike tensors, an `nn.Module` instance's weights are moved to the CUDA device in place! So there is no bug here, since `self.model` is an instance of `nn.Module`, not a tensor.
Please refer to this discussion on PyTorch's forum for further info.
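To make the in-place-vs-copy distinction checkable without a GPU, here is a small CPU-only sketch that uses a dtype conversion as a stand-in for a device move (`.to()` has the same return semantics in both cases): `Module.to` converts its parameters in place and returns the same object, while `Tensor.to` returns a new tensor and leaves the original untouched.

```python
import torch
import torch.nn as nn

# nn.Module.to() converts parameters in place and returns the SAME module object.
model = nn.Linear(1, 1)
returned = model.to(torch.float64)
assert returned is model                      # same object came back
assert model.weight.dtype == torch.float64    # parameters converted in place

# Tensor.to() returns a NEW tensor; the original is left unchanged.
x = torch.randn(1)                            # float32 by default
y = x.to(torch.float64)
assert y is not x
assert x.dtype == torch.float32               # x was not modified
assert y.dtype == torch.float64
```

This is why `x = x.cuda()` (reassigning) is required for tensors, whereas `model.cuda()` alone is enough for modules.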
