Error while training StyleGAN with CIFAR-10 #124

hexiangdong2017 · 2020-12-08T12:42:11Z

(fb_gan_zoo) root@f56c103c5607:~/pytorch_GAN_zoo# python train.py StyleGAN -c config_cifar10.json --restart -n cifar10
Setting up a new session...
Running StyleGAN
size 10
50000 images found
AC-GAN classes :
{'Main': {'order': 0, 'values': ['horse', 'deer', 'automobile', 'cat', 'frog', 'ship', 'airplane', 'truck', 'dog', 'bird']}}

size 10
50000 images found
50000 images detected
size (8, 8)
50000 images found
Changing alpha to 0.000
/root/pytorch_GAN_zoo/models/base_GAN.py:278: UserWarning: This overload of add_ is deprecated:
add_(Number alpha, Tensor other)
Consider using one of the following signatures instead:
add_(Tensor other, *, Number alpha) (Triggered internally at /pytorch/torch/csrc/utils/python_arg_parser.cpp:882.)
avg_p.mul_(0.999).add_(0.001, p.data)
Traceback (most recent call last):
File "train.py", line 137, in
GANTrainer.train()
File "/root/pytorch_GAN_zoo/models/trainer/progressive_gan_trainer.py", line 235, in train
status = self.trainOnEpoch(dbLoader, scale,
File "/root/pytorch_GAN_zoo/models/trainer/gan_trainer.py", line 486, in trainOnEpoch
allLosses = self.model.optimizeParameters(inputs_real,
File "/root/pytorch_GAN_zoo/models/base_GAN.py", line 249, in optimizeParameters
self.classificationPenalty(predFakeD,
File "/root/pytorch_GAN_zoo/models/base_GAN.py", line 563, in classificationPenalty
loss.backward(retain_graph=True)
File "/root/anaconda3/envs/fb_gan_zoo/lib/python3.8/site-packages/torch/tensor.py", line 221, in backward
torch.autograd.backward(self, gradient, retain_graph, create_graph)
File "/root/anaconda3/envs/fb_gan_zoo/lib/python3.8/site-packages/torch/autograd/init.py", line 130, in backward
Variable._execution_engine.run_backward(
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [512, 512]], which is output 0 of TBackward, is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

hexiangdong2017 · 2020-12-08T12:42:42Z

@likethesky @Celebio @teytaud @colesbury

varshakishore · 2021-01-15T23:05:52Z

Were you able to fix this error?
@hexiangdong2017

CyberKing0514 · 2021-04-01T08:35:02Z

pytorch_GAN_zoo/models/networks/styleGAN.py

modify line 158 to

self.mean_w**.data** = self.gamma_avg * self.mean_w**.data** + (1-self.gamma_avg) * mapping.mean(dim=0, keepdim=True)

mhaines94108 · 2022-09-29T05:34:54Z

CyberKing's fix works better without the extra *'s:

        self.mean_w.data = self.gamma_avg * self.mean_w.data + (1 - self.gamma_avg) * mapping.mean(
            dim=0, keepdim=True)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error while training StyleGAN with CIFAR-10 #124

Error while training StyleGAN with CIFAR-10 #124

hexiangdong2017 commented Dec 8, 2020

hexiangdong2017 commented Dec 8, 2020

varshakishore commented Jan 15, 2021 •

edited

Loading

CyberKing0514 commented Apr 1, 2021

mhaines94108 commented Sep 29, 2022

Error while training StyleGAN with CIFAR-10 #124

Error while training StyleGAN with CIFAR-10 #124

Comments

hexiangdong2017 commented Dec 8, 2020

hexiangdong2017 commented Dec 8, 2020

varshakishore commented Jan 15, 2021 • edited Loading

CyberKing0514 commented Apr 1, 2021

mhaines94108 commented Sep 29, 2022

varshakishore commented Jan 15, 2021 •

edited

Loading