Original paper: https://openreview.net/pdf?id=Sy2fzU9gl
Implementation includes improvements from: https://arxiv.org/abs/1804.03599
Note: Models were trained on random subset of 150,000 images from the dsprites data set due to lack of compute power close to NeurIPS deadline 2020.
Performance was evaluated on DCI metric.
Black and white dsprites data was used.
Color: Since we are using black-white no latent variable is responsible
Shape: μ_10
Scale: Unclear, looks to be combination of μ_4, μ_6, μ_8
Orientation: μ_10
X-axis Position: μ_8
Y-axis Position: μ6
It appears the latent space has not learned that the shape and orientiation as a continuous space. Likely due to the small subset of samples shown. I expect performance to increase as training data set in increased.