EDGAN

This repository modifies the original StackGAN code from github.

Dataset

use MSCOCO data set

get data set and preprocessed model

Download MSCOCO dataset and annotations including captions and instances
Download pretrained char-CNN-RNN embedding of MSCOCO.
misc/preprocess_mscoco.py preprocess the image in to different sizes for selected supercategory ,write them into tfrecords file along with the corresponding caption embedding.

New features

Data input pipline

use mscoco python API
dataloader that load tfrecords from mscoco
image augumentation including cropping, flipping, and standarlization (when downsample the image, use INTER_AREA method)
sampling from multiple caption embeddings, visualize embedding distributions
negative example (use inner product of embedding captions, see method CLSGAN)
filter out selective images based on classes and their areas

Modification of GAN network

enlarge capacity of generator network, adding 3 residual blocks.
change relu to leaky relu
option to no batch norm in discriminator
increase or reduce discriminator final dimension

Multiple training methods of GAN

Option to trian with vanilla GAN
Option to train with WGAN (excluding weight clipping for batchnorm)
Option to train with LSGAN
Option to train with CLSGAN, continous least square GAN that estimates the inner products of embeddings between right caption embeddings and wrong caption embeddings.
Option to train with BGAN (not implemented yet)

Classification Transfering from Imagenet to MSCOCO (for future 3 stage GAN)

Label each image in MSCOCO with multiple labels for objects that have area larger than the threshold
Transfer resnet from Caffe to Tensorflow
Train resnet to classify the 80 categories of objects in MSCOCO

References publications

StackGAN
text2image
[char-RNN-CNN]
WGAN
LSGAN
BGAN

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
2_stage_1		2_stage_1
2_stage_2		2_stage_2
Data		Data
deprecated		deprecated
future		future
misc		misc
models		models
papers		papers
transfer		transfer
visualization		visualization
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
activate		activate
tensorboard.sh		tensorboard.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EDGAN

Dataset

get data set and preprocessed model

New features

Data input pipline

Modification of GAN network

Multiple training methods of GAN

Classification Transfering from Imagenet to MSCOCO (for future 3 stage GAN)

References publications

About

Releases

Packages

Languages

License

yao-zhao/EDGAN

Folders and files

Latest commit

History

Repository files navigation

EDGAN

Dataset

get data set and preprocessed model

New features

Data input pipline

Modification of GAN network

Multiple training methods of GAN

Classification Transfering from Imagenet to MSCOCO (for future 3 stage GAN)

References publications

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages