AIPlane3 / README.md
PrakhAI's picture
Update README.md
f332b85
|
raw
history blame
2.87 kB
metadata
datasets:
  - https://www.robots.ox.ac.uk/~vgg/data/fgvc-aircraft/
Generated Real (for comparison)
image/png image/png

This GAN model is trained on the FGVC Aircraft dataset. The model uses Progressive Growing with Spectral Normalization.

The work builds up on https://huggingface.co/PrakhAI/AIPlane and https://huggingface.co/PrakhAI/AIPlane2.

This model was trained to generate 256x256 images of Aircrafts. The implementation in JAX on Colab can be found here.

Convolutional Architecture

A significant improvement over https://huggingface.co/PrakhAI/AIPlane2 is the elimination of "checkerboard" artifacts. This is done by using Image Resize followed by Convolution layer in the Generator instead of a Transposed Convolution where the kernel size is not divisible by the stride.

Transposed Convolution (kernel size not divisible by stride) Resize followed by convolution
image/png image/png

Image Quality

The model, while generating several high quality images of Airplanes, also generates poor quality images.

A total of 400 generated images were labeled by hand as either desirable (151) or undesirable (249).

Sample desirable outputs Sample undesirable outputs
image/png image/png

Latent Space Interpolation

Latent Space Interpolation can an educational exercise to get deeper insight into the model.

It can be observed below that several aspects of the generated image such as the color of the sky, grounded-ness of the plane, as well as the plane shape and color are frequently continuous through the latent space.

image/png

Training Progression