First introduced in 2014 by Ian Goodfellow, Generative Adversarial Networks (GANs) have been under the scope and have been proven very good for generating complex data. Images, text, video have been successfully generated with very good performances.

The original architecture is based on two artificial neural networks trained simultaneously in a competitive manner. One of them, the generator, has the objective of generating the most realistic possible data,while the second network - the discriminator, has the opposite aim of aiming to distinguish the realistic data from the synthetic data the best it can. So, the elegance of this architecture is that each network tries to make the other perform better every time. The GAN architecture is shown below:

GAN architecture

The discriminator is represented by D_ω, parametrised by weights ω. The goal of the discriminator is to assign 1 to the samples from the real distribution P_X and 0 to the generated samples (P_θ). So, GANs can be mathematically represented by a minimax game identified by:

$\min_{G}\max_{D} \; E [log(D_{\omega}(X)) + log(1-D_{\omega}(G_{\theta}(Z))]$

So, G must minimise this equation and D must maximise it, each one tweaking the weights of its network (θ and ω) to do so.

Putting this knowledge into practice

The very basic implementation we are going to follow through came firstly from julialang discourse, so props where props are due. The idea is to create a simple implementation so we can follow along and additionally, that gans for image generation are very common on the internet, so this is also an opportunity to explore something different. To create this, we will need Flux package.