Introduction to Conditional GANs

August 29, 2022

Table of contents

What is a GAN and how does it work?
What is a Conditional GAN?
Advantages of cGANs
Pictorial explanation
Use cases of cGANs

What is a GAN and how does it work?

A generative adversarial network (GAN) is a generative model that achieves a high level of realism by pairing a generator with a discriminator.

The generator learns to produce the target output, while the discriminator learns to distinguish real data from the output of the generator.

Put another way, the generator tries to fool the discriminator, and the discriminator tries to avoid being fooled.

Initially, we train the discriminator on a known dataset, presenting it with samples from the training data until it reaches acceptable accuracy. The generator is then trained according to whether it succeeds in fooling the discriminator. Typically, the generator is fed random input sampled from a predefined latent space (e.g., a multivariate normal distribution). The discriminator then evaluates the output that the generator produces.
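The training procedure described above can be sketched on a toy one-dimensional problem. This is only an illustration under stated assumptions, not the method of any particular library: the "real" data are samples from a normal distribution with mean 4, the generator is the linear map g(z) = a*z + b, the discriminator is a single logistic unit, and the two are updated in alternation at every step (a common scheme, rather than fully pre-training the discriminator first). All names and hyperparameters are hypothetical.

```python
import math
import random


def sigmoid(s):
    """Numerically stable logistic function."""
    if s >= 0:
        return 1.0 / (1.0 + math.exp(-s))
    e = math.exp(s)
    return e / (1.0 + e)


def train(steps=200, lr=0.05, seed=0):
    rng = random.Random(seed)
    a, b = 1.0, 0.0  # generator parameters: g(z) = a*z + b
    w, c = 0.1, 0.0  # discriminator parameters: D(x) = sigmoid(w*x + c)
    for _ in range(steps):
        # Discriminator step: push D(real) towards 1 and D(fake) towards 0.
        x_real = rng.gauss(4.0, 0.5)   # sample from the (assumed) training data
        z = rng.gauss(0.0, 1.0)        # sample from the latent space
        x_fake = a * z + b
        d_real = sigmoid(w * x_real + c)
        d_fake = sigmoid(w * x_fake + c)
        # Gradients of -log D(x_real) - log(1 - D(x_fake)) w.r.t. w and c.
        grad_w = -(1.0 - d_real) * x_real + d_fake * x_fake
        grad_c = -(1.0 - d_real) + d_fake
        w -= lr * grad_w
        c -= lr * grad_c

        # Generator step: push D(fake) towards 1, i.e. try to fool D.
        z = rng.gauss(0.0, 1.0)
        x_fake = a * z + b
        d_fake = sigmoid(w * x_fake + c)
        # Gradients of -log D(g(z)) w.r.t. a and b (chain rule through D).
        grad_a = -(1.0 - d_fake) * w * z
        grad_b = -(1.0 - d_fake) * w
        a -= lr * grad_a
        b -= lr * grad_b
    return a, b, w, c


a, b, w, c = train()
print("generator:", a, b)
print("discriminator:", w, c)
```

Real GANs replace the two scalar models with deep networks and rely on automatic differentiation, but the alternation between a discriminator update and a generator update has the same shape.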

Generator network – Takes a random vector as input and decodes it into a synthetic image.
Discriminator network – Takes an image as input and predicts whether the image came from the training set or was created by the generator network.

(Citation: Deep Learning with Python by François Chollet)
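To make the two roles concrete, here is a minimal sketch of the interfaces only. It assumes a tiny flattened 4x4 "image" and uses untrained random linear layers in place of real networks; the sizes and names (LATENT_DIM, IMAGE_PIXELS, generator, discriminator) are hypothetical choices for illustration.

```python
import math
import random

LATENT_DIM = 8     # assumed length of the random input vector
IMAGE_PIXELS = 16  # assumed 4x4 image, flattened into a list

rng = random.Random(0)

# Untrained random weights stand in for real (trained) networks.
G_WEIGHTS = [[rng.gauss(0.0, 0.1) for _ in range(LATENT_DIM)]
             for _ in range(IMAGE_PIXELS)]
D_WEIGHTS = [rng.gauss(0.0, 0.1) for _ in range(IMAGE_PIXELS)]


def generator(z):
    """Decode a latent vector into a synthetic image with pixels in (-1, 1)."""
    return [math.tanh(sum(w * v for w, v in zip(row, z))) for row in G_WEIGHTS]


def discriminator(image):
    """Estimate the probability that an image came from the training set."""
    logit = sum(w * p for w, p in zip(D_WEIGHTS, image))
    return 1.0 / (1.0 + math.exp(-logit))


z = [rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)]  # random input vector
fake_image = generator(z)
p_real = discriminator(fake_image)
print(len(fake_image), p_real)
```

The generator maps a vector to an image, and the discriminator maps an image to a single probability; everything else in a GAN is about how these two functions are trained.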

GANs use backpropagation in both networks to reduce their errors, so that the generator produces better images while the discriminator becomes more skilled at flagging synthetic ones. Typically, the discriminator is a convolutional neural network, and the generator is a deconvolutional (transposed-convolution) neural network.
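The errors that backpropagation minimizes are usually cross-entropy losses on the discriminator's output. The sketch below assumes the commonly used "non-saturating" generator loss, which is a choice made for illustration and is not stated in this article; the function names and the small EPS constant are hypothetical.

```python
import math

EPS = 1e-12  # guards against log(0) when a probability is exactly 0 or 1


def discriminator_loss(d_real, d_fake):
    """Low when D assigns a high score to real data and a low score to fakes."""
    return -(math.log(d_real + EPS) + math.log(1.0 - d_fake + EPS))


def generator_loss(d_fake):
    """Low when D is fooled, i.e. assigns a high score to generated data."""
    return -math.log(d_fake + EPS)


# d_real and d_fake are discriminator outputs in [0, 1].
print(discriminator_loss(d_real=0.9, d_fake=0.1))  # confident, correct D
print(discriminator_loss(d_real=0.6, d_fake=0.4))  # uncertain D, higher loss
print(generator_loss(d_fake=0.9))                  # G fools D, low loss
print(generator_loss(d_fake=0.1))                  # G is caught, high loss
```

The two losses pull in opposite directions on the same quantity, the discriminator's score for generated images, which is what makes the training adversarial.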

Although originally proposed as a form of generative model for unsupervised learning, GANs have also proven useful for supervised learning and reinforcement learning.

https://www.tensorflow.org/tutorials/generative/images/gan2.png

What is a Conditional GAN?

A conditional generative adversarial network, or cGAN for short, is a type of GAN in which the generator produces images conditioned on additional information, such as a class label.

In the previous section, we explored GANs that give us no control over the type of output produced. Unlike most generative network architectures, cGANs are not completely unsupervised in their training: they require class labels or other labeled data to perform the desired action. To understand the difference between a simple GAN architecture and a cGAN architecture, let us compare their mathematical formulations, starting with the expression for the simple GAN shown below.

https://blog.paperspace.com/content/images/2022/03/image-6.png
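For readers who cannot load the image, the expression is presumably the standard GAN minimax objective; this is an assumption, since the image content is not reproduced in the text. Here x is a real sample, z is a latent vector, D is the discriminator and G is the generator:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z))\big)\big]
```

The discriminator maximizes V by scoring real samples near 1 and generated samples near 0, while the generator minimizes V by making D(G(z)) as large as possible.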

With a minor modification to the previous formula for the simple GAN, a label y is added to both the discriminator and the generator. Adding y turns the previous probabilities into conditional probabilities, so both networks are now trained with respect to a given label. Once training is complete, we can therefore pass in a particular label and receive the corresponding output from the generator.

https://blog.paperspace.com/content/images/2022/03/image-7.png
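For readers who cannot load the image, the conditional objective is presumably the standard cGAN formulation; again this is an assumption about what the image shows. The label y conditions both D and G, with x a real sample and z a latent vector:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x \mid y)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z \mid y))\big)\big]
```

Setting aside y, this is the unconditional GAN objective term for term: the only change is that D(x) becomes D(x | y) and G(z) becomes G(z | y).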

Both the generator and the discriminator receive these labels during training. Both networks of the cGAN are therefore trained conditionally: the generator learns to produce only outputs that match the given label, while the discriminator checks both whether an image is real or fake and whether it matches that label.
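One way to picture how the discriminator sees the labels is to build its training examples by hand. The sketch below assumes the simplest conditioning scheme, in which a one-hot label is appended to the flattened image; the helper names, the three-class setup and the two-pixel "images" are hypothetical.

```python
NUM_CLASSES = 3  # assumed number of classes for this toy example


def one_hot(label, num_classes=NUM_CLASSES):
    """Encode an integer class label as a one-hot list."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]


def discriminator_examples(real_images, real_labels, fake_images, fake_labels):
    """Build (input_vector, target) pairs for a conditional discriminator.

    Each input is the flattened image followed by its one-hot label.
    Real pairs get target 1.0 and generated pairs get target 0.0.
    """
    examples = []
    for image, label in zip(real_images, real_labels):
        examples.append((image + one_hot(label), 1.0))
    for image, label in zip(fake_images, fake_labels):
        examples.append((image + one_hot(label), 0.0))
    return examples


examples = discriminator_examples(
    real_images=[[0.2, 0.8]], real_labels=[1],
    fake_images=[[0.5, 0.5]], fake_labels=[2],
)
for inputs, target in examples:
    print(inputs, target)
# [0.2, 0.8, 0.0, 1.0, 0.0] 1.0
# [0.5, 0.5, 0.0, 0.0, 1.0] 0.0
```

Because the label is part of the input, the discriminator can only score a generated image highly if it looks real for that particular label.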

Advantages of cGANs

By providing additional information to the model, we get two benefits:

Convergence tends to be faster, because the labels give even the generated (fake) images some structure to follow.
You can control the output of the generator at test time by supplying the label for the image you want to generate.

Pictorial explanation

If that was confusing, consider the following example:

Suppose you train a GAN on handwritten digits (the MNIST dataset). Normally, you cannot control which specific images the generator will produce. In other words, there is no way to request a particular digit from the generator.

This is where cGANs come in: we can add an additional input layer that takes one-hot-encoded image labels. This additional layer guides the generator as to which image to produce.

The input to the additional layer can be a feature vector derived either from an image that encodes the class or from a set of specific characteristics we expect of the image.

Conditional generative adversarial networks are not strictly unsupervised learning algorithms, because they require labeled data as input to the additional layer.
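As a minimal sketch of the MNIST example, the code below builds the input a conditional generator would receive when we request a specific digit. It assumes the label is simply concatenated to the noise vector (real implementations often use an embedding layer instead), and the latent size of 100 and the function names are hypothetical.

```python
import random

NUM_DIGITS = 10   # MNIST has ten classes, the digits 0-9
LATENT_DIM = 100  # assumed size of the random noise vector


def one_hot(digit):
    """One-hot encode a digit label."""
    return [1.0 if i == digit else 0.0 for i in range(NUM_DIGITS)]


def generator_input(digit, rng):
    """Concatenate fresh random noise with the one-hot label of the digit."""
    if not 0 <= digit < NUM_DIGITS:
        raise ValueError("digit must be between 0 and 9")
    noise = [rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)]
    return noise + one_hot(digit)


rng = random.Random(42)
x = generator_input(7, rng)
print(len(x))   # 110
print(x[-10:])  # [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0]
```

Calling generator_input(7, rng) repeatedly gives different noise but the same label, which is how a trained cGAN would produce varied images of the same digit.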

(Citation: Train Conditional Generative Adversarial Network (CGAN) - MATLAB & Simulink)

Use cases of cGANs

Image-to-image translation
Text-to-image synthesis
Video generation
Convolutional face generation
