Generative AI
Generative Adversarial Networks
Discover the world at Leiden University
Outline
What are Generative Adversarial Networks?
Extensions to ‘Vanilla’ Generative Adversarial Networks
Applications of Generative Adversarial Networks
Tutorial Exercise
What are Generative Adversarial Networks?
What are Generative Adversarial Networks?
An Adversarial Game
Generative adversarial networks are based on a game, in the sense of game theory, between two machine learning models.
The generator defines p_model(x) implicitly: it cannot evaluate the density function p_model, but it can draw samples from it.
A prior distribution p(z) over a vector z is used as input to a generator function G(z; θ^(G)), where θ^(G) is a set of learnable parameters defining the generator's strategy in the game.
The prior distribution p(z) is typically unstructured, e.g., a high-dimensional normal distribution. Consequently, samples z are noise.
The generator must learn the function G(z) that transforms unstructured noise z into realistic samples.
The discriminator examines samples x and returns an estimate D(x; θ^(D)) of whether x is real (drawn from p_data) or fake (drawn from p_model by running the generator).
Each player incurs a cost (a loss): J^(G)(θ^(G), θ^(D)) for the generator and J^(D)(θ^(G), θ^(D)) for the discriminator. Each player attempts to minimise its own cost.
The discriminator's cost encourages it to correctly classify data as real or fake.
The generator's cost encourages it to generate samples that the discriminator incorrectly classifies as real.
There have been different formulations of these loss functions.
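These costs can be made concrete. The following is a minimal NumPy sketch (not the lecture's code) of the standard discriminator loss and the non-saturating generator loss from the original GAN formulation; d_real and d_fake are the discriminator's probability outputs on real and generated samples.

```python
import numpy as np

def discriminator_loss(d_real, d_fake, eps=1e-8):
    """J^(D): encourage D(x) -> 1 on real samples and D(G(z)) -> 0 on fakes."""
    return -np.mean(np.log(d_real + eps)) - np.mean(np.log(1.0 - d_fake + eps))

def generator_loss(d_fake, eps=1e-8):
    """Non-saturating J^(G): encourage D(G(z)) -> 1, i.e. fakes classed as real."""
    return -np.mean(np.log(d_fake + eps))

# A perfectly confused discriminator outputs 0.5 everywhere:
d = np.full(4, 0.5)
print(discriminator_loss(d, d))  # ≈ 2·ln 2 ≈ 1.386
print(generator_loss(d))         # ≈ ln 2 ≈ 0.693
```

The generator's loss falls as the discriminator assigns higher probability of "real" to the fakes, which is exactly the incentive described above.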
Generative Adversarial Networks (GANs)
Discriminator: learns to classify inputs as "fake" or "real" using "ground truth" examples. Goal: minimise the classification error.
Generator: learns to transform a random vector into outputs capable of fooling the discriminator into classifying them as "real". Goal: maximise the classification error.
Schematic of a Generative Adversarial Network
Training GANs
The key to the success of GANs is how training is alternated between the two networks:
At the start, the generator outputs noisy images and the discriminator predicts randomly.
By training the generator, it becomes better at generating fake observations.
By training the discriminator, it becomes better at identifying fake observations.
As the generator improves, the discriminator must adapt to identify fakes.
As the discriminator improves, the generator must find new ways to produce fakes.
Schematic of Generative Adversarial Network Training
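The alternating scheme can be sketched end-to-end on a toy problem. This is an illustrative sketch, not the tutorial's code: the real data is 1-D Gaussian noise around 3, the "generator" is a single learnable shift b applied to noise, the "discriminator" is a logistic regression, and the gradients are derived by hand.

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda t: 1.0 / (1.0 + np.exp(-t))

w, c = 0.1, 0.0   # discriminator D(x) = sigmoid(w*x + c)
b = 0.0           # generator G(z) = z + b; real data ~ N(3, 1)

for step in range(3000):
    real = rng.normal(3.0, 1.0, 64)
    fake = rng.normal(0.0, 1.0, 64) + b

    # Discriminator step: minimise -log D(real) - log(1 - D(fake))
    d_real, d_fake = sigmoid(w * real + c), sigmoid(w * fake + c)
    grad_w = -np.mean((1 - d_real) * real) + np.mean(d_fake * fake)
    grad_c = -np.mean(1 - d_real) + np.mean(d_fake)
    w -= 0.05 * grad_w
    c -= 0.05 * grad_c

    # Generator step: minimise -log D(fake) (non-saturating loss)
    fake = rng.normal(0.0, 1.0, 64) + b
    d_fake = sigmoid(w * fake + c)
    b -= 0.05 * -np.mean((1 - d_fake) * w)

print(b)  # the generator's shift should drift towards the real mean of 3
```

Early in training the discriminator easily separates the two distributions; as b approaches the real mean, it can no longer distinguish them and its weights shrink towards the "predict randomly" state, mirroring the dynamics described above.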
Extensions to Generative Adversarial Networks
Conditional GANs
Conditioning the generator on some data other than the noise vector provides contextual information.
The discriminator may use the label to decide real versus fake, or may be trained to classify images.
Schematic of a Conditional GAN
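In the simplest conditioning scheme (an illustrative sketch, not taken from the slides), the label is one-hot encoded and concatenated onto the noise vector before it enters the generator:

```python
import numpy as np

def conditional_input(z, label, num_classes):
    """Concatenate a one-hot class label onto the noise vector z."""
    onehot = np.zeros(num_classes)
    onehot[label] = 1.0
    return np.concatenate([z, onehot])

z = np.random.default_rng(0).normal(size=100)  # noise vector
x = conditional_input(z, label=3, num_classes=10)
print(x.shape)  # (110,): the generator sees noise plus the class label
```

The discriminator can be conditioned the same way, so that a sample only counts as "real" if it is both realistic and consistent with the given label.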
Conditional Image Generation
Examples of Outputs from a Conditional GAN
InfoGAN
InfoGAN is similar to a Conditional GAN, but instead of a label the goal is to generate codes (c) that will organise the latent space.
The latent codes are learned as part of the training process by another network, Q.
Schematic of InfoGAN
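The generator input in InfoGAN is a concatenation of unstructured noise z and structured codes c; the auxiliary network Q is trained to recover c from the generated sample. A sketch of the input construction, using the latent sizes reported for MNIST in the InfoGAN paper:

```python
import numpy as np

rng = np.random.default_rng(0)

noise_dim, n_categories, n_continuous = 62, 10, 2  # MNIST sizes from the paper
z = rng.normal(size=noise_dim)                     # unstructured noise
c_cat = np.eye(n_categories)[rng.integers(n_categories)]  # categorical code
c_cont = rng.uniform(-1, 1, n_continuous)          # continuous codes

latent = np.concatenate([z, c_cat, c_cont])
print(latent.shape)  # (74,): the generator sees noise and codes together
```

Because Q must predict c from the output alone, the generator is pushed to make each code control a visible, disentangled factor (e.g. digit class, rotation, stroke width).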
Illustration of organisation of latent space in InfoGAN (Source: Chen et al, 2016)
Applications of GANs
Example Applications of GANs
Image Generation: generate an image based on an existing dataset, e.g., DCGAN
High Quality Image Generation: generate HQ images e.g., ProGAN, BigGAN
Image-to-Image Translation: convert one class of image to another, e.g., pix2pix
Image Super-resolution: from low resolution to high resolution images, e.g., SRGAN
Next Frame Prediction: generate the next frame in a video, e.g., FutureGAN
Text-to-Image Generation: generate an image from a text description, e.g., StackGAN
Text-to-Speech Generation: generate speech from text input, e.g., GAN-TTS
Image Generation
Deep Convolutional Generative Adversarial Network (DCGAN)
Extended the original GAN architecture to use convolutional layers.
Greatly improves the ability of the generator to produce images and of the discriminator to classify images.
Demonstrated the ability to perform vector arithmetic in the latent space of noise vectors input to the generator.
Source: Radford et al (2016)
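The vector-arithmetic result can be sketched as follows. In the paper's experiments, averaging the latent vectors of images sharing an attribute and combining the averages, e.g. "smiling woman" - "neutral woman" + "neutral man", yields a latent vector that decodes to a smiling man. The arrays below are stand-ins, not real DCGAN latents:

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 100  # DCGAN used 100-dimensional latent vectors

# Stand-ins for averaged latent vectors of images sharing an attribute:
smiling_woman = rng.normal(size=(3, dim)).mean(axis=0)
neutral_woman = rng.normal(size=(3, dim)).mean(axis=0)
neutral_man   = rng.normal(size=(3, dim)).mean(axis=0)

# Arithmetic in latent space; feeding `result` to a trained generator
# produced an image of a smiling man in the paper's experiments.
result = smiling_woman - neutral_woman + neutral_man
print(result.shape)  # (100,)
```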
High Quality Image Generation
ProGAN progressively increases the size of the images generated as training progresses.
Allows stable learning of much higher quality images than previous approaches.
Demonstrated the ability of GANs to work with high quality images.
Source: Karras et al (2018)
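One detail from the ProGAN paper makes the progressive growth stable: when a new, higher-resolution layer is added, it is faded in smoothly. The output is a blend of the upsampled lower-resolution image and the new layer's output, weighted by a factor alpha that ramps from 0 to 1. A minimal NumPy sketch of that blend:

```python
import numpy as np

def upsample2x(img):
    """Nearest-neighbour 2x upsampling of an (H, W) image."""
    return np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)

def fade_in(lowres_img, new_layer_img, alpha):
    """Blend the upsampled low-res output with the new layer's output.

    alpha ramps 0 -> 1 during training, so the new layer takes over gradually.
    """
    return (1.0 - alpha) * upsample2x(lowres_img) + alpha * new_layer_img

low = np.ones((4, 4))           # output of the 4x4 stage
new = np.zeros((8, 8))          # output of the freshly added 8x8 stage
out = fade_in(low, new, 0.25)   # still mostly the upsampled 4x4 image
print(out.shape)                # (8, 8)
```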
This Person Does Not Exist
Source: https://thispersondoesnotexist.com
High Quality Image Generation
BigGAN
Combined multiple improvements and large-scale training to build a large model of images.
Latent Space
Random variables provided to the generator define a space that can be sampled to produce images not in the training set.
Source: Brock, Donahue and Simonyan, 2018
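Sampling that latent space is where BigGAN's "truncation trick" comes in: any latent components falling outside a threshold are resampled, trading diversity for sample fidelity. An illustrative NumPy sketch (the latent dimension here is a placeholder):

```python
import numpy as np

def truncated_normal(size, threshold, rng):
    """Sample z ~ N(0, I), resampling components with |z_i| > threshold."""
    z = rng.normal(size=size)
    while True:
        mask = np.abs(z) > threshold
        if not mask.any():
            return z
        z[mask] = rng.normal(size=mask.sum())

rng = np.random.default_rng(0)
z = truncated_normal(128, threshold=0.5, rng=rng)  # 128 is an illustrative size
print(np.abs(z).max() <= 0.5)  # True: every component is within the threshold
```

A small threshold keeps samples near the centre of the latent distribution, where the generator is most reliable; a large threshold restores the full diversity of the space.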
Image-to-Image Translation
pix2pix and other image-to-image translation GANs can perform multiple tasks:
Semantic images to photos
Satellite photos to maps
Day to night conversion
Black & white to colour
Sketches to photos
Photo in-painting
Thermal to RGB
Daytime to nighttime conversion (Source: Isola et al, 2016)
Semantic image to photo translation (Source: Isola et al, 2016)
Sketch to photo conversion (Source: Isola et al, 2016)
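In pix2pix the discriminator judges pairs: the input image and a candidate output are concatenated along the channel axis, so it learns whether the output is a plausible translation of that particular input, not merely a plausible image. A sketch with illustrative shapes:

```python
import numpy as np

def discriminator_input(condition_img, candidate_img):
    """Stack the conditioning image and the candidate output channel-wise."""
    return np.concatenate([condition_img, candidate_img], axis=-1)

edges = np.zeros((256, 256, 1))   # e.g. an edge map / sketch input
photo = np.zeros((256, 256, 3))   # real or generated photo
pair = discriminator_input(edges, photo)
print(pair.shape)  # (256, 256, 4): the discriminator sees input and output together
```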
Learning to See: Gloomy Sunday (Source: Memo Akten)
Image Super-Resolution
Super-Resolution (SR) GANs have generators that are trained to convert low resolution images to high resolution images.
The input to the generator is a combination of the low resolution image and a noise vector.
They were shown to outperform the state-of-the-art, producing sharp details in SR images.
Comparison of image super-resolution (Source: Ledig et al, 2017)
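In the SRGAN paper the generator's loss combines a content term (a pixel- or feature-space distance to the ground-truth high-resolution image) with an adversarial term weighted by 10^-3. A sketch of that combination, using plain MSE as a stand-in for the content term:

```python
import numpy as np

def srgan_generator_loss(sr, hr, d_sr, adv_weight=1e-3, eps=1e-8):
    """Perceptual loss = content loss + weighted adversarial loss.

    sr: super-resolved image, hr: ground-truth high-res image,
    d_sr: discriminator's probability that sr is a real high-res image.
    """
    content = np.mean((sr - hr) ** 2)  # MSE stand-in for the content term
    adversarial = -np.log(d_sr + eps)  # encourage D(sr) -> 1
    return content + adv_weight * adversarial

sr = np.zeros((8, 8)); hr = np.ones((8, 8))
print(srgan_generator_loss(sr, hr, d_sr=0.5))  # 1.0 + 1e-3·ln 2 ≈ 1.0007
```

The small adversarial weight is what nudges the generator away from the blurry averages that pure MSE produces, towards the sharp details noted above.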
Photo In-painting
Photo in-painting requires the generator to be conditioned on an image with a missing section and to produce a plausible completed image.
Although the Context Encoder model shares many of the features of a GAN, it is not referred to in the paper as a GAN model.
Examples of photo in-painting (Source: Pathak et al, 2016)
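Conditioning on an image with a missing section can be sketched by zeroing out a region; the generator then fills the hole, and the loss is computed over the masked region. An illustrative NumPy sketch (not the Context Encoder's code):

```python
import numpy as np

def mask_center(img, hole):
    """Zero out a central (hole x hole) region; return the masked image and mask."""
    h, w = img.shape[:2]
    top, left = (h - hole) // 2, (w - hole) // 2
    mask = np.zeros((h, w), dtype=bool)
    mask[top:top + hole, left:left + hole] = True
    masked = img.copy()
    masked[mask] = 0.0
    return masked, mask

img = np.ones((64, 64))
masked, mask = mask_center(img, hole=32)
print(int(masked.sum()))  # 64*64 - 32*32 = 3072 pixels remain
```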
Next Frame Prediction
FutureGAN is an example of a GAN with a generator trained to predict the next frame in a video.
The input to the generator is conditioned on one or more previous frames, and the generator is required to produce the next frame.
FutureGAN builds on ProGAN and takes a progressive approach to training the network.
Examples of next frame prediction (See: Aigner and Körner, 2018)
Text-to-Image Generation
text2image goes further and learns a mapping from natural language descriptions to images.
Text is first encoded, e.g., with an LSTM, and combined with noise.
Early papers showed the ability to generate low resolution images.
StackGAN showed the output could be improved using a pair of GANs.
An architecture for text-to-image generation (Source: Reed et al, 2016)
Architecture of StackGAN (Source: Zhang et al, 2017)
Examples of image improvement in StackGAN (Source: Zhang et al, 2017)
CAN: Creative Adversarial Networks (Elgammal et al., 2017)
Adjust the loss function to produce novel styles.
Discriminator: minimise the Real/Fake (Art/Not Art) error and the art-style classification error.
Generator: maximise the Real/Fake (Art/Not Art) error and the style ambiguity.
Source: Creative Adversarial Networks
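Style ambiguity in the CAN paper is a cross-entropy between the discriminator's style-class predictions and the uniform distribution: the generator is rewarded for works that no single style can claim. An illustrative sketch:

```python
import numpy as np

def style_ambiguity_loss(style_probs, eps=1e-8):
    """Cross-entropy between the predicted style distribution and the uniform one.

    Minimised when the discriminator is maximally unsure which art style
    the generated work belongs to.
    """
    k = style_probs.shape[-1]
    uniform = np.full(k, 1.0 / k)
    return -np.sum(uniform * np.log(style_probs + eps))

confident = np.array([0.97, 0.01, 0.01, 0.01])  # clearly one style: high loss
ambiguous = np.full(4, 0.25)                    # no clear style: minimal loss
print(style_ambiguity_loss(confident) > style_ambiguity_loss(ambiguous))  # True
```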
GAN Challenges
GAN Challenges
Uninformative Loss: the value of the loss is less informative than in traditional networks, making training trickier.
Oscillating Loss: the loss of the discriminator and the generator can start to oscillate wildly, rather than exhibiting long-term stability.
Oscillating Loss (Source: Generative Deep Learning)
Mode Collapse
If the generator finds a small number of outputs that fool the discriminator, the pressure on the generator to produce diverse outputs reduces dramatically.
The generator tends to map every point in the latent space to these outputs.
The gradient of the loss function collapses to near 0.
Mode collapse results in outputs being very similar (Source: Generative Deep Learning)
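One quick diagnostic for mode collapse (an illustrative heuristic, not from the slides) is to measure the diversity of a batch of generated samples, e.g. the mean pairwise distance; a collapsed generator produces near-identical outputs and a value near zero:

```python
import numpy as np

def mean_pairwise_distance(samples):
    """Average Euclidean distance between all pairs in a batch of samples."""
    n = len(samples)
    dists = [np.linalg.norm(samples[i] - samples[j])
             for i in range(n) for j in range(i + 1, n)]
    return float(np.mean(dists))

rng = np.random.default_rng(0)
healthy = rng.normal(size=(16, 64))                # diverse outputs
collapsed = np.tile(rng.normal(size=64), (16, 1))  # every output identical
print(mean_pairwise_distance(healthy) > 1.0)       # True
print(mean_pairwise_distance(collapsed))           # 0.0
```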
Tutorial Exercise
Today’s tutorial exercise is to build and train a GAN on the MNIST dataset.
The GAN is a Deep Convolutional GAN, so it is able to learn high-level features from the MNIST dataset.
The tutorial includes a graded assignment to apply and extend the approach to a different dataset.
Some suggestions are given for other small datasets, but even these will require some experimentation with the architecture of the GAN to be effective.