Score-matching-project

Project on Generative Modeling by Score Estimation. This work was conducted as part of the Probabilistic Graphical Models and Deep Generative Models class given by Pr. Pierre LATOUCHE and Pr. Pierre-Alexandre Mattei (MVA Master, December 2023).

This work is mainly based on the paper Generative Modeling by Estimating Gradients of the Data Distribution by Yang Song and Stefano Ermon (2019), and presents as well Score Matching techniques developed in A Connection Between Score Matching and Denoising Autoencoders (Vincent, 2011) and Estimation of Non-Normalized Statistical Models by Score Matching (Hyvärinen, 2005).

Introduction

The goal of this project is to present the techniques used in score-based generative models. The authors propose a new method for generative modeling based on the estimation of the score function of the data distribution. The score function is estimated by a neural network conditioned by a noise parameter perturbing the original data. The generative process is then performed using an Annealed version of the Monte-Carlo Langevin Dynamics. The authors have shown at the time of publication that their method was competitive with the then state-of-the-art methods on the MNIST, CIFAR10 and CelebA datasets.

Method

We give some context regarding generative modeling and the relevance of score-based models. We present the method proposed by the authors in the paper and perform toy experiments to illustrate the method and its limitations.

Experiments

Score Matching on Toy Distributions

We first perform score matching on toy distributions to illustrate the method. We use the following distributions:

Gaussian Mixture Model (GMM) with 2 components
Banana-shaped distribution

A Score Network is trained on each dataset to estimate the score function of the data distribution. We compare the obtained vector field with the true score function, and show the distance between the two with respect to to the $\ell_2$-norm.

Langevin Dynamics on Toy Distributions

We then perform Langevin Dynamics on the toy distributions to illustrate the method. We plot the trajectories of the particles in the true vector field.

Trajectories of the chains

GMM	Banana

Monte-Carlo Langevin Dynamics

GMM	Banana

We observe in the GMM case that the sampling can't reconcile properly the proportions between the two modes of the distribution.

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
assets		assets
dataset		dataset
mcmc_sampling		mcmc_sampling
notebooks		notebooks
score_matching		score_matching
utils		utils
.gitignore		.gitignore
Poster.pdf		Poster.pdf
README.md		README.md
Report.pdf		Report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Score-matching-project

Introduction

Method

Experiments

Score Matching on Toy Distributions

Langevin Dynamics on Toy Distributions

Trajectories of the chains

Monte-Carlo Langevin Dynamics

About

Uh oh!

Releases

Packages

Languages

HalvardBariller/Score-based-generative-modeling-

Folders and files

Latest commit

History

Repository files navigation

Score-matching-project

Introduction

Method

Experiments

Score Matching on Toy Distributions

Langevin Dynamics on Toy Distributions

Trajectories of the chains

Monte-Carlo Langevin Dynamics

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages