Image Generation¶

Variational Autoencoder (VAE)¶

Autoencoder Variational Autoencoder Image Generation

A Variational Autoencoder (VAE) is a generative model that learns to encode input data into a probabilistic latent space and reconstruct it through a decoder. Unlike traditional autoencoders, VAEs model uncertainty by learning a distribution over latent variables, enabling smooth sampling and interpolation.

D. P. Kingma and M. Welling, “Auto-Encoding Variational Bayes,” International Conference on Learning Representations (ICLR), 2014.

Name	Model	Input Shape
VAE	VAE	\((N,C,H,W)\)

DDPM¶

Diffusion Image Generation

A Denoising Diffusion Probabilistic Model (DDPM) is a generative model that learns to generate data by reversing a gradual noising process. It adds Gaussian noise over several timesteps and trains a neural network to denoise and recover the original data distribution through a Markovian process.

J. Ho, A. Jain, and P. Abbeel, “Denoising Diffusion Probabilistic Models,” Advances in Neural Information Processing Systems (NeurIPS), 2020.

Name	Model	Input Shape	Parameter Count
DDPM	DDPM	\((N,C,H,W)\)	20,907,649 (Default)

To be implemented…🔮