Skip to content

Instantly share code, notes, and snippets.

@fblissjr
Forked from madebyollin/notes_on_sd_vae.md
Created September 5, 2024 21:35
Show Gist options
  • Save fblissjr/55c52ff410dd8ea6375a9fd7052c131e to your computer and use it in GitHub Desktop.
Save fblissjr/55c52ff410dd8ea6375a9fd7052c131e to your computer and use it in GitHub Desktop.
notes_on_sd_vae

Notes / Links about Stable Diffusion VAE

Stable Diffusion's VAE is a neural network that encodes images into a compressed "latent" format and decodes them back. The encoder performs 48x lossy compression, and the decoder generates new detail to fill in the gaps.

(Calling this model a "VAE" is sort of a misnomer - it's an encoder with some very slight KL regularization, and a conditional GAN decoder)

This document is a big pile of various links with more info.

VAE Versions & Lineage

Other SD-VAE-related Codebases

Other Info

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment