Image Generation Using Text - Stable Diffusion
Introduction

Stable Diffusion is a text-to-image model trained on 512x512 images from a subset of the LAION-5B dataset. The goal of this notebook is to demonstrate how easily you can implement text-to-image generation using the 🤗 Diffusers library, the go-to library for state-of-the-art pre-trained diffusion models for generating images, audio, and 3D structures. Before jumping into the code, however, we need to understand what exactly Stable Diffusion is.

What is Stable Diffusion?

Architecture of Stable Diffusion

Stable Diffusion is based on a type of diffusion model called Latent Diffusion, which is described in detail in the paper High-Resolution Image Synthesis with Latent Diffusion Models.

Diffusion models are a type of generative model trained to denoise an object, such as an image, to obtain a sample of interest. The model is trained to slightly denoise the image in each step, until a sample is ...