Posts

Showing posts from May, 2024

Image Generation Using Text - Stable Diffusion

Introduction

Stable Diffusion is a text-to-image model trained on 512x512 images from a subset of the LAION-5B dataset. The goal of this notebook is to demonstrate how easily you can implement text-to-image generation using the 🤗 Diffusers library, which is the go-to library for state-of-the-art pre-trained diffusion models for generating images, audio, and 3D structures. Before jumping into the coding, however, we need to know what exactly Stable Diffusion is.

What is Stable Diffusion?

Architecture of Stable Diffusion

Stable Diffusion is based on a type of diffusion model called Latent Diffusion, whose details are given in the paper High-Resolution Image Synthesis with Latent Diffusion Models. Diffusion models are a type of generative model trained to denoise an object, such as an image, to obtain a sample of interest. The model is trained to slightly denoise the image in each step, until a sample is ...
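
The step-by-step denoising idea described above can be illustrated with a small toy loop. This is only a sketch under loud assumptions, not Stable Diffusion's actual sampler: the function name `toy_denoise`, the `rate` parameter, and the "perfect" noise prediction are all hypothetical stand-ins, and a real diffusion model replaces that prediction with a trained neural network.

```python
import random


def toy_denoise(target, steps=50, rate=0.2, seed=0):
    """Toy illustration of iterative denoising (hypothetical, not a real sampler).

    Starts from pure Gaussian noise and removes a small fraction of the
    remaining noise at every step, so the sample drifts toward `target`.
    """
    rng = random.Random(seed)
    # Start from pure noise, one value per "pixel".
    sample = [rng.gauss(0.0, 1.0) for _ in target]

    for _ in range(steps):
        # ASSUMPTION: we cheat and compute the noise exactly as the gap
        # between the current sample and the target. A trained model would
        # have to estimate this from the noisy sample alone.
        predicted_noise = [s - t for s, t in zip(sample, target)]
        # Denoise only slightly: subtract a fraction of the predicted noise.
        sample = [s - rate * n for s, n in zip(sample, predicted_noise)]

    return sample


if __name__ == "__main__":
    clean = [0.0, 0.5, 1.0]  # a tiny made-up "image" with three pixels
    print(toy_denoise(clean))
```

Each pass shrinks the remaining noise by a constant factor of `1 - rate`, so the sample gets closer to the clean values step by step instead of jumping there in one move.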