Chapter 27

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Forward Diffusion Processes

Diffusion models are generative models built around a simple idea: learn to reverse a gradual corruption process.

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Reverse Denoising Processes

The forward diffusion process gradually transforms data into noise.

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Score Matching

Diffusion models can be understood from multiple mathematical viewpoints.

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Noise Schedules

A diffusion model needs a rule for how noise increases during the forward process.

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Latent Diffusion

Early diffusion models operated directly in pixel space. A model generated images by iteratively denoising tensors such as

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Text-to-Image Systems

Text-to-image generation aims to synthesize images from natural language descriptions. A model receives a prompt such as:

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Video diffusion extends image diffusion from still images to moving sequences. Instead of generating one image, the model generates a sequence of frames that should remain visually coherent over time.

Writes › Book › Deep Learning with PyTorch › Part VIII › Chapter 27 ›

Diffusion Transformers

Early diffusion systems used convolutional U-Nets as denoising networks. U-Nets worked well because images contain strong local structure, and convolutions efficiently model nearby spatial relationships.

Sections

Forward Diffusion Processes

Reverse Denoising Processes

Score Matching

Noise Schedules

Latent Diffusion

Text-to-Image Systems

Video Diffusion Systems

Diffusion Transformers