Theoretical Foundations of Diffusion Models
Denoising diffusion models achieve state-of-the-art quality on image generation
tasks. In this line of work we introduce a deterministic framework for reasoning
about, improving and potentially discovering new applications of diffusion
models. We interpret diffusion models as projection onto the support of the
training set, with sampling as approximate gradient descent on the distance
function to this set. Applying this interpretation, we derive a simple yet
efficient diffusion sampler, as well as a framework for incorporating
constraints (such as minimizing the drag coefficient of vehicle images) into the
generation process.
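As a rough illustration of the interpretation above, the following toy sketch runs gradient descent on the squared distance to a small point set. It is not the sampler from the papers listed below: a learned denoiser is replaced here by exact nearest-point projection, and the names `project_onto_set`, `descend_on_distance`, the toy `train` array, the step size and the step count are all hypothetical choices made for this example. It relies only on the standard fact that, wherever the nearest point is unique, the gradient of f(x) = 0.5 * dist(x, K)^2 is x - proj_K(x).

```python
import numpy as np

def project_onto_set(x, train):
    """Return the point of `train` nearest to x (exact projection onto a finite set)."""
    dists = np.linalg.norm(train - x, axis=1)
    return train[np.argmin(dists)]

def descend_on_distance(x0, train, step=0.5, n_steps=20):
    """Gradient descent on f(x) = 0.5 * dist(x, train)^2.

    The gradient is x - proj(x), so each update moves x a fraction
    `step` of the way toward its current nearest training point.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(n_steps):
        x = x - step * (x - project_onto_set(x, train))
    return x

# Hypothetical toy "training set" of three 2-D points (an assumption for this sketch).
train = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])

# Start away from the data and descend toward the set.
x_start = np.array([2.0, 0.3])
x_final = descend_on_distance(x_start, train)
print(x_final)
```

In this toy setting the iterate contracts toward the nearest training point, here (1, 0). In an actual diffusion model the projection is only approximated by a trained denoiser, which is why the text above describes sampling as approximate gradient descent.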
Frank Permenter* and Chenyang Yuan*, “Interpreting and Improving Diffusion
Models from an Optimization Perspective”, ICML 2024.
[arxiv] [poster] [code]
Nikos Arechiga*, Frank Permenter*, Binyang Song* and Chenyang Yuan*,
“Drag-guided diffusion models for vehicle image generation”, NeurIPS 2023 Workshop on Diffusion Models.
[arxiv] [poster]
Binyang Song, Chenyang Yuan, Frank Permenter, Nikos Arechiga and Faez Ahmed,
“Surrogate Modeling of Car Drag Coefficient with Depth and Normal Renderings”,
IDETC 2023.
[arxiv]
Talks:
- Interpreting and improving diffusion models from an optimization perspective [slides]