Diffusion Models

发布日期: 2022-03-11

2022-03-11 更新

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

Authors:Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn

Diffusion probabilistic models (DPMs) have achieved remarkable quality in image generation that rivals GANs’. But unlike GANs, DPMs use a set of latent variables that lack semantic meaning and cannot serve as a useful representation for other tasks. This paper explores the possibility of using DPMs for representation learning and seeks to extract a meaningful and decodable representation of an input image via autoencoding. Our key idea is to use a learnable encoder for discovering the high-level semantics, and a DPM as the decoder for modeling the remaining stochastic variations. Our method can encode any image into a two-part latent code, where the first part is semantically meaningful and linear, and the second part captures stochastic details, allowing near-exact reconstruction. This capability enables challenging applications that currently foil GAN-based methods, such as attribute manipulation on real images. We also show that this two-level encoding improves denoising efficiency and naturally facilitates various downstream tasks including few-shot conditional sampling. Please visit our project page: https://Diff-AE.github.io/
PDF Please visit our project page: https://Diff-AE.github.io/

论文截图

Harvey

https://ipaper.today/2022/03/11/2022-03-11-diffusion-models/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源 Harvey !

Diffusion Models

Vision Transformer

2022-03-11 Vision Transformer

Vision Transformer

GAN

2022-03-11 GAN

GAN

Diffusion Models

2022-03-11 更新

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

打赏用于支持本站流量费