Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data

Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data

2024 | Giannis Daras, Alexandros G. Dimakis, Constantinos Daskalakis
This paper introduces a novel framework for training diffusion models using only noisy data, enabling them to sample from the uncorrupted distribution. The key contributions are: (1) an efficient method for learning optimal denoisers for all noise levels $ \sigma \geq \sigma_n $, achieved by applying Tweedie's formula twice; and (2) a consistency loss function for learning denoisers at noise levels $ \sigma \leq \sigma_n $. The framework addresses the challenge of training diffusion models from corrupted data, which has been hindered by previous approximations. The method reduces memorization of training data while maintaining competitive performance. Experiments show that the framework outperforms existing approaches in terms of memorization reduction and generates high-quality samples. The framework is applied to fine-tune Stable Diffusion XL using corrupted data, demonstrating its effectiveness in generating samples from a distribution using only noisy data. The method also mitigates the issue of memorization, which can lead to copyright and privacy concerns. The framework is open-sourced for further research.This paper introduces a novel framework for training diffusion models using only noisy data, enabling them to sample from the uncorrupted distribution. The key contributions are: (1) an efficient method for learning optimal denoisers for all noise levels $ \sigma \geq \sigma_n $, achieved by applying Tweedie's formula twice; and (2) a consistency loss function for learning denoisers at noise levels $ \sigma \leq \sigma_n $. The framework addresses the challenge of training diffusion models from corrupted data, which has been hindered by previous approximations. The method reduces memorization of training data while maintaining competitive performance. Experiments show that the framework outperforms existing approaches in terms of memorization reduction and generates high-quality samples. The framework is applied to fine-tune Stable Diffusion XL using corrupted data, demonstrating its effectiveness in generating samples from a distribution using only noisy data. The method also mitigates the issue of memorization, which can lead to copyright and privacy concerns. The framework is open-sourced for further research.
Reach us at info@study.space
[slides] Consistent Diffusion Meets Tweedie%3A Training Exact Ambient Diffusion Models with Noisy Data | StudySpace