Open
Description
Really nice work! When reading through the paper, I have some questions about the proposed soft-min-snr loss. Would appreciate your feedback on this.
- In eq (5) of the hourglass diffusion transformers, it's mentioned that
c_out^{-2}(\sigma)
is incorporated, however, based on the definition ofc_out
, eq (5) should be
min(SNR, \gamma) * (\sigma_data^2 + \sigma^2) / (\sigma_data^2 * \sigma^2).
- In the implementation:
k-diffusion/k_diffusion/layers.py
Lines 64 to 65 in 6ab5146
The \gamma=4 or 5
proposed in the paper doesn't seem to be used. Am I missing anything here?
Metadata
Metadata
Assignees
Labels
No labels