VAE Loss: The weight of BCE vs. KL

The BCE loss is averaged over the **batch dimension** only, but the KL term is averaged over both the **batch** and **image pixel dimensions**. I would like to understand the reason for this.
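
To make the two reductions concrete, the loss as described above can be written out as follows. The notation is an assumption for illustration and is not taken from the code: $B$ is the batch size, $D$ the number of pixels per image, $\hat{x}$ the reconstruction, and "averaged over the batch dimension" is read as summing over pixels and then dividing by $B$.

```latex
\mathcal{L}
  = \frac{1}{B} \sum_{i=1}^{B} \sum_{d=1}^{D} \mathrm{BCE}\big(x_{id}, \hat{x}_{id}\big)
  + \frac{1}{B D} \sum_{i=1}^{B} \mathrm{KL}\big(q(z \mid x_i) \,\|\, p(z)\big)
```

Under this reading, the KL term carries an extra factor of $1/D$ relative to the reconstruction term.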

I tried both of the following:
- averaging the BCE loss over both dimensions
- averaging the KL loss over the batch dimension only

However, the reconstructed images and the images generated at test time are rather faint and blurry (they look like the digit 8), which is far worse than with the current setting.

If the current setting is only meant to balance the two losses, why scale by the number of image pixels (which intuitively ought to have some mathematical meaning)? Why not use a number like 10, 100, or 1000?
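
One way to see what the pixel-count scaling does is to compute the weight it puts on the KL term relative to the reconstruction term. The sketch below is only bookkeeping arithmetic under assumptions that are not in the original code: hypothetical values `B = 128` and `D = 784`, a helper named `effective_kl_weight` invented here, and both losses summed over all elements before being divided.

```python
from fractions import Fraction

B = 128  # batch size (hypothetical value)
D = 784  # pixels per image, e.g. 28 x 28 (hypothetical value)


def effective_kl_weight(bce_divisor: int, kl_divisor: int) -> Fraction:
    """Return beta such that

        sum(BCE) / bce_divisor + sum(KL) / kl_divisor
            == (sum(BCE) + beta * sum(KL)) / bce_divisor

    i.e. the weight of the KL term relative to the reconstruction term.
    """
    return Fraction(bce_divisor, kl_divisor)


# (BCE divisor, KL divisor) for each combination of reductions.
settings = {
    "BCE / B,     KL / (B*D)": (B, B * D),
    "BCE / (B*D), KL / (B*D)": (B * D, B * D),
    "BCE / B,     KL / B": (B, B),
    "BCE / (B*D), KL / B": (B * D, B),
}

weights = {name: effective_kl_weight(*divs) for name, divs in settings.items()}

for name, beta in weights.items():
    print(f"{name}: effective KL weight = {beta}")
```

Under these assumptions, dividing only the KL term by the pixel count is equivalent to weighting it by 1/D, while the variants that use the same divisor for both terms leave the weight at 1 and change only the overall scale of the loss.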

Thanks for your help.

VAE Loss: The weight of BCE vs. KL #234
