One possible reason LDM training fails is that the latent representation produced by the autoencoder can have an inappropriate range of values. This issue is more critical for the upscalers, but it can also affect image generation. Would it be okay to include an extra step in the LDM tutorials that calculates the scale factor (similar to what is already discussed in the 2D super-resolution tutorial)? What do you think, @SANCHES-Pedro @JessyD?
GenerativeModels/tutorials/generative/2d_super_resolution/2d_stable_diffusion_v2_super_resolution.py
Line 302 in fd362dc
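
To make the proposal concrete, here is a rough, framework-free sketch of the arithmetic behind such a step, assuming the scale factor is taken as the reciprocal of the standard deviation of the latents from one batch. The helper `compute_scale_factor` and the synthetic `latents` list are hypothetical stand-ins, not code from the tutorials; with PyTorch tensors the same quantity would typically be computed via `torch.std`.

```python
import random
import statistics


def compute_scale_factor(latents):
    """Return 1 / std of the latent values, so the scaled latents have unit std.

    Hypothetical helper for illustration only.
    """
    std = statistics.pstdev(latents)
    if std == 0:
        raise ValueError("latents have zero variance; scale factor is undefined")
    return 1.0 / std


# Synthetic stand-in for the flattened latents of one encoded batch.
# Assumption: in a tutorial these values would come from the trained autoencoder.
rng = random.Random(0)
latents = [rng.gauss(0.0, 5.0) for _ in range(10_000)]

scale_factor = compute_scale_factor(latents)

# Latents are multiplied by the scale factor before the diffusion model sees them.
scaled = [z * scale_factor for z in latents]
scaled_std = statistics.pstdev(scaled)

print(f"std before: {statistics.pstdev(latents):.3f}")
print(f"scale factor: {scale_factor:.4f}")
print(f"std after: {scaled_std:.3f}")
```

If a step like this were added, the sampled latents would also need to be divided by the same scale factor before being passed to the decoder, so that training and inference stay consistent.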