Reproduce layout results as in the original stable diffusion paper? #19

@CreamyLong

Description

@CreamyLong, have you reproduced the layout results from the original stable diffusion paper?

I used your layout config and trained for 10 epochs (COCO dataset, batch size = 16), but the images logged during the training phase are as follows,
and I do not think they are correct training results. The training loss has always stayed around 0.27 and has not decreased during the whole training process.

input bbox
bbox_image_gs-080000_e-000011_b-002829

input image
inputs_gs-080000_e-000011_b-002829

decoded reconstruction directly from the first-stage embedded latent
reconstruction_gs-080000_e-000011_b-002829

sample image from latent diffusion model (ddim_step=200, eta=0.)
samples_gs-080000_e-000011_b-002829

Originally posted by @Tonsty in #14 (comment)
