What a great job!
I noticed that the model used was FLUX1.0-dev, and this model underwent Guidance distillation. Therefore, a new parameter, guidance, was added in the forward process.
May I ask how you set up this guidance throughout the entire training process?
Set it to 0 and then treat it as a model without the influence of this parameter? Is there any better solution?
Looking forward to your reply