
How to train stabilityai/stable-diffusion-xl-base-1.0 using ImageReward model

Open emelpolaris opened this issue 1 year ago • 6 comments

I found the code snippet for training CompVis/stable-diffusion-v1-4 using ImageReward. Based on this, I tried to adapt the code to train stabilityai/stable-diffusion-xl-base-1.0, but it failed. Is it possible to train stabilityai/stable-diffusion-xl-base-1.0 using the ImageReward model? Thanks

emelpolaris avatar Apr 15 '24 06:04 emelpolaris

Hello, thanks for your attention! You can refer to our ReFL code and just add the ReFL action similarly in the SDXL fine-tuning script.
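For readers following along, here is a minimal, framework-free sketch of the pieces that would likely need to be carried over. It is not the maintainers' SDXL implementation. The helper names (`pick_refl_step`, `refl_plan`, `refl_loss`, `sdxl_add_time_ids`) are hypothetical, and the defaults (40 inference steps, a gradient step drawn from [30, 39], reward threshold 2.0, loss scale 1e-3) are assumptions based on the ReFL setup for SD 1.4, so check them against the repository before relying on them. The comments at the end indicate where the real PyTorch/diffusers calls would go; SDXL additionally expects pooled text embeddings and size/crop conditioning passed through `added_cond_kwargs`.

```python
import random


def pick_refl_step(num_inference_steps=40, low=30, high=39, rng=random):
    """Pick the single denoising step that keeps gradients.

    Earlier steps run without gradients and the rollout stops after this step.
    The 40-step / [30, 39] defaults are assumptions, not confirmed SDXL settings.
    """
    if not (0 <= low <= high < num_inference_steps):
        raise ValueError("need 0 <= low <= high < num_inference_steps")
    return rng.randint(low, high)


def refl_plan(grad_step):
    """Return (step_index, needs_grad) pairs for one truncated rollout."""
    return [(i, i == grad_step) for i in range(grad_step + 1)]


def refl_loss(rewards, threshold=2.0, grad_scale=1e-3):
    """Hinge-style reward loss: grad_scale * mean(max(0, threshold - r))."""
    if not rewards:
        raise ValueError("rewards must be non-empty")
    hinge = [max(0.0, threshold - r) for r in rewards]
    return grad_scale * sum(hinge) / len(hinge)


def sdxl_add_time_ids(original_size, crop_top_left, target_size):
    """SDXL micro-conditioning vector:
    [orig_h, orig_w, crop_top, crop_left, target_h, target_w]."""
    return list(original_size) + list(crop_top_left) + list(target_size)


# Where these would slot into an SDXL training step (comments only, not executed):
#   grad_step = pick_refl_step()
#   for i, needs_grad in refl_plan(grad_step):
#       with torch.set_grad_enabled(needs_grad):
#           noise_pred = unet(
#               latents, timesteps[i],
#               encoder_hidden_states=prompt_embeds,
#               added_cond_kwargs={"text_embeds": pooled_prompt_embeds,
#                                  "time_ids": add_time_ids},
#           ).sample
#       ... scheduler step; at the gradient step, take the predicted clean
#       latents, decode them with the VAE, resize and normalize for the reward
#       model, score the image, and backpropagate refl_loss through the UNet.
```

The split into a plan and a loss keeps the SDXL-specific parts (two text encoders, pooled embeddings, time ids, VAE decoding) separate from the ReFL logic, which may make it easier to see which step of the port is producing an error.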

xujz18 avatar Apr 15 '24 16:04 xujz18

Thanks for your response. I tried to add the ReFL action to the SDXL fine-tuning script, but had no luck.

emelpolaris avatar Apr 15 '24 18:04 emelpolaris

@xujz18 I'm stuck with an error. Could you provide the code for it?

emelpolaris avatar Apr 19 '24 13:04 emelpolaris

If anyone is interested in this and can help resolve the issue, please reach out to me: https://join.slack.com/t/xrunnergroup/shared_invite/zt-2h89gocpx-KjI4Vf0Z8ZtA1kP2DvsX1Q

emelpolaris avatar Apr 20 '24 15:04 emelpolaris

Hello, thanks for your attention! You can refer to our ReFL code and just add the ReFL action similarly in the SDXL fine-tuning script.

I was trying the same thing but had no luck. Could you please elaborate on the steps?

amulyaprasanth avatar Apr 22 '24 17:04 amulyaprasanth