open-webui icon indicating copy to clipboard operation
open-webui copied to clipboard

feat: DALL-E Compatibility Improvements

Open tusing opened this issue 2 years ago • 1 comments

Some feature requests from issues I've run into during my own usage of DALL-E via this project:

  • API feature implementation
    • [ ] DALL-E 3's API has two image quality options: standard and hd, but there is no way to select which one is used.
    • We should support multiple image generation for DALL-E 2/3.
      • [ ] For DALL-E 2, the n parameter (1≤n≤10) controls how many images can be generated.
      • [ ] For DALL-E 3, the n parameter must be 1, but we can request more images in parallel.
  • User-experience related
    • [ ] The DALL-E 2/3 API does not take a steps parameter, but it is visible in the UI, pre-set to 50, which may cause confusion.
    • [ ] DALL-E 3 can only take 3 resolutions (1024x1024, 1024x1792, or 1792x1024) but the resolution input is free-form, which may cause confusion.
    • [ ] DALL-E settings are not preserved, and there does not seem to be a way to preserve them between Docker container restarts. The API key and API settings are both wiped, even when passing in the API key environment variable and ensuring the volume is mounted correctly.
  • Prompt-revision related
    • [ ] DALL-E 3 "automatically re-writes the prompts for safety reasons" but it is possible to disable this via the string provided in the API guidelines. We could make this an option.
    • [ ] We should show what the prompt used for the image was, which is provided by the API response.

Related: DALL-E API Documentation

tusing avatar Apr 02 '24 07:04 tusing

I was about to say something ;) For now Open-WebUI only support 512x512 and 1024x1024 repectively for DALL-E 2 and DALL-E 3.

  • but does'nt support DALL-E 3 in size like 1024×1792 or 1792x1024, nor HD

JOduMonT avatar Jun 06 '24 07:06 JOduMonT