UI bugs encountered

Open richardtallent opened this issue 1 year ago • 8 comments

First time user, on Mac. Noticed a few bugs:

  • It's not clear for path-based fields whether they should have a trailing "/" or not (turns out no).
  • Changing model, etc. paths requires restarting the whole thing. Not mentioned on screen. Refresh buttons on LoRA, etc. don't do the trick.
  • It's complaining in the log that I don't have ffmpeg.exe. I do have ffmpeg, but it's looking for the wrong executable.

Some UI suggestions for improvement:

  • Make font sizes more consistent. I have to scale text on my 30" 4K monitor, but that makes some fonts way too big.
  • Many settings use TextArea fields, but are too small to actually wrap correctly when a value is entered. Give them more room by default or switch them to normal inputs.
  • Ideally, now we can use File System Access API's showDirectoryPicker to have the user choose a folder! Requires user permission but easier than trying to get and paste an absolute path.
  • Lots of "fit and finish" issues with padding, margins, color, text/field/icon alignment, etc. Makes the UI hard to grok.
  • Add support in the cards and metadata modal for embeddings and LoRA for the JSON files that are cached by the A1111 CivitAI plugin. These have, for example, the trigger words.
  • Move Sampler setting out of the comfyUI heading, it's a critical configuration setting.

The good news:

  • Installed with minimum fuss
  • Generates images (the most important thing!)
  • Being able to use absolute or relative paths allowed me to point to my A1111 folders
  • Having ComfyUI around in the background as an option is nice for future flexibility
  • The built-in image viewer is nice

richardtallent avatar Mar 10 '24 23:03 richardtallent

It's not clear for path-based fields whether they should have a trailing "/" or not (turns out no).

Shouldn't make a difference? I wouldn't expect anything to break from having it or not, and if it does, please post the specific error and I'll fix that (i.e. have it automatically corrected).
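For illustration only, here is a minimal sketch of what such automatic correction could look like. It is not taken from the StableSwarmUI code base; the function name `normalize_dir_setting` is hypothetical, and the sketch only assumes that a path setting arrives as a plain string.

```python
def normalize_dir_setting(raw: str) -> str:
    """Return the path with surrounding whitespace and trailing separators removed.

    Hypothetical helper: not part of StableSwarmUI. Windows drive roots such
    as "C:\\" are not treated specially in this sketch.
    """
    cleaned = raw.strip()
    stripped = cleaned.rstrip("/\\")
    # Keep a bare root such as "/" intact instead of collapsing it to "".
    if cleaned and not stripped:
        return cleaned[0]
    return stripped


if __name__ == "__main__":
    # With or without the trailing "/", the stored value ends up the same.
    print(normalize_dir_setting("/Users/me/a1111/models/"))
    print(normalize_dir_setting("/Users/me/a1111/models"))
```

Normalizing once when the setting is saved would mean the rest of the code never has to care which form the user typed.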

Changing model, etc. paths requires restarting the whole thing. Not mentioned on screen. Refresh buttons on LoRA, etc. don't do the trick.

Yes, automating the reset of the model tracker after a settings change is on the todo list.

It's complaining in the log that I don't have ffmpeg.exe. I do have ffmpeg, but it's looking for the wrong executable.

Huh, that's from an upstream lib, and going by https://github.com/imageio/imageio/issues/665#issuecomment-1263237651 it sounds like there might be a standing unsolved issue on Mac? Stability IT sent me a MacBook recently, so I'll have to test that. The Comfy node that uses ffmpeg is explicitly separated from the rest because I don't trust it to install reliably; keeping it separate means only the things that use it (i.e. SVD video export) won't work.
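As a rough illustration of a platform-neutral lookup (not how the upstream library actually resolves the binary, which this thread does not show), the sketch below searches `PATH` for several candidate names using the standard library's `shutil.which`. The function name `find_ffmpeg` and the candidate list are assumptions made for the example.

```python
import shutil
from typing import Optional, Sequence


def find_ffmpeg(candidates: Sequence[str] = ("ffmpeg", "ffmpeg.exe")) -> Optional[str]:
    """Return the full path of the first candidate found on PATH, or None.

    Hypothetical helper: the candidate names are assumptions for this sketch.
    """
    for name in candidates:
        path = shutil.which(name)  # None when the name is not on PATH
        if path is not None:
            return path
    return None


if __name__ == "__main__":
    location = find_ffmpeg()
    if location is None:
        print("ffmpeg not found on PATH; video export would be unavailable")
    else:
        print(f"using ffmpeg at {location}")
```

Trying the extension-less name first means a macOS or Linux install is found even when a caller only knows the Windows file name.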

Make font sizes more consistent. I have to scale text on my 30" 4K monitor, but that makes some fonts way too big.

Which text specifically is too big or too small? Most of it is approximately the same size; the only odd one out I'm aware of is the Welcome text (which is just a giant jumbotron: it's meant to be weirdly large, and it's not really a concern if scaling makes it so big it scrolls or whatever).

Many settings use TextArea fields, but are too small to actually wrap correctly when a value is entered. Give them more room by default or switch them to normal inputs.

Can you be more specific here? Which fields don't fit right?

Ideally, now we can use File System Access API's showDirectoryPicker to have the user choose a folder! Requires user permission but easier than trying to get and paste an absolute path.

Doesn't look to be usable (https://developer.mozilla.org/en-US/docs/Web/API/Window/showDirectoryPicker): notably it requires HTTPS (not valid in localhost) and is considered experimental / not available in all browsers.

Lots of "fit and finish" issues with padding, margins, color, text/field/icon alignment, etc. Makes the UI hard to grok.

That's vague/broad but I welcome PRs to improve that.

Add support in the cards and metadata modal for embeddings and LoRA for the JSON files that are cached by the A1111 CivitAI plugin. These have, for example, the trigger words.

Can you post an example JSON + its filename?

Move Sampler setting out of the comfyUI heading, it's a critical configuration setting.

yeah lol - EDIT: okay that's been on the todo for a while and is finally done, it's under "Sampling" now.

mcmonkey4eva avatar Mar 11 '24 17:03 mcmonkey4eva

~~Oh I just noticed this is the modelspec repo heh -- please post at https://github.com/Stability-AI/StableSwarmUI/issues~~

edit: transferred it myself, neat

mcmonkey4eva avatar Mar 11 '24 17:03 mcmonkey4eva

@ "Fit and finish": one thing I noticed with the default theme is that dropdowns in the generation parameters have a black background, but textboxes do not, making them look like uneditable text.

Another is the image editor. When you click "Edit Image", the "Use As Init" button stays visible, but the "Init Image" generation parameter (which it normally sends the image to) disappears without telling you why. Then, when I clicked "Use As Init" under the original image on the right and generated a new image, it was actually based on the edited image on the left. It took me some time to realize that the "Init Image" field is indeed hidden on purpose while the image editor is open. (= It's not a bug as I initially thought, and the "Use As Init" button effectively does nothing in this state).

The image editor's layer stack doesn't indicate whether a layer is a Mask layer or an Image layer. Then again, there seems to be no difference between these types in the first place - a Mask layer functions like a regular color layer and does not, in fact, function as an inpainting mask. (I found no documentation explaining how it's supposed to work.)

When you close the image editor, all your changes are lost without warning.

arcusmaximus avatar Mar 11 '24 22:03 arcusmaximus

one thing I noticed with the default theme is that dropdowns in the generation parameters have a black background, but textboxes do not, making them look like uneditable text.

This is my struggle with UI/UX - I have a strong vision for functionality/accessibility but not aesthetic clarity, so I lean on external commentary/suggestions for that, but it's always dragged back-n-forth. Here's what the UI looked like about a month ago: (image)

The very change you're commenting on here is one made in a PR https://github.com/Stability-AI/StableSwarmUI/pull/233 trying to make the UI more visually clear (dropdowns were given black backgrounds and the rest gray), but for you it made it less clear.

(= It's not a bug as I initially thought, and the "Use As Init" button effectively does nothing in this state).

Ooh. Yeah, that's one to fix, marking that more clearly as disabled or hiding it.

The image editor's layer stack doesn't indicate whether a layer is a Mask layer or an Image layer.

Ye I want to clean that up - the image editor is still very underdeveloped and needs a lot more work.

Then again, there seems to be no difference between these types in the first place - a Mask layer functions like a regular color layer and does not, in fact, function as an inpainting mask. (I found no documentation explaining how it's supposed to work.)

No a mask layer is a mask layer that acts as a mask. The one unexpected oddity for some is that the "default mask" is all-on currently, i.e. you have to add a mask layer and draw black to mask things to not draw, and can optionally then add another layer and draw white on it to add things back to keep in. I want to make controlling the base of the mask easier since it's rather non-obvious atm, and make a clearer yes/no mask selection vs just drawing black n white with the standard brush.
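To make the "all-on by default" behaviour concrete, here is a small sketch of one way to read the description above. It is not the editor's actual implementation: the `Layer` representation (None for transparent, 0 for black, 1 for white) and the function `composite_mask` are assumptions made purely for illustration.

```python
from typing import List, Optional

# Hypothetical representation: None = transparent, 0 = black, 1 = white.
Layer = List[List[Optional[int]]]


def composite_mask(width: int, height: int, layers: List[Layer]) -> List[List[int]]:
    """Flatten mask layers (bottom layer first) over an all-on base mask."""
    # The default mask is all-on: with no mask layers, every pixel is included.
    mask = [[1] * width for _ in range(height)]
    for layer in layers:
        for y in range(height):
            for x in range(width):
                value = layer[y][x]
                if value is not None:  # painted pixels override what is below
                    mask[y][x] = value
    return mask


if __name__ == "__main__":
    black_layer: Layer = [[0, 0, None]]      # mask out the first two pixels
    white_layer: Layer = [[None, 1, None]]   # add the middle pixel back
    print(composite_mask(3, 1, []))                          # [[1, 1, 1]]
    print(composite_mask(3, 1, [black_layer]))               # [[0, 0, 1]]
    print(composite_mask(3, 1, [black_layer, white_layer]))  # [[0, 1, 1]]
```

Under this reading, a freshly added transparent mask layer changes nothing, which matches the surprise described in the thread: everything stays included until something is painted black.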

When you close the image editor, all your changes are lost without warning.

For clarity on how early-in-dev the image editor specifically was, I only recently added said close button after many complaints that the dev-preview of the image editor was unclosable without refreshing lol

mcmonkey4eva avatar Mar 11 '24 22:03 mcmonkey4eva

but for you it made it less clear

It's a matter of consistency, I'd say. I would expect editable fields to use the same colors regardless of their type. If that makes things look too cluttered, maybe the many "?" buttons could be removed to compensate - they could be replaced by hover tooltips, or a single Help button in the corner of the window that lets you click the item you want help with.

No a mask layer is a mask layer that acts as a mask

When I wrote that it doesn't, I had just drawn a red blob on a Mask layer, generated an image, and found that the blob made it into the result. Now, however, I can't reproduce that. I was probably drawing on an Image layer without realizing it :)

The one unexpected oddity for some is that the "default mask" is all-on currently

Aha. I would've never guessed that - mask layers are transparent by default, which made me assume they work the same as the Inpaint tab in A1111 (transparent means excluded and colored means included). Maybe this could be changed to match the A1111 behavior? After all, the main use case for masking will be to fix small areas, not regenerate everything except for a small area.

make a clearer yes/no mask selection vs just drawing black n white with the standard brush

That would be nice. Could also make mask layers translucent by default.

Another idea would be to have only one mask layer that can be toggled on/off and is always on top. I'm not sure why you'd ever want multiple mask layers, or have them obscured by the stuff they're supposed to mask.

the image editor is still very underdeveloped and needs a lot more work

One possibility would be to keep it basic and give users the option to use an integrated Photopea instead. I currently use an A1111 extension that does this and find it fantastic for combining the best parts of different generations.

arcusmaximus avatar Mar 12 '24 10:03 arcusmaximus

maybe the many "?" buttons could be removed to compensate - they could be replaced by hover tooltips

There is an option under user settings to hide the ? and replace with hover tooltips or hide entirely.

mcmonkey4eva avatar Mar 12 '24 18:03 mcmonkey4eva

Removing the ? makes a world of difference and I did not know that you could change this.

Need to make editable boxes clearer for the numbers etc.

Joopz0r avatar Mar 12 '24 20:03 Joopz0r

Add support in the cards and metadata modal for embeddings and LoRA for the JSON files that are cached by the A1111 CivitAI plugin. These have, for example, the trigger words.

A user on Discord shared some metadata files from it - apparently there are two different versions with different formats. I've added recognition for both in https://github.com/Stability-AI/StableSwarmUI/commit/44f89604eae4a9ce38f64a0fa0848133a3337742

mcmonkey4eva avatar Mar 17 '24 21:03 mcmonkey4eva

The new Modern Dark theme should cover a lot of the fit-and-finish complaints I imagine. Or if not... well at least it was designed by an actual UI designer and a Webdev rather than my programmer brain.

The sampler settings are in a proper group now.

The image editor is a lot further along than it used to be. Still not done, but pretty nice to use for basic tasks at least now.

Things still aren't perfect, but I'd say what's been brought up here is generally covered well enough that any lingering issues should be raised separately.

mcmonkey4eva avatar May 20 '24 06:05 mcmonkey4eva