training-image-processor
Badly coded GUI for a quick, streamlined workflow to produce 512x512 images suitable for training Stable Diffusion
Training Image Processor
This project speeds up the process of manually preparing images for training Stable Diffusion embeddings.
It provides a streamlined UI for selecting a square portion of a picture. A single click scales that selection to 512x512 for training, writes out the result, and loads the next image.
The code was thrown together quickly; it's disorganized and unprincipled. I built the minimum viable project needed to speed up my own workflow. Pull requests are welcome.
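To make the select-and-scale step concrete, here is a minimal sketch of the arithmetic involved. It is not taken from this project's source: the function name `square_selection`, its parameters, and the choice to centre the square on the pointer and clamp it to the image bounds are all assumptions made for illustration.

```python
TARGET = 512  # output edge length in pixels, as described above


def square_selection(center_x, center_y, side, width, height):
    """Return a (left, top, right, bottom) square box kept inside the image.

    Hypothetical helper: the real tool may position its selection differently.
    """
    # The square can never be larger than the shorter image edge.
    side = max(1, min(side, width, height))
    # Centre the square on the pointer, then shift it back inside the bounds.
    left = min(max(center_x - side // 2, 0), width - side)
    top = min(max(center_y - side // 2, 0), height - side)
    return (left, top, left + side, top + side)


# Example: an 800-pixel selection near the bottom-left of a 1920x1080 picture.
box = square_selection(center_x=100, center_y=900, side=800, width=1920, height=1080)
# Factor by which the selected square must be scaled to reach 512x512.
scale = TARGET / (box[2] - box[0])
print(box, scale)
```

With an imaging library such as Pillow, a box like this could be passed to `Image.crop(box)` and the result resized with `Image.resize((512, 512))`; whether the project does exactly that is not stated here.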

Installation and Launching
Windows
- Install Python 3.x from python.org.
- Download this repository as a zip file and extract it
- Enter the extracted folder with your file browser
- Double click setup (setup.bat) - This only needs to be done once
- Double click launch (launch.bat)
Linux and other Unix-likes (BSDs, macOS)
Y'all probably know what to do already...
- Make sure Python is installed in whatever way is standard for your distribution.
- Git clone the repo, or download the zip and extract it
- Enter the project folder in a bash terminal
- Run:
  make
- Activate the venv (or use the alternative activation script if not using bash):
  source ./venv/bin/activate
- Run the script:
  python src/training_image_processor.py
Usage
- First, open a directory with pictures
- Use the buttons at the top to rotate/flip images as needed
- Resize the selection square with the mouse wheel
- Click the image to process it and load the next image. The 512x512 image is placed in a new 'outputs' folder and the original is moved to a new 'originals' folder, both inside the open directory
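The file handling in the last step can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual code: the helper names `file_processed_image` and `next_image`, the `write_output` callback, the list of picture extensions, and the choice to reuse the original file name for the output are all hypothetical.

```python
import shutil
from pathlib import Path


def file_processed_image(source, write_output):
    """Save the processed image under 'outputs' and move the original to 'originals'.

    `write_output` is a hypothetical callback that receives the destination
    path and writes the 512x512 image there.
    """
    source = Path(source)
    outputs = source.parent / "outputs"
    originals = source.parent / "originals"
    # Both folders are created inside the open directory on first use.
    outputs.mkdir(exist_ok=True)
    originals.mkdir(exist_ok=True)

    output_path = outputs / source.name  # assumption: output keeps the same name
    write_output(output_path)
    shutil.move(str(source), str(originals / source.name))
    return output_path


def next_image(directory, extensions=(".png", ".jpg", ".jpeg")):
    """Return the next unprocessed picture in the directory, or None if none remain."""
    candidates = sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() in extensions
    )
    return candidates[0] if candidates else None
```

Because each original is moved out of the open directory once it has been processed, calling `next_image` again naturally yields the following picture, which matches the click-and-advance workflow described above.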
Licensing
The contents of the assets folder are from the adwaita-icons project and are released under the LGPL-v3.0 license. The rest of the project is GPL-v2 only.