Jonathan Rahn comments

Results 13 comments of


                                            Jonathan Rahn

Convert Flax trained model to PyTorch

yes, it's currently not in the right place (should go into the `__init__.py` you mention), but i created a variable for that

Convert Flax trained model to PyTorch

this runs without exceptions now: ```python from diffusers import FlaxStableDiffusionPipeline, StableDiffusionPipeline from diffusers import AutoencoderKL, UNet2DConditionModel import jax.numpy as jnp pipe, params = FlaxStableDiffusionPipeline.from_pretrained('CompVis/stable-diffusion-v1-4', revision='bf16', dtype=jnp.bfloat16) pipe.save_pretrained('output-flax', params) pipe2 =...

Convert Flax trained model to PyTorch

this is my updated test scenario: ```python from diffusers import FlaxStableDiffusionPipeline, StableDiffusionPipeline from diffusers import AutoencoderKL, UNet2DConditionModel import jax.numpy as jnp import jax from flax.jax_utils import replicate from flax.training.common_utils import...

Convert Flax trained model to PyTorch

Thank you!

Convert Flax trained model to PyTorch

Thanks, I'll give it a try

Convert Flax trained model to PyTorch

Ok, so I have a lokal version, that works for this: ```python from diffusers import FlaxStableDiffusionPipeline, StableDiffusionPipeline from diffusers import AutoencoderKL, UNet2DConditionModel import jax.numpy as jnp pipe, params = FlaxStableDiffusionPipeline.from_pretrained('CompVis/stable-diffusion-v1-4',...

is it work on Win platform?

I've taken a quick look at this e.g. [here](https://stackoverflow.com/questions/19849077/error-loading-dll-in-python-not-a-valid-win32-application) and [here](https://stackoverflow.com/questions/57187566/python-ctypes-loading-dll-throws-oserror-winerror-193-1-is-not-a-valid-win) - but have not done such ports before. My initial thoughts are: - currently bitsandbytes loads libbitsandbytes.so which won't...

is it work on Win platform?

I've spent a bit more time looking at how [DeepSpeed](https://github.com/microsoft/DeepSpeed) solves building on Windows. They make use of `BuildExtension`, `CppExtension` and `CudaExtension` from `torch.utils.cpp_extension` and this build process is run...

Running train.py on 2060 GPU

To add one data point: I'm running unmodified `python train.py` with `--batch_size=8` on ~22gb vram.

Inference on the model

> Note that it's using FP16 weights, not int8, so the memory requirements are 2x of the int8 quantized model. But personally I'm skeptical that the model can be quantized...