Passenger12138 comments

Results 5 comments of


                                            Passenger12138

Question About register_cross_attention_hook and replace_call_method_for_sd3 in Attention Map Visualization

I have implemented a basic functionality to visualize the attention maps of VDM (such as CogVideo) models. My first version of the code is available at [[my repository](https://github.com/Passenger12138/attention-map-diffusers-vdm.git)](https://github.com/Passenger12138/attention-map-diffusers-vdm.git). Currently, the...

咨询cogvideox1.5训练和推理框架支持现状

我更新了一版基于cogvideo 1.5的lora微调代码，主要用于支持bucket训练并解决了cogvideo1.5位置编码和pfs 编码报错的问题以下是我的代码https://github.com/Passenger12138/CogVideoX-5B-I2V-v1.5-lora-train.git

咨询cogvideox1.5训练和推理框架支持现状

🚀 **We are excited to release the LoRA fine-tuning code for CogVideoX 1.5 by Diffusers**, designed specifically for image-to-video (image2video) tasks! This update brings significant improvements and new features to...

Loading local non-Mistral tokenizer incorrectly trigger fix_mistral_regex warning.

I’m fine-tuning **InternVL3_5-1B** (which uses a Qwen3-based LLM) with `transformers==4.57.3` and encountered the following warning: > *The tokenizer you are loading from '...' has an incorrect regex pattern [...] You...

torch.compile + channels_last support for Wan 2.2 (T2V / I2V) fails with RuntimeError + Dynamo Unsupported behavior

Hi, @DN6, I tried the solution you suggested such as ``` # group offload apply_group_offloading( pipe.transformer, offload_type="leaf_level", offload_device=torch.device("cpu"), onload_device=torch.device("cuda"), use_stream=True, ) apply_group_offloading( pipe.transformer_2, offload_type="leaf_level", offload_device=torch.device("cpu"), onload_device=torch.device("cuda"), use_stream=True, ) apply_group_offloading( pipe.text_encoder,...