Maze
Because amp requires trainable parameters to be `torch.float32`. The LoRA module parameters are `torch.float32`, but the parameters covered by `modules_to_save='embed_tokens,lm_head'` are initialized as `torch.float16` by `from_pretrained` while also receiving amp gradient updates, hence the error.

Solutions:

1. For a llama model, manually cast the `embed_tokens` and `lm_head` layers to `torch.float32` (see the sketch after the code below).
2. For any model, iterate over the parameters and manually cast every parameter with `requires_grad` to `torch.float32`:

```python
model.print_trainable_parameters()
logger.info(f"model.modules_to_save: {model.modules_to_save}")
# Collect trainable parameters that are not float32 (amp would reject them).
trainable_not_float32 = [name for name, param in model.named_parameters()
                         if param.requires_grad and param.dtype != torch.float]
if trainable_not_float32:
    logger.info(f"casting trainable params to float32: {trainable_not_float32}")
    for _, param in model.named_parameters():
        if param.requires_grad and param.dtype != torch.float:
            param.data = param.data.to(torch.float)
```
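A minimal sketch of solution 1, assuming the generic transformers accessors `get_input_embeddings()`/`get_output_embeddings()` resolve to llama's `embed_tokens` and `lm_head` (attribute paths differ across models and PEFT wrappers, so this is illustrative rather than the canonical fix):

```python
import torch

def cast_llama_io_to_float32(model):
    """Upcast only the embedding layer and the output head to float32,
    leaving the rest of the (possibly float16) model untouched."""
    model.get_input_embeddings().to(torch.float32)
    model.get_output_embeddings().to(torch.float32)
    return model
```

Using the accessors instead of hard-coding `model.model.embed_tokens` keeps the helper usable on other architectures with the same layout.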
```python
model = AutoModel.from_pretrained(
    MODEL_PATH,
    load_in_8bit=True,
    torch_dtype=torch.float16,
    device_map='auto',
)
model = prepare_model_for_int8_training(model)
```

The reason is simple; the example above illustrates it. `torch_dtype=torch.float16` means the model weights are loaded in `torch.float16`,...
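To see how these flags interact in practice, one quick check (a sketch, reusing the `model` from the snippet above) is to tally parameter dtypes after loading: the int8-quantized linear weights, the `torch.float16` weights selected by `torch_dtype`, and any layers upcast to `torch.float32` by `prepare_model_for_int8_training` each show up as their own bucket:

```python
from collections import Counter

# Count how many parameter tensors are stored in each dtype.
dtype_counts = Counter(str(p.dtype) for p in model.parameters())
for dtype, count in sorted(dtype_counts.items()):
    print(f"{dtype}: {count} tensors")
```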
> It is hard to say and may take some time. Please stay tuned ;D

Hello, can you provide the training code for LayoutTransformer? I found that the text bbox...
> Thanks for your attention to TextDiffuser. [x1, y1, x2, y2] denotes the coords of the top-left and bottom-right points, which belong to the minimum horizontal rectangle of the 4 point...
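As a small illustration of that convention (a sketch; the function and variable names are hypothetical, not from the TextDiffuser code), the [x1, y1, x2, y2] box is simply the axis-aligned bounding rectangle of the four polygon points:

```python
def quad_to_box(quad):
    """Convert 4 corner points [(x, y), ...] to the minimum
    horizontal rectangle [x1, y1, x2, y2] enclosing them."""
    xs = [x for x, _ in quad]
    ys = [y for _, y in quad]
    return [min(xs), min(ys), max(xs), max(ys)]

# e.g. a slightly rotated quadrilateral
print(quad_to_box([(10, 5), (50, 8), (48, 30), (8, 27)]))  # [8, 5, 50, 30]
```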
> What OS are you running with the Ascend chips? When I execute `pip install -e .`, the decord installation fails with a message that there is no aarch64 build. Do I need to compile it manually?

The decord package has no prebuilt wheel for the ARM platform; it has to be built from source.
It's deliberate. Pseudo open source: you can't verify how well the algorithm works anyway.
> @Doctor-L-end - thanks for contacting us with your feedback. Based on the issue you submitted, I believe this translates to: "I personally feel that the packaging degree of...
> * The artifact issue was because of a bug in the Diffusers training scripts, which should have been addressed in cog-factory by now

@a-r-r-o-w hi, can you tell us...
> Hey, yes I do! We worked together with Yuxuan from the CogVideoX team here: https://github.com/a-r-r-o-w/cogvideox-factory
>
> * 50+ videos is great for finetuning. I generally use ~200 for...