心流 issues

Results: 7 issues of 心流

How to load a fine-tuned model for inference?

@sanchit-gandhi I used the script from https://github.com/huggingface/distil-whisper/tree/main/training/flax/finetuning_scripts to fine-tune a model and obtained a model named flax_model.msgpack. How can I load this model for inference? Additionally, why did the size...

AttributeError: 'PluginConfig' object has no attribute '_remove_input_padding'. Did you mean: '_remove_input_padding'?

### System Info A10 tensorrt-cu12-10.2.0.post1 tensorrt-cu12-bindings-10.2.0.post1 tensorrt-cu12-libs-10.2.0.post1 tensorrt_llm-0.12.0.dev2024072300 python==3.10 ### Who can help? @Tracin ### Information - [X] The official example scripts - [ ] My own modified scripts ###...

bug

Error when saving the model during fine-tuning

@shuaijiang As the title says, an error is raised when saving the fine-tuned model: Some tensors share memory, this will lead to duplicate memory on disk and potential differences when loading them again: {failing}. A potential way to correctly save your model...

AssertionError: tensor WhisperEncoder/encoder_layers/0/attention_layernorm/layer_norm_L5155/NORMALIZATION_0_output_0 has an invalid shape

### System Info colab T4 ### Who can help? @ ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An...

bug

triaged

stale

Error with translate-en_zh-1_9.argosmodel

Open source training code

How to train a bidirectional translation model