Yen-Ting Lin comments

Results 16 comments of


                                            Yen-Ting Lin

Error during inference with Mixtral 7bx8 GPTQ

got the same error with finetuned Mixtral 7bx8

Error connecting to Remote SSH

also having the same issue

Remote-SSH with ProxyJump not working

have the same issue

Support for AWQ quantization in TGI

For quantized model, i only tried with AWQ on vllm. you can find -awq model on my huggingface

想請教關於Fine tuning時的資料集要求

如果你自己寫腳本訓練，我建議用 1 就好，簡單有效。這問題可以回答有深有淺，會關乎你要不要 1. 訓練在 user input / 2. use flash attention? / 3. packing? 等等等，所以我建議你直接熟悉 axolotl 哈哈哈哈他會幫你準備這些 model input。

關於在 LM Studio 使用此模型…

有可能是量化時 calibration data 的選用問題。之後的模型會 *盡量* 量化好一起釋出。

如果你用多(>=8)張高級顯卡 (A/H 系列) 建議用 NVIDIA 原生 [Nemo](https://github.com/NVIDIA/NeMo), Megatron, 或是開源的 [nanotron](https://github.com/huggingface/nanotron/tree/main)。非以上情境，我個人最喜歡用 [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl)，雖然時不時會有小坑哈哈 TRL 是相對乾淨的套件，如果想掌握全程，也蠻推薦的。 BTW twllm 這個 project v1, v2 訓練腳本是幾乎我自己重寫的，但現在建議任何階段都用現有套件就好。

Yen-Ting Lin

Error during inference with Mixtral 7bx8 GPTQ

Error connecting to Remote SSH

Remote-SSH with ProxyJump not working

Support for AWQ quantization in TGI

想請教關於Fine tuning時的資料集要求

關於在 LM Studio 使用此模型…

請問訓練用的程式碼是用哪一套?

請問訓練用的程式碼是用哪一套?

請問訓練用的程式碼是用哪一套?

請問訓練用的程式碼是用哪一套?