zy-sunshine

10 comments by zy-sunshine

You can use the nvidia-smi command to check the actual memory usage of your GPU. Maybe another program is using some of the GPU memory?
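A minimal sketch of that check from Python, assuming `nvidia-smi` is on the PATH and reports plain integers for memory. The helper names `parse_gpu_memory` and `query_gpu_memory` are made up for illustration; the query flags are the standard `--query-gpu` and `--format=csv` options of nvidia-smi.

```python
import shutil
import subprocess

# Ask nvidia-smi for per-GPU memory figures in MiB, as bare CSV.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_memory(csv_text):
    """Parse 'index, used, total' CSV lines into a list of dicts (values in MiB)."""
    rows = []
    for line in csv_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        index, used, total = (field.strip() for field in line.split(","))
        rows.append(
            {"index": int(index), "used_mib": int(used), "total_mib": int(total)}
        )
    return rows


def query_gpu_memory():
    """Run nvidia-smi and return parsed rows, or [] if the tool is not installed."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_memory(result.stdout)


if __name__ == "__main__":
    for gpu in query_gpu_memory():
        free = gpu["total_mib"] - gpu["used_mib"]
        print(f"GPU {gpu['index']}: {gpu['used_mib']} MiB used, {free} MiB free")
```

If a GPU shows a lot of memory in use before training starts, another process is probably holding it; running plain `nvidia-smi` lists those processes in its table.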

bloom does support Chinese, though the results may not be great either. As for opt-1.3b, if every reply is "I'm sorry", then your training failed~

The RTX 4070 Ti (12 GB) does not have enough memory to train the ds-chat 1.3b model. I got this error before because the temperature of my RTX 3090 was too high and the GPU was not working at that...

Thanks for sharing. I trained the model on an RTX 3090 (24 GB) with the single_gpu scripts; step 1 and step 2 are the same as yours. In step 3 I added two parameters, "--per_device_train_batch_size...

> python3 chat.py --path output/actor-models/1.3b Human: can you speak chinese ? Assistant: I can speak some Chinese, but I don't know much. I can read a few words and phrases,...

> After step 3 finishes training, I think it should be in output/step3-models/1.3b/actor

QUOTE: (GPU 0; 31.75 GiB total capacity; 28.05 GiB already allocated; 137.69 MiB free; 28.08 GiB reserved in total by PyTorch) I think something is weird here, because you have...

> Thanks for your response, we will follow up this feature, and we will use it to implement the dynamic push app feature, and we have made some progress on...

> A big concern here is the overlap between "load-time" dynamic linking and the component model. Both technologies are about how a Wasm module instances cooperating with others. So, it...

https://github.com/WebAssembly/component-model/blob/main/design/high-level/UseCases.md#performance I just read the component-model doc. I think it is a complex mechanism, and does not meet my needs. I do not need a component(model) runtime, because we will...