Minxiangliu
Minxiangliu
現在1.9.8的版本能使用QRcode登入嗎? 使用beanfun網頁可以正常登入 但是使用登入器就不行....
Hi @havakv , In my case, I can see that I have set up the GPU correctly, and then during model training, the GPU memory is being used, but the...
I know why I'm wrong old code: ``` class getDataset(Dataset): def __init__(self, datasets): self.dataset=datasets self._trans=transforms.Compose(....) def __getitem__(self, index): return self._trans(self.dataset[index]) ...... dataset_train = getDataset(....) dl_train = DataLoader(dataset_train, ...) ``` new...
Hi @yikuanli , I also want to know.
Hi @lmolhw5252 , I currently have one A100 (40GB) GPU and I am training using your recommendations. However, I encounter an issue where it ultimately displays `exits with return code...
> you may check the resource like memory,or cpu ,you can set batch=1 Are you suggesting setting both `per_device_train_batch_size` and `per_device_eval_batch_size` to 1?
I also encounter a similar issue when fine-tuning llama, and I hope someone can assist in answering it! ``` /root/miniconda3/envs/vicuna/lib/python3.10/site-packages/transformers/training_args.py:1388: FutureWarning: using `--fsdp_transformer_layer_cls_to_wrap` is deprecated. Use fsdp_config instead warnings.warn( Loading...
> i have the same problem, help pls, i need to end my final qualifying work in one week :( Hi @whk6688 @Void-fun , In the end, I used the...
Hi @CSerxy , I have not attempted fine-tuning the Vicuna model. You can replace `--model_name_or_path` with the path to the pre-trained Vicuna model. Since both Llama and Vicuna models are...
> I ran into the same problem and worked around it using the solution proposed above. Wondering if anyone knows the root cause and could explain why? Here is my...