Neo comments

Results: 12 comments of Neo

Qwen1.5-MoE-A2.7B-Chat fine-tuning: GPU utilization is very low

> Full-parameter fine-tuning with ZeRO-3 and output_router_logits=True. During training it suddenly hangs, and GPU utilization suddenly jumps to 100%. ![image](https://private-user-images.githubusercontent.com/96909430/321122435-096c34cf-fb9c-4e1e-b694-47a5a104d6b9.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTMzNjA5MTAsIm5iZiI6MTcxMzM2MDYxMCwicGF0aCI6Ii85NjkwOTQzMC8zMjExMjI0MzUtMDk2YzM0Y2YtZmI5Yy00ZTFlLWI2OTQtNDdhNWExMDRkNmI5LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA0MTclMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNDE3VDEzMzAxMFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWZjYTYxZjk4ZTkyODJjNzRkZGI2ZjAwMDE3ZmE1YWI1OWRmMmY4MDEyNzQ0ZGYwNmM2OWY4MDlmOWY5NGUxOWEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.rV3YPjOhzllPlrg94KOjiEjUaHpOKJqx_653NeWWc54) Has this been resolved?

the msra2src.py may have some problems

> Hello! I also ran into some problems when running msra2mrc.py. Do you know how to deal with the problem of invalid inputs? Thank you so much if you could give me...

RuntimeError: "triu_tril_cuda_template" not implemented for 'BFloat16'

> ```python
> torch.set_default_tensor_type(torch.cuda.HalfTensor)
> ```

I have the same problem when I train Llama 3. In modeling_llama.py, line 1095:

```
causal_mask = torch.triu(causal_mask, diagonal=1)
```

I fixed this with: ``` causal_mask...
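For reference, one workaround often used for this error is to run `torch.triu` in float32 and cast the result back to the original dtype. The sketch below is an illustration under assumptions, not necessarily the fix the truncated comment goes on to give: `triu_via_float32` is a hypothetical helper name, and the mask size and fill value are chosen only for the demo.

```python
import torch


def triu_via_float32(mask: torch.Tensor, diagonal: int = 0) -> torch.Tensor:
    """Return the upper-triangular part of `mask`, computed in float32.

    Assumption: the float32 triu kernel is available on the tensor's device,
    even when the bfloat16 kernel is not.
    """
    original_dtype = mask.dtype
    upper = torch.triu(mask.to(torch.float32), diagonal=diagonal)
    # Cast back so downstream code still sees the dtype it expects.
    return upper.to(original_dtype)


# Demo on CPU: a 4x4 additive causal mask in bfloat16.
seq_len = 4
min_value = torch.finfo(torch.bfloat16).min
causal_mask = torch.full((seq_len, seq_len), min_value, dtype=torch.bfloat16)
causal_mask = triu_via_float32(causal_mask, diagonal=1)
```

The same helper can be applied to a CUDA tensor. Whether the native bfloat16 kernel exists depends on the installed PyTorch version, so the cast may be unnecessary on newer builds.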

[VATT] Construct and load datasets using DMVR

Excuse me, can you help me convert the raw videos and text to TFRecords? I have downloaded the two datasets ![image](https://github.com/google-research/google-research/assets/55910045/6b5dccfc-7f01-4198-88b2-f34379af8478). How can I convert them? Thank you.

RuntimeError: cannot pin 'CUDABFloat16Type' only dense CPU tensors can be pinned

> @cooper12121, please share repro steps, such as scripts, full stack trace, and ds_config.

**I use transformers' Trainer to train my model. Here is the full error report:** ``` 11.219.19.47:...

[VATT] OP_REQUIRES failed at strided_slice_op.cc:108 : INVALID_ARGUMENT: slice index 0 of dimension 0 out of bounds.

> Hi, @hassanhub. I encountered this problem and failed to solve it after trying several solutions to similar (or at least similar-looking) issues.
>
> As you mentioned in #962,...

[VATT] OP_REQUIRES failed at strided_slice_op.cc:108 : INVALID_ARGUMENT: slice index 0 of dimension 0 out of bounds.

> Have you figured out the solution by any chance?
> It would be very helpful if you could guide me past the error.

After testing, this error must...

RuntimeError: "triu_tril_cuda_template" not implemented for 'BFloat16'

> I'm using Llama via HuggingFace. Is there a good way to make this edit through their modules at all?
>
> I tried doing
>
> ```
> #...

How to prepare Multi Modal Dataset

> google-scenic uses dmvr as da Have you fixed it? I also ran into the same problem, and it must be a dataset problem.

Fine-tuning Qwen-MoE on the Adgen advertising data: output problem

> May I know how you fine-tuned the model? Is it full-parameter fine-tuning or something else? Based on the limited information, it is hard to tell why (especially the ``)....