RuntimeError: mat1 and mat2 shapes cannot be multiplied (1280x1024 and 768x4096)

Open DeerEyre opened this issue 2 years ago • 1 comment

I got this error when I followed your instructions to run finetune_visualglm.py. Does anyone know how to solve it?

[2023-06-16 17:34:52,083] [INFO] [RANK 0] do_test ...................... False
[2023-06-16 17:34:52,083] [INFO] [RANK 0] iteration .................... 0
Traceback (most recent call last):
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/ganshu_finetune.py", line 188, in <module>
    training_main(args, model_cls=model, forward_step_function=forward_step, create_dataset_function=create_dataset_function, collate_fn=data_collator)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/training/deepspeed_training.py", line 132, in training_main
    iteration, skipped = train(model, optimizer,
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/training/deepspeed_training.py", line 276, in train
    lm_loss, skipped_iter, metrics = train_step(train_data_iterator,
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/training/deepspeed_training.py", line 350, in train_step
    forward_ret = forward_step(data_iterator, model, args, timers, **kwargs)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/ganshu_finetune.py", line 84, in forward_step
    logits = model(input_ids=tokens, image=image, pre_image=pre_image)[0]
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/sankuai/conda/lib/python3.8/site-packages/deepspeed/utils/nvtx.py", line 15, in wrapped_fn
    ret_val = func(*args, **kwargs)
  File "/home/sankuai/conda/lib/python3.8/site-packages/deepspeed/runtime/engine.py", line 1735, in forward
    loss = self.module(*inputs, **kwargs)
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/model/official/chatglm_model.py", line 192, in forward
    return super().forward(input_ids=input_ids, attention_mask=attention_mask, position_ids=position_ids, past_key_values=past_key_values, **kwargs)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/model/base_model.py", line 144, in forward
    return self.transformer(*args, **kwargs)
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/sat/model/transformer.py", line 451, in forward
    hidden_states = self.hooks['word_embedding_forward'](input_ids, output_cross_layer=output_cross_layer, **kw_args)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/model/visualglm.py", line 20, in word_embedding_forward
    image_emb = self.model(**kw_args)
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/mnt/dolphinfs/hdd_pool/docker/user/hadoop-search/ganshu01/projects/VisualGLM-6B-main/model/blip2.py", line 67, in forward
    return self.glm_proj(out)
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "/home/sankuai/conda/lib/python3.8/site-packages/torch/nn/modules/linear.py", line 114, in forward
    return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (1280x1024 and 768x4096)
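
For anyone reading the shapes in this error: in `F.linear`, `mat1` is the input flattened to two dimensions and `mat2` is the transposed layer weight, so `768x4096` suggests `glm_proj` maps 768 input features to 4096 outputs, while the tensor reaching it has 1024 features per row. The traceback alone does not show why the upstream module emits 1024 features. Below is a minimal sketch that pulls those numbers out of the message; the helper name `explain_matmul_error` is hypothetical and not part of PyTorch or VisualGLM.

```python
import re


def explain_matmul_error(message):
    """Extract the two shapes from a PyTorch 'mat1 and mat2' error message.

    Hypothetical helper for illustration only.
    """
    match = re.search(r"\((\d+)x(\d+) and (\d+)x(\d+)\)", message)
    if match is None:
        raise ValueError("no shape pair found in message")
    rows, in_actual, in_expected, out_features = map(int, match.groups())
    return {
        "input_rows": rows,                  # rows of the flattened input
        "input_features": in_actual,         # features the input actually has
        "layer_in_features": in_expected,    # features the linear layer expects
        "layer_out_features": out_features,  # features the linear layer produces
        # A matrix product needs the inner dimensions to match.
        "compatible": in_actual == in_expected,
    }


msg = "RuntimeError: mat1 and mat2 shapes cannot be multiplied (1280x1024 and 768x4096)"
info = explain_matmul_error(msg)
print(info)
```

Here `compatible` is false because 1024 does not equal 768, which is the condition the error reports. The next step would be checking which module produces the tensor passed to `glm_proj` in `model/blip2.py`.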

Jun 16 '23 09:06 DeerEyre

Someone suggested using half-precision instead of int8. Does anyone know how to change this in the code?

Jun 16 '23 10:06 DeerEyre