Bowen Yuan
Sorry to disturb you, but I found it in LXMERT. : )
I'm concerned about that, too. I found the following statement in the paper: "_We consider the same video representation configurations for the training and inference stages. On 128 NVIDIA H100...
Also, I'm curious why it isn't used on the 7B model.
> > Maybe the slowfast mode is also used in the 72B model's training stage, instead of only in the 72B model's inference stage?
>
> Based on the config.json of 72B...
Besides, why do I get unexpected keys when loading the model, like this: `unexpected_keys=['llama_model.model.layers.0.self_attn.rotary_emb.inv_freq', 'llama_model.model.layers.1.self_attn.rotary_emb.inv_freq', 'llama_model.model.layers.2.self_attn.rotary_emb.inv_freq', 'llama_model.model.layers.3.self_attn.rotary_emb.inv_freq',.....................`
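For reference, here is a minimal sketch of how such keys can be filtered out before loading. These `rotary_emb.inv_freq` entries are typically leftovers from a checkpoint saved with an older transformers version (newer versions recompute the buffer instead of persisting it), so dropping them should be harmless. The function name and checkpoint path below are made up for illustration:

```python
import torch

def load_without_inv_freq(model, ckpt_path):
    """Load a checkpoint while dropping stale `rotary_emb.inv_freq` buffers.

    Newer transformers versions recompute inv_freq on the fly, so checkpoints
    that still contain it are reported as having unexpected keys.
    """
    state_dict = torch.load(ckpt_path, map_location="cpu")
    state_dict = {k: v for k, v in state_dict.items()
                  if not k.endswith("rotary_emb.inv_freq")}
    # strict=False tolerates any remaining benign mismatches.
    return model.load_state_dict(state_dict, strict=False)
```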
Hi, I found that `shared_utils_ds.py` has a bug on line 58: `optimizer_params = create_optimizer(config.optimizer, model, return_group=True)`. `optimizer.py` may need to be updated to match.
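For context, a rough sketch of what a `create_optimizer` that accepts `return_group` could look like. This is only an illustration of the missing keyword, not the repo's actual `optimizer.py`; the config fields (`lr`, `weight_decay`) and the decay/no-decay split are assumptions:

```python
import torch

def create_optimizer(opt_cfg, model, return_group=False):
    """Illustrative sketch: build param groups, optionally return them raw."""
    decay, no_decay = [], []
    for name, p in model.named_parameters():
        if not p.requires_grad:
            continue
        # Common convention: no weight decay on biases and norm layers.
        (no_decay if name.endswith(".bias") or "norm" in name.lower() else decay).append(p)

    param_groups = [
        {"params": decay, "weight_decay": opt_cfg.weight_decay},
        {"params": no_decay, "weight_decay": 0.0},
    ]
    if return_group:
        # Caller (e.g. a DeepSpeed setup) constructs the optimizer itself.
        return param_groups
    return torch.optim.AdamW(param_groups, lr=opt_cfg.lr)
```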
Hi, I'm trying it but hit an error when using the function `read_video_pyav()`. The `read_video_pyav()` in the latest version of lmms-eval still does not seem to have been updated. The specific error: `read_video_pyav() got an unexpected keyword argument 'force_sample'`
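In case it helps, here is a minimal PyAV-based sketch of a `read_video_pyav()` that accepts the `force_sample` keyword. The real signature in lmms-eval may differ, so treat this only as an illustration of the uniform-sampling behaviour the flag implies:

```python
import av
import numpy as np

def read_video_pyav(video_path, num_frm=8, force_sample=False):
    """Decode a video with PyAV and return `num_frm` RGB frames.

    With force_sample=True the frames are sampled uniformly over the whole
    clip, which appears to be what the newer model code expects.
    """
    container = av.open(video_path)
    frames = [f.to_ndarray(format="rgb24") for f in container.decode(video=0)]
    container.close()
    if force_sample or len(frames) > num_frm:
        indices = np.linspace(0, len(frames) - 1, num_frm).astype(int)
        frames = [frames[i] for i in indices]
    return np.stack(frames)
```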
As a temporary workaround, I changed the default `video_decode_backend: str = "pyav"` to `video_decode_backend: str = "decord"`, and it runs through.
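A minimal decord equivalent of that workaround, just to show what the `decord` backend does here; the function name and sampling strategy are illustrative, not the library's own code:

```python
import numpy as np
from decord import VideoReader, cpu

def read_video_decord(video_path, num_frm=8):
    """Uniformly sample `num_frm` RGB frames using the decord backend."""
    vr = VideoReader(video_path, ctx=cpu(0))
    indices = np.linspace(0, len(vr) - 1, num_frm).astype(int)
    return vr.get_batch(indices).asnumpy()  # shape (num_frm, H, W, 3), uint8
```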
In the training scripts I see the following settings:
```
--add_time_instruction True \
--force_sample True \
```
Should the default values of these two also be changed to True?