Kunchang Li issues

Results 9 issues of


                                            Kunchang Li

Pretrained model for sth-sth v1 and kinetics got lower accuracy

Hello, thanks for your great job!!! Today when I tried to fine-tuning your model and test the accuracy of your model, something strange happened. I used uniform sampling, but it...

Why using ReLU before split-attention?

I have read your codes about split-attention and I found that you use ReLU before split-attention. https://github.com/zhanghang1989/ResNeSt/blob/76debaa9b9444742599d104609b8ee984b207332/resnest/torch/splat.py#L48-L76 **However, in MobileNetV3,CBAM-ResNet or other models which use attention mechanism, activation is usually...

Question about the class token

Good job! My question is that why to use different class tokens for each stage but **only the final class token is used for classification**? https://github.com/microsoft/CvT/blob/34d1af94c95442b19fb9470e0c9dd5ee11be2024/lib/models/cls_cvt.py#L607

Do you plan to make your toy dataset public?

Thanks for your great job! I notice you create an interesting toy dataset in Section 4. As I'm not familiar with creating a dataset, do you plan to make your...

Some question

In the paper, EViT with oracle can obtain higher accuracy when training longer epochs. Similar results are also shown in the DeiT paper. Thus I think the comparison is not...

Ask anything in video

Hi! We have simply extended MiniGPT-4 for video question answering in our project [Ask-Anything](https://github.com/OpenGVLab/Ask-Anything). Without extra instruction fine-tuning, current results are not satisfactory. ![image](https://user-images.githubusercontent.com/24236723/233040034-af46e3d3-19bc-44c8-ac59-b8838db8f78d.png) In our other try, we simply...

Difference between the prompts of v0 and v1

Thanks for your great job! I have checked the code of v0 and v1, and found some differences between the prompts: ```python # Vicuna-v0 sep = "###" train_prompt = "system.###...

Will the multi-choice version be released

Hi! Thanks for the exciting jobs! I have noticed another multi-choice version dataset but can not find the annotation files. Will it be released? 😁

Should input_embeddings and out_embeddings be updated in Stage2?

Hi! Thanks for your interesting job! I just find that when using LoRA, the `input_embeddings` and `out_embeddings` are not updated with the following code. https://github.com/eric-ai-lab/MiniGPT-5/blob/2121c745b2cb2d7e842e03b4bcaa89c63f2ee6c1/minigpt4/models/mini_gpt5.py#L115-L116 Considering the LoRA is used...