Shawn issues

Results 14 issues of


                                            Shawn

freihand dataset v1

hi, how can I get the dataset for version 1?

Add Xiaolong Ma as new faculty in Clemson University and Dongkuan Xu in North Carolina State University

# Contributing to CSrankings Thanks for contributing to CSrankings! Please read and indicate you agree with **all** these guidelines to getting your pull request accepted. Note that pull requests may...

pretrain code

Hi, I think it is a great work! did you guys consider to release the pretrain code? that would be helpful!

pretrained weights for small size model

Hi, Is there any pretrained weights (ckpts) for small size models?

How to get the accuracy on ISBI dataset

Hi, I found that you have shown the loss in the training and testing part, But how to compute the accuracy like the paper you cited in the readme? Thanks!

Question related to the model tuning

Hi, Great work first! I am confused with the model tuning part. According to the code, it seemed that you used the lora method. This, in my opinion, will destroy...

No mask used in evaluation process

Can you show that how you evaluate the model performance with 'attention_mask' ? according to this line: https://github.com/kssteven418/LTP/blob/8ab31a623fb71c5f4f8208e878097f214484e848/src/transformers/models/ltp/modeling_ltp.py#L305C27-L305C27 the 'attention_mask' is never used outside the for loop. So, I think...

compresso performance on wikitext2, ptb and c4 datasets

Hi, I have read the paper of compresso and it is a great job. I would like to know if there is any result for the datasets like wikitext2, ptb...

Training with multi gpus, increase the batch size, and how to evaluate?

Hi, 1. When I use the command on 8 gpus: ``` python3 qalora.py --model_path $llama_7b_4bit_g32 ``` it will show the error: ``` File "/home/shawn/anaconda3/envs/qalora/lib/python3.8/site-packages/transformers/models/llama/modeling_llama.py", line 830, in forward logits =...

Pretrain code of Mistral-Pro-8B-v0.1

Hi, I would like to know when the pretrain code will be released?