Shawn

Results 14 issues of Shawn

hi, how can I get the dataset for version 1?

# Contributing to CSrankings Thanks for contributing to CSrankings! Please read and indicate you agree with **all** these guidelines to getting your pull request accepted. Note that pull requests may...

Hi, I think it is a great work! did you guys consider to release the pretrain code? that would be helpful!

Hi, Is there any pretrained weights (ckpts) for small size models?

Hi, I found that you have shown the loss in the training and testing part, But how to compute the accuracy like the paper you cited in the readme? Thanks!

Hi, Great work first! I am confused with the model tuning part. According to the code, it seemed that you used the lora method. This, in my opinion, will destroy...

Can you show that how you evaluate the model performance with 'attention_mask' ? according to this line: https://github.com/kssteven418/LTP/blob/8ab31a623fb71c5f4f8208e878097f214484e848/src/transformers/models/ltp/modeling_ltp.py#L305C27-L305C27 the 'attention_mask' is never used outside the for loop. So, I think...

Hi, I have read the paper of compresso and it is a great job. I would like to know if there is any result for the datasets like wikitext2, ptb...

Hi, 1. When I use the command on 8 gpus: ``` python3 qalora.py --model_path $llama_7b_4bit_g32 ``` it will show the error: ``` File "/home/shawn/anaconda3/envs/qalora/lib/python3.8/site-packages/transformers/models/llama/modeling_llama.py", line 830, in forward logits =...

Hi, I would like to know when the pretrain code will be released?