yucc-leon

Results 16 issues of yucc-leon

第一行是输入 ```shell Baichuan-User: 登鹳雀楼->王之涣\n夜雨寄北->\n 登鹳雀楼->王之涣\n夜雨寄北->\n客至->\n望月怀远->\n凉州词->\n 一、根据诗句的意思,给下列古诗找出对应的作者 1、一览众山小\n( \n) 2、飞流直下三千尺\n( \n) 3、春风又绿江南岸\n( \n) 4、但看黄河入海流\n( \n) 5、日暮乡关何处是\n( \n) 6、但愿人长久\n( \n) 7、举头望明月,低头思故乡\n( \n) 8、故人西辞黄鹤楼\n( \n) 9、无边落木萧萧下\n( \n) 10、一览众山小 作者( \n) 11、飞流直下三千尺 作者( \n)...

### Is there an existing issue for this? - [X] I have searched the existing issues ### Current Behavior 指 ptuning-v2 的方式:https://github.com/THUDM/ChatGLM-6B/blob/main/ptuning/README.md ### Expected Behavior _No response_ ### Steps To...

I noticed that even though _bigcode/starcoder(2)_ is much opener than code llama and deepseekcoder, eg. open-sourced datasets, clearly described data processing and training, and so on, it is still not...

There is an input format mismatch between the eval and training process. Do you intend to **_emphasize_** the problem before the model generates its output? When doing the Humaneval(+) eval,...

Just started reproducing Magicoder and could not help wondering, would a bigger OSS-Instruct dataset work better and how much better? PS. There are 12,000,000 files in Python inside bigcode/Starcoderdata, with...

discussion

### Describe the bug This happens with datasets-2.18.0; I downgraded the version to 2.14.6 fixing this temporarily. ``` Traceback (most recent call last): File "/home/xxx/miniconda3/envs/py310/lib/python3.10/site-packages/datasets/load.py", line 2556, in load_dataset builder_instance...

Your paper suggested Instruction Boosting and Self-Compare FT would be very helpful but IB looks like Wizard-Evol and IB is very similar to PHP and according to the tech report,...

预印本上附了项目链接,打开是空的?

### Model introduction The model was finetuned by AIGCode based on DeepSeek-Coder-6.7B-base using open-source and private datasets. ### Model URL https://huggingface.co/aigcode/AIGCodeGeek-DS-6.7B ### Additional information (Optional) We have provided code for...

model eval

I mean tell the readers what have you done in details. I'm sorry but I cannot get enough information from this repo for now (2023/09/21 14:28 UTC+8).