paulcx
Could you provide a small training dataset, even just for running the code? The full dataset is quite difficult and time-consuming to obtain. Thanks.
Any hints? I would be grateful if you could provide some examples.
end preprocessing (18L, 1L, 208L, 208L, 208L)
Traceback (most recent call last):
  File "main.py", line 62, in
    test_detect(test_loader, nod_net, get_pbb, bbox_result_path, config1, n_gpu=config_submit['n_gpu'])
  File "/home/xx/py/DSB2017-master/test_detect.py", line 55, in test_detect
    output = split_comber.combine(output, nzhw=nzhw)
...
Currently, training only works on a single GPU.
So far, all my attempts — with different models (BLOOM, GPT), different sizes, different datasets, and the Accelerate framework with DeepSpeed — have led to the same issue: the evaluation loss keeps increasing. Please see my log.
Yi-VL adopts the LLaVA architecture, but with slight differences in weights and inference. See the [discussion](https://huggingface.co/01-ai/Yi-VL-34B/discussions/3). HF repos: https://huggingface.co/01-ai/Yi-VL-6B and https://huggingface.co/01-ai/Yi-VL-34B
### System Info
ghcr.io/huggingface/text-generation-inference:sha-7766fee
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications
###...
Could you please add support for Docker deployment to streamline setting up and running the project?
Is ORPO considered a new feature? ORPO, a technique that replaces SFT+DPO/PPO, was released recently. I saw @_philschmid's post regarding it yesterday. I gave ORPO a shot with phi-2 and...