opencompass icon indicating copy to clipboard operation
opencompass copied to clipboard

[Bug] 缺乏game24以及其他数据集

Open nuochenpku opened this issue 1 year ago • 5 comments

先决条件

  • [X] 我已经搜索过 问题讨论 但未得到预期的帮助。
  • [X] 错误在 最新版本 中尚未被修复。

问题类型

我正在使用官方支持的任务/模型/数据集进行评估。

环境

python run.py
--datasets korbench_mixed_gen_d00bdd bbh_gen TheoremQA_5shot_gen_6f0af8 NPHardEval_gen hellaswag_gen \

重现问题 - 代码/配置示例

python run.py
--datasets korbench_mixed_gen_d00bdd bbh_gen TheoremQA_5shot_gen_6f0af8 NPHardEval_gen hellaswag_gen \

重现问题 - 命令或脚本

python run.py
--datasets korbench_mixed_gen_d00bdd bbh_gen TheoremQA_5shot_gen_6f0af8 NPHardEval_gen hellaswag_gen \

重现问题 - 错误信息

Please make sure ./data/NPHardEval/GCP/ is correct

lack of some datasets like game24 and nphard

其他信息

缺乏一些数据集

nuochenpku avatar Jan 07 '25 06:01 nuochenpku

Thanks for the report. We are working on the dataset validity check and will update the dataset recently.

tonysy avatar Jan 07 '25 11:01 tonysy

Thanks for the report. We are working on the dataset validity check and will update the dataset recently.

@tonysy @liushz @nuochenpku @changlan @hzhwcmhf OpenFinData和FinanceIQ这两个数据集目前也出现了上述问题,问题应该在于opencompass/utils/datasets_info.py,里面缺失一些数据集的下载方式,请尽快上传缺失数据集至http://opencompass.oss-cn-shanghai.aliyuncs.com/datasets/data/并更新代码。谢谢你们的贡献!

small-noob avatar Jan 30 '25 11:01 small-noob

Thanks for the report. We are working on the dataset validity check and will update the dataset recently.

hi, any update? i need evaluate financeIQ and don't know how to download it. or I can just download from HF but is it compatible with opencompass?

GenerallyCovetous avatar Mar 28 '25 08:03 GenerallyCovetous

Lots of datasets are missing. I met the same problems in 1) FinanceIQ, 2) LawBench, 3) MedBench/DrugCA, 4) AGIEval/data/v1/

Could you please fix it?

Youth-49 avatar Apr 02 '25 04:04 Youth-49

Lots of datasets are missing. I met the same problems in 1) FinanceIQ, 2) LawBench, 3) MedBench/DrugCA, 4) AGIEval/data/v1/

Could you please fix it?

ArkNightmaster avatar Oct 19 '25 13:10 ArkNightmaster