AgentBench
AgentBench copied to clipboard
[Assistance] Number of problems in the OS dataset
Hi, I have counted the number of data samples or problems in the 'os_interaction' folder, and my count shows a total of 191 samples. However, the table that provides statistics reports a different number of samples, specifically 170 samples. Not sure if I was looking at the correct folder. Appreciate your help. thanks!
Thank you for your interest.
- Could you provide the detail of the split whose count is wrong?
- File
data/os_interaction/data/6-backup.jsonis deprecated and we don't contain it in our dataset. Details are shown inmain/configs/tasks/os.yaml.
thank you @Longin-Yu
apart form the 6-backup the total is 182, including 26 dev.
test 156
dev 26
but the stat here shows the test with 144 samples?