AgentTuning issues

训练数据中指令与模型行为不匹配

查看huggingface dataset上ALFWorld和Mind2Web的训练数据，发现根据提供的指令，模型不可能产生预期的行为，比如下面两条数据，这个是符合预期的吗？

haichao592

魔塔上的 AgentInstruct 数据集的 conversation 都是空值

你好，我下了魔塔上的 AgentInstruct 数据集，但 conversation 都是空值，请问是数据不开源了嘛？

XianglongTan

基于fastchat部署，推理异常

3

用fastchat部署AgentLM-13B，推理的时候格式是乱的，尤其是streaming的模式，每行只有几个字符就切换到下一行了，一个单词被切成了好几个字母或字母组合。如果手动用transformer加载并用gradio展示的话就没有这个问题，用fastchat的debug模式看了一下，用的是LlamaForCausalLM加载的模型，应该没错

ruifengma

貌似hotpotqa测试脚本跑不起来？

1

如题，我按照规定要求设置的环境，但是跑不起来，有更清晰的环境设置么？

Fu-Dayuan

通用数据如何筛选

7

想问一下，通用数据ShareGPT_Vicuna_unfiltered有9w条，你们是如何筛选到5w条的？能提供一下脚本吗

LuoKaiGSW

if it is possible to conduct RLHF from env

1

Thanks for open-sourced agentTuning code , I am quite interested in training the model, i see the training framework is not open-sourced https://github.com/THUDM/AgentTuning/issues/1, The discussion mentioned that it could support...

SHITIANYU-hue

Can I run AgentInstruct data on the AgentBench?

1

Hey, thank you for your great work. I just wanted to know how can I run evaluation on the open source AgentInstruct data on the AgentBench repo. I will be...

harshraj172