MaxKB [FEATURE] Token count reported by MaxKB differs considerably from the tokens actually consumed by the LLM

1.1.3

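The number of tokens MaxKB reports differs considerably from the number the LLM actually consumes, probably because MaxKB's internal vector computation is also included in the count. Please consider adding a separate return value that reports only the tokens actually consumed by the LLM.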

May 21 '24 01:05 xiaobug0929

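Thanks for the feedback. We use the GPT-2 model's tokenizer throughout to count input and output tokens, and embedding is not counted. The result may differ from how an online model's API counts tokens.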
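To illustrate why the two numbers can diverge, here is a minimal sketch, not MaxKB's actual code. It counts prompt and completion tokens locally with a pluggable tokenizer and keeps the provider-reported usage next to the estimate, which is roughly what the feature request asks for. The names `TokenUsage`, `measure_usage`, and `whitespace_encode` are hypothetical, the whitespace splitter is only a toy stand-in for a real GPT-2 tokenizer, and the `prompt_tokens` / `completion_tokens` keys assume an OpenAI-style usage object.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Sequence


@dataclass
class TokenUsage:
    """Local estimate and (optional) provider-reported counts, side by side."""
    estimated_prompt_tokens: int
    estimated_completion_tokens: int
    provider_prompt_tokens: Optional[int] = None
    provider_completion_tokens: Optional[int] = None

    @property
    def estimated_total(self) -> int:
        return self.estimated_prompt_tokens + self.estimated_completion_tokens

    @property
    def provider_total(self) -> Optional[int]:
        # None when the provider did not report usage.
        if self.provider_prompt_tokens is None or self.provider_completion_tokens is None:
            return None
        return self.provider_prompt_tokens + self.provider_completion_tokens


def whitespace_encode(text: str) -> Sequence[str]:
    # Toy tokenizer for illustration only. A real setup might pass the
    # `encode` method of a GPT-2 tokenizer here instead (assumption).
    return text.split()


def measure_usage(
    prompt: str,
    completion: str,
    encode: Callable[[str], Sequence] = whitespace_encode,
    provider_usage: Optional[Dict[str, int]] = None,
) -> TokenUsage:
    """Count tokens locally and attach the provider's own numbers if given."""
    usage = TokenUsage(
        estimated_prompt_tokens=len(encode(prompt)),
        estimated_completion_tokens=len(encode(completion)),
    )
    if provider_usage is not None:
        # Assumed OpenAI-style keys; other providers name these differently.
        usage.provider_prompt_tokens = provider_usage.get("prompt_tokens")
        usage.provider_completion_tokens = provider_usage.get("completion_tokens")
    return usage


usage = measure_usage(
    "how many tokens is this",
    "five tokens here",
    provider_usage={"prompt_tokens": 9, "completion_tokens": 4},
)
print(usage.estimated_total, usage.provider_total)
```

Because the local tokenizer and the model's own tokenizer split text differently, the two totals will generally not match. Returning both lets the caller decide which one to use for billing or display.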

May 21 '24 02:05 baixin513

Why doesn't the v1.1.3 I'm using show token consumption?

May 23 '24 01:05 marxy

v1.1.3

Which model are you using?

May 31 '24 09:05 baixin513

v1.1.3

Which model are you using?

I'm using qwen deployed with ollama v0.1.38.

Jun 04 '24 01:06 marxy

You can upgrade to the latest version and check; this issue has been fixed.

Apr 16 '25 01:04 baixin513
