EvanSong77 issues

Results 9 issues of


                                            EvanSong77

我想问一下如果是属于其它类别如何做比较好，就比如按照您提供的数据，有10种类别和描述： ![image](https://github.com/user-attachments/assets/83a1e801-e4b1-47b1-966f-eecfc3e1c8de) 如果不属于这10种中的任何一类，称为‘其它’，这种情况如何解决比较好。我有一种想法，按照您这种方式训练完之后，我是不是可以根据结果设定一个阈值，如果没有任何一个类别超过改阈值，认定为‘其它’。期待您的回复，祝好！

目录解析

### Description of the bug | 错误描述目录解析效果很差 ![snipaste_20241009_143832](https://github.com/user-attachments/assets/ea62346f-68b0-4dde-9543-0bfbd3731cbb) ![snipaste_20241009_143848](https://github.com/user-attachments/assets/1fa0f6b2-f1de-49ab-a6e7-7f34508c4198) ### How to reproduce the bug | 如何复现 magic-pdf -p Apex+Software+Integration+-+Initial+Overview.pdf -o './' ### Operating system | 操作系统 Linux ###...

bug

Why is the throughput of A40 15 token/s, slower than vllm?

docker logs: ```bash WARNING 02-12 03:17:26 cuda.py:23] You are using a deprecated `pynvml` package. Please install `nvidia-ml-py` instead, and make sure to uninstall `pynvml`. When both of them are installed,...

读取文件

请问如何读取上传的json或者excel文件，然后使用代码做一定数据处理，文档提取器提取的内容貌似不可以

支持encoder-only模型

您好，请问您是否考虑支持encoder-only模型？如：bge-m3 embedding model等

bug

[Bug] AttributeError: 'AWQMarlinConfig' object has no attribute 'weight_block_size' when deploying Qwen3-235B-A22B-AWQ

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest...

关于FIM ( Fill-in-the-Middle ) 微调的一些问题

数据集格式： ```bash { "instruction": "\n// Operation/PtzRainBrush.cpp\n{\n\n}\n\n\n\nCPtzRainBrush::~CPtzRainBrush()\n\n{\n\n\tif (m_wrapper)\n\n\t{\n\n\t\tm_wrapper->release(m_wrapper);\n\n\t}\n\n\n\n\tif (m_startUpTimer.IsStarted())\n\n\t{\n\n\t\tm_startUpTimer.Stop();\n\n\t}\n\n\t\n\n\tif(m_rainSensorTimer.IsStarted())\n\n\t{\n\n\t\tm_rainSensorTimer.Stop();\n\n\t}\n\n\t\n\n\tif(m_autoTimer.IsStarted())\n\n\t{\n\n\t\tm_autoTimer.Stop();\n\n\t}\n\n\t\n\n\tm_pDriver->detachPtzCheckStatus(CDriver::PtzCheckStatusProc(&CPtzRainBrush::onPtzCheckStatus, this));\n\n\tm_pDriver->detachPtzCapabilityStatus(CDriver::PtzCapabilityProc(&CPtzRainBrush::onPtzCapability, this));\n\n\tCPtzMoveStatus::instance(m_channel)->detachStatusProc(CPtzMoveStatus::StatusProc(&CPtzRainBrush::onProcessStatus, this));\n\n}\n\n\n\nvoid CPtzRainBrush::init()\n\n{\n\n\tprocessRainBrushCaps();\n\n\tPeripheralDesc desc;\n\n\tmemset(&desc, 0, sizeof(desc));\n\n\tdesc.channel = m_channel;\n\n\tif ((createPeripheral(&desc, &m_wrapper)) < 0)\n\n\t{\n\n\t\terrorf(\"createPeripheral error!\\n\");\n\n\t\tm_wrapper = NULL;\n\n\t}\n\n\n\n\tstd::stringstream stream(\"\");\n\n\tstream getConfig(\"RainBrushMode\", rainBrushModeCfg))\n\n\t\t{\n\n\t\t\tif (\"Auto\" == rainBrushModeCfg[m_channel][\"Mode\"].asString())\n\n\t\t\t{\n\n\t\t\t\tm_backupMode...

关于FIM任务的训练

您好，我想请教一下关于FIM任务的训练方式以及数据构造，我先讲一下我目前的做法吧。我目前是将更大规模的Coder模型FIM补全生成的结果中实际用户（指定某一部分的用户，C、C++）使用并accept的数据作为训练数据，使用Qwen2.5-Coder-7B来训练，训练数据大约1w条左右（只有C、C++的），评估数据约500条，然后采用Qwen的Chat Template来进行lora微调，最终通用生成指标都比较高（bleu和rouge）,对比评估数据的label发现也没什么问题,实际部署上去发现Qwen2.5-Coder-7B-sft的补全接受率比原版的Qwen2.5-Coder-7B接受率还低一点，请问我这种训练方式存在什么问题？或者您是否有什么好的建议吗？ ```bash ***** predict metrics ***** predict_bleu-4 = 85.0577 predict_model_preparation_time = 0.0044 predict_rouge-1 = 91.14 predict_rouge-2 = 85.1151 predict_rouge-l = 89.8826 predict_runtime = 0:02:54.48 predict_samples_per_second = 2.734 predict_steps_per_second...

EvanSong77

代码报错