moro0v0 issues

Results 7 issues of


                                            moro0v0

识别效果很一般

我拿中文英文的论文都试了一下，用他们给的官网，发现首先存在吞字的问题，也就是说有时候一页的识别是从某个中间位置开始的，不知道为什么？其次很多公式的识别也不准确。感觉效果很一般？

Out of memory

I am using a server with two 24GB 3090 GPUs. When I run the bash run_baselines_lora.sh script, I encounter an error indicating insufficient GPU memory. How can the code be...

question

视觉模型/Omni模型开启api之后怎么传递视频执行预测？

### Reminder - [x] I have read the above rules and searched the existing issues. ### System Info 视觉模型/Omni模型开启api之后怎么传递视频执行预测？ ### Reproduction ```text Put your message here. ``` ### Others _No...

bug

pending

SenseVoiceSmall模型导出onnx格式之后GPU版运行特别慢

我测试了导出的量化INT8的SenseVoiceSmall产生的onnx模型直接拿cpu和gpu去预测一个语音模型，观察到语音模型在使用cpu第一次只要1s多第二次只要0.1s。然后使用gpu反而更忙了第一次20s，第二次1s。这是什么问题

question

funasr支持SenseVoiceLarge的微调吗？

我看项目的model里面只有sensevoice，里面结构对应的是SenseVoiceSmall的结构，funasr是不是不能微调SenseVoiceLarge？

question

funasr里面没有Whisper的微调支持呀

whisper的model.py文件里面forward函数只有一个pass

question

我做的是一个声音分类的任务，然后给的结果是类别，然后我看训练损失在验证集上 { "epoch": 26.845637583892618, "eval_loss": 0.07754058390855789, "eval_runtime": 18.252, "eval_samples_per_second": 12.163, "eval_steps_per_second": 1.041, "step": 4000 }也只有0.077，但是我在合并lora之后预测的结果准确率只有百分之15，请问问题出在哪？我如果要排查的话怎么排查问题

moro0v0

识别效果很一般

Out of memory

视觉模型/Omni模型开启api之后怎么传递视频执行预测？

SenseVoiceSmall模型导出onnx格式之后GPU版运行特别慢

funasr支持SenseVoiceLarge的微调吗？

funasr里面没有Whisper的微调支持呀

训练结果存疑