RichardQin1

Results 7 comments of RichardQin1

> I have tested it with the ct-transformer large version, the result is expected. Maybe you could try it: > > https://github.com/alibaba-damo-academy/FunASR/tree/main/egs_modelscope/punctuation/punc_ct-transformer_cn-en-common-vocab471067-large sorry, Let me update the sample content tmp_str...

> Check if VAD didn't cut off those missing segments. how to check vad? sorry,im beginner

> Check if VAD didn't cut off those missing segments. ![image](https://github.com/user-attachments/assets/83e9aad9-b07c-444a-b5cf-f01df52c22e1) Does this prove that the time lost by audio was discarded by VAD? How should I optimize @Purfview

I want to achieve recognition of objects in videos after frame extraction, and I have already completed frame extraction and video object list.

> 演示的哪一部分对您不起作用? run demo_gradio or demo_with_text print ``` Traceback (most recent call last): File "/home/phoenix/python_project/videoQuestionAnswer/Tracking-Anything-with-DEVA/demo/demo_with_text.py", line 15, in from deva.ext.grounding_dino import get_grounding_dino_model File "/home/phoenix/anaconda3/envs/VQA/lib/python3.10/site-packages/deva/ext/grounding_dino.py", line 16, in from segment_anything import...

hellow ,Now I want to use the hotwords feature in the latest version of FastWhisper. Which branch do you recommend using. Additionally, I attempted to use the fastwhisper model separately...