RichardQin1
RichardQin1
> I have tested it with the ct-transformer large version, the result is expected. Maybe you could try it: > > https://github.com/alibaba-damo-academy/FunASR/tree/main/egs_modelscope/punctuation/punc_ct-transformer_cn-en-common-vocab471067-large sorry, Let me update the sample content tmp_str...
> Check if VAD didn't cut off those missing segments. how to check vad? sorry,im beginner
> Check if VAD didn't cut off those missing segments.  Does this prove that the time lost by audio was discarded by VAD? How should I optimize @Purfview
I want to achieve recognition of objects in videos after frame extraction, and I have already completed frame extraction and video object list.
> 演示的哪一部分对您不起作用? run demo_gradio or demo_with_text print ``` Traceback (most recent call last): File "/home/phoenix/python_project/videoQuestionAnswer/Tracking-Anything-with-DEVA/demo/demo_with_text.py", line 15, in from deva.ext.grounding_dino import get_grounding_dino_model File "/home/phoenix/anaconda3/envs/VQA/lib/python3.10/site-packages/deva/ext/grounding_dino.py", line 16, in from segment_anything import...
hellow ,Now I want to use the hotwords feature in the latest version of FastWhisper. Which branch do you recommend using. Additionally, I attempted to use the fastwhisper model separately...