SenseVoice icon indicating copy to clipboard operation
SenseVoice copied to clipboard

how to set skip_special_tokens and timestamp level?

Open MrRace opened this issue 1 year ago • 8 comments

How to set parameters similar to skip_special_tokens when generating ASR results? Additionally, does it support ASR results at the timestamp level?

MrRace avatar Jul 08 '24 11:07 MrRace

same problem

ggzzzzz628 avatar Jul 15 '24 05:07 ggzzzzz628

skip_special_tokens:please update funasr-1.1.1 timestamp: on going

LauraGPT avatar Jul 16 '24 03:07 LauraGPT

When is the timestamp level expected to be released? We really need this ability. Thanks

yaohongfenglove avatar Jul 16 '24 03:07 yaohongfenglove

How to implement timestamp function? Would you give me some ideas

yaohongfenglove avatar Jul 16 '24 05:07 yaohongfenglove

How to implement timestamp function? Would you give me some ideas use the forced_align provided by the torchaudio like:

alignment,  scores = torchaudio.functional.forced_align(ctc_probs, preds.unsqueeze(0), None, None, blank=0)

gaochangfeng avatar Jul 30 '24 09:07 gaochangfeng

alignment, scores = torchaudio.functional.forced_align(ctc_probs, preds.unsqueeze(0), None, None, blank=0) 呢個係點用嘅?

laubonghaudoi avatar Aug 11 '24 23:08 laubonghaudoi

@laubonghaudoi 睇下呢度 https://colab.research.google.com/github/pytorch/audio/blob/gh-pages/main/_downloads/97729a601eea05725da9715649633311/ctc_forced_alignment_api_tutorial.ipynb#scrollTo=AeFF8bo3YLoq

indiejoseph avatar Sep 28 '24 11:09 indiejoseph

Is there any solution now, about the timestamp level

superme32767 avatar Oct 25 '24 02:10 superme32767