whisper.cpp

With suppress_non_speech_tokens set to true I'm still getting non speech tokens

Open • itsthisjustin opened this issue 1 year ago • 1 comment

Transcriptions are including things such as "Music Playing" and "Bell Rings". Am I missing something silly here, or does this param just not work? I'm using the Swift package, FYI.

itsthisjustin, May 10 '24 03:05

Only these tokens are suppressed, so whole words such as "Music Playing" are not affected:

static const std::vector<std::string> non_speech_tokens = {
    "\"", "#", "(", ")", "*", "+", "/", ":", ";", "<", "=", ">", "@", "[", "\\", "]", "^",
    "_", "`", "{", "|", "}", "~", "「", "」", "『", "』", "<<", ">>", "<<<", ">>>", "--",
    "---", "-(", "-[", "('", "(\"", "((", "))", "(((", ")))", "[[", "]]", "{{", "}}", "♪♪",
    "♪♪♪","♩", "♪", "♫", "♬", "♭", "♮", "♯"
};

KTibow, Aug 22 '24 16:08
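To illustrate why a fixed list like the one quoted above leaves phrases such as "Music Playing" untouched, here is a minimal sketch of list-based suppression. It assumes suppression works by masking the logits of vocabulary entries that exactly match a listed string. The names `non_speech_tokens_demo` and `suppress_listed_tokens`, the shortened list, and the toy vocabulary are all hypothetical, and this is not whisper.cpp's actual implementation.

```cpp
#include <cstddef>
#include <cmath>
#include <limits>
#include <map>
#include <string>
#include <vector>

// Abbreviated, illustrative copy of the quoted list (hypothetical name).
static const std::vector<std::string> non_speech_tokens_demo = {
    "(", ")", "[", "]", "♪", "♫"
};

// Hypothetical helper: set the logit of every vocabulary entry that exactly
// matches a listed token to -infinity so it can never be sampled.
// Tokens that are not in the list (for example ordinary words) are left as-is.
void suppress_listed_tokens(const std::map<std::string, int> & token_to_id,
                            std::vector<float> & logits) {
    for (const auto & tok : non_speech_tokens_demo) {
        const auto it = token_to_id.find(tok);
        if (it == token_to_id.end()) {
            continue; // token string not present in this vocabulary
        }
        const int id = it->second;
        if (id >= 0 && static_cast<std::size_t>(id) < logits.size()) {
            logits[static_cast<std::size_t>(id)] = -std::numeric_limits<float>::infinity();
        }
    }
}
```

With a toy vocabulary containing "[", "Music", "♪" and "]", only the three symbol entries get masked under this sketch. The word token "Music" keeps its original logit, which would be consistent with captions like "Music Playing" still showing up in the output.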