Results 3 comments of king

我把wishiper 抽取语音特征,以及最后口型图片贴回面部都用gpu改写了。能到50fps 替换 audio_processor.feature_extractor(pcm_array, sampling_rate=16000, return_tensors="pt").input_features import torch import torchaudio import torchaudio.transforms as T import numpy as np class FastWhisperFeatureExtractor: """ 完全对齐 OpenAI Whisper 的特征提取器。 输出为 [1, 80, T],T

速度确实很快,具体做了哪些优化呢?