Katsuya Iida

Results 4 repositories owned by Katsuya Iida

voice100

26
Stars
3
Forks
Watchers

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

Kokoro-Speech-Dataset

33
Stars
3
Forks
Watchers

A public domain single speaker Japanese speech dataset

soundstream-pytorch

51
Stars
9
Forks
Watchers

Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint

NeMoOnnxSharp

15
Stars
2
Forks
Watchers

Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime for .NET Core.