Sangyoon Bae
Sangyoon Bae
Has anyone tried reproducing the code on refcocog dataset? My environment is python=3.7 and tensorflow=2.13 . I've succeeded on debugging errors caused by version change , but when I train...
should I change start_epoch in config or change path of pretrained_weights in config?
hi, thanks for your codes. I am trying to use model of "TOLD : A novel two-stage overlap-aware framework for Speaker Diarization", but cannot find the model(Found only eend-ola code)....
While following tutorial "Voice_Activity_Detection.ipynb", I confronted errors in "exp_dir = exp_manager(trainer, config.get("exp_manager",None))" I've changed nothing on the tutorial, and I am getting errrors. WHen I first run the cell ,...
Is refcocog and Grefcoco dataset same?
Hi, I am trying to finetune vad and speaker diarization model on korean dataset. However, I cannot find tutorial related to vad. I've found [File not found (github.com)](https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/06_Voice_Activiy_Detection.ipynb) tutorial from...
how can I change number of tokens of whisper?
How can I perform SV with other pretrained models on my data ? Can I get example codes?
Redimnet
I can see Redimnet in your code (redimnet.py), but there is no mention aboutr redimnet in pretrained model readme.md or docs. Is it able to use redimnet?
Does it support TTS? How can I perform TTS?