hbwu-ntu
hbwu-ntu
Thank you for reaching out. We will have a check.
Thank you for reaching out. The only thing you should take care is to generate the synthesised speech and audio by yourself, and put them under `syn_path`: https://github.com/voidful/Codec-SUPERB/blob/SLT_Challenge/run.sh#L5. The data...
Basically yes. One more thing: the evaluation data is small, so it doesn't take long to re-synthesis data. You may use ChatGPT, give it the `ref_path` folder structure, let ChatGPT...
Q1: Yes Q2: Either is acceptable. If you have multiple codec models, please specify which codec corresponds to each sampling rate during submission.
Thank you for bringing up this question. Yes, resampling to 16kHz for both encoding and reconstruction is allowed. However, please note that the evaluation pipeline expects the audio to be...
@redmist328 Hi, Redmist, thank you for bringing up this point. We will compare codec models with the same sampling rate.
Thanks for submitting the results. Could you also refer to section 4.2 of the rule (https://codecsuperb.github.io/Codec-SUPERB-rule.pdf) to let us know how to do inference using your model (we will leverage...
Here is one suggestion: the codec model ckpt can be uploaded to huggingface or google drive (with an instruction to use `gdown` to download the model)
If possible, could you follow section 4.2 of https://codecsuperb.github.io/Codec-SUPERB-rule.pdf to submit the inference instructions for your model?
Thank you very much for submitting the results. Here are two reminders: 1. There are some missing numbers in your submitted GitHub issue. Could you please `git pull` firstly and...