TalkNet2-pytorch icon indicating copy to clipboard operation
TalkNet2-pytorch copied to clipboard

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

TalkNet 2 [WIP]

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.


Official TalkNet 2 repo here

Work remains:

  • [x] Add masking to all QuartzNet Blocks.
  • [x] Add PostNet to Mel-Spectrogram generator.
  • [ ] Clean up and modify all model implementation as per best practices.
  • [ ] Add Text and Audio processing code.
  • [ ] Add dataloader and training code.
  • [ ] Test the whole Talknet2 setup and post result.

Citation:

@misc{beliaev2021talknet,
      title={TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model Stanislav Beliaev, Boris Ginsburgfor Speech Synthesis with Explicit Pitch and Duration Prediction}, 
      author={Stanislav Beliaev and Boris Ginsburg},
      year={2021},
      eprint={2104.08189},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}