TalkNet 2 [WIP]

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
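
To illustrate why depth-wise separable convolutions keep the model small, here is a rough parameter-count comparison. It is a sketch only: the channel and kernel sizes are hypothetical values chosen for illustration, not the configuration used by TalkNet 2, and the helper names are not part of this repository.

```python
def conv1d_params(in_ch, out_ch, kernel):
    """Weight count of a standard 1D convolution (bias omitted)."""
    return in_ch * out_ch * kernel


def separable_conv1d_params(in_ch, out_ch, kernel):
    """Weight count of a depth-wise separable 1D convolution (bias omitted)."""
    depthwise = in_ch * kernel   # one kernel-sized filter per input channel
    pointwise = in_ch * out_ch   # 1x1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, for illustration only.
standard = conv1d_params(256, 256, 5)
separable = separable_conv1d_params(256, 256, 5)
print(standard, separable)  # 327680 66816
```

With these assumed sizes the separable layer needs roughly a fifth of the weights of the standard one.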

Official TalkNet 2 repo here

Remaining work:

[x] Add masking to all QuartzNet Blocks.
[x] Add PostNet to Mel-Spectrogram generator.
[ ] Clean up the model implementations and align them with best practices.
[ ] Add Text and Audio processing code.
[ ] Add dataloader and training code.
[ ] Test the whole TalkNet 2 setup and post results.
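
The masking item in the list above is about keeping padded frames from leaking into convolution outputs when sequences in a batch have different lengths. A minimal sketch of the idea, assuming PyTorch, tensors shaped (batch, channels, time), and an odd kernel size; `lengths_to_mask` and `MaskedSeparableConv1d` are hypothetical names and do not come from this repository or the official implementation:

```python
import torch
from torch import nn


def lengths_to_mask(lengths: torch.Tensor, max_len: int) -> torch.Tensor:
    """Return a (batch, 1, time) boolean mask that is True on real frames."""
    steps = torch.arange(max_len, device=lengths.device)
    return (steps[None, :] < lengths[:, None]).unsqueeze(1)


class MaskedSeparableConv1d(nn.Module):
    """Depth-wise separable 1D convolution that zeroes padded frames."""

    def __init__(self, in_ch: int, out_ch: int, kernel_size: int):
        super().__init__()
        # groups=in_ch gives one filter per input channel (depth-wise step).
        self.depthwise = nn.Conv1d(
            in_ch, in_ch, kernel_size,
            padding=kernel_size // 2, groups=in_ch, bias=False,
        )
        # A 1x1 convolution mixes information across channels (point-wise step).
        self.pointwise = nn.Conv1d(in_ch, out_ch, kernel_size=1)

    def forward(self, x: torch.Tensor, lengths: torch.Tensor) -> torch.Tensor:
        mask = lengths_to_mask(lengths, x.size(-1))
        x = x.masked_fill(~mask, 0.0)          # zero padding before the conv
        x = self.pointwise(self.depthwise(x))
        return x.masked_fill(~mask, 0.0)       # zero padding after the conv


layer = MaskedSeparableConv1d(in_ch=4, out_ch=8, kernel_size=3)
x = torch.randn(2, 4, 10)
lengths = torch.tensor([10, 6])
y = layer(x, lengths)
```

Here the second sequence has only 6 real frames, so frames 6 to 9 of its output are zero regardless of what the padding contained.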

Citation:

@misc{beliaev2021talknet,
      title={TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction},
      author={Stanislav Beliaev and Boris Ginsburg},
      year={2021},
      eprint={2104.08189},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}
