alokprasad 85470b258e | 4 years ago | |
---|---|---|
audio | 4 years ago | |
data | 4 years ago | |
img | 4 years ago | |
model_new | 4 years ago | |
results | 4 years ago | |
tacotron2 | 4 years ago | |
text | 4 years ago | |
transformer | 4 years ago | |
waveglow | 4 years ago | |
.gitignore | 4 years ago | |
LICENSE | 4 years ago | |
README.md | 4 years ago | |
alignments.zip | 4 years ago | |
dataset.py | 4 years ago | |
fastspeech.py | 4 years ago | |
glow.py | 4 years ago | |
hparams.py | 4 years ago | |
loss.py | 4 years ago | |
modules.py | 4 years ago | |
optimizer.py | 4 years ago | |
preprocess.py | 4 years ago | |
run_inference.sh | 4 years ago | |
synthesis.py | 4 years ago | |
train.py | 4 years ago | |
utils.py | 4 years ago |
An implementation of FastSpeech based on PyTorch.
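1. Put the dataset in `data`.
2. Unzip `alignments.zip`.\*
3. Put the Nvidia pretrained WaveGlow model in `waveglow/pretrained_model`.
4. Run `python preprocess.py`.

\* If you want to calculate the alignments yourself, do not unzip `alignments.zip`; instead, put the Nvidia pretrained Tacotron2 model in `tacotron2/pretrained_model`.
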
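The setup steps above depend on a few paths being in place before `python preprocess.py` is run. A minimal pre-flight check might look like the sketch below. The helper name `missing_paths` is hypothetical and not part of this repository, and it assumes the paths are relative to the repository root.

```python
from pathlib import Path

# Paths taken from the setup steps above. Treating them as relative to the
# repository root is an assumption of this sketch.
REQUIRED_PATHS = ("data", "waveglow/pretrained_model")


def missing_paths(root=".", required=REQUIRED_PATHS):
    """Return the entries of `required` that do not exist under `root`."""
    root = Path(root)
    return [p for p in required if not (root / p).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing before preprocessing:", ", ".join(missing))
    else:
        print("Setup looks complete; run `python preprocess.py` next.")
```

The check only confirms that the paths exist; it does not verify that the dataset or the pretrained model inside them is valid.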
To train, run `python train.py`.
To synthesize speech, run `python synthesis.py`.
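The training and synthesis commands above can also be chained from one script. The following is a rough sketch, not something shipped with this repository: `run_stages` is a hypothetical helper, and running the two scripts back to back without extra arguments is an assumption.

```python
import subprocess
import sys

# Script names come from the commands above. Running them one after the
# other with no arguments is an assumption of this sketch.
STAGES = ("train.py", "synthesis.py")


def run_stages(stages=STAGES, runner=subprocess.run):
    """Run each script with the current interpreter; stop at the first failure.

    Returns the list of scripts that exited with return code 0.
    """
    completed = []
    for script in stages:
        result = runner([sys.executable, script])
        if result.returncode != 0:
            break
        completed.append(script)
    return completed
```

Calling `run_stages()` from the repository root runs training first and starts synthesis only if training exits cleanly. The `runner` parameter exists so the sequencing logic can be exercised without launching a real training job.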
Example audio is in `results`.