Daniel Muckerman 9d4c4432e1 | 4 years ago | |
---|---|---|
fastspeech_squeezewave@839ea13617 | 4 years ago | |
tacotron2@185cd24e04 | 4 years ago | |
.gitignore | 4 years ago | |
.gitmodules | 4 years ago | |
Dockerfile | 4 years ago | |
README.md | 4 years ago | |
merged.dict_1.1.txt | 4 years ago | |
requirements.txt | 4 years ago | |
synthesize.py | 4 years ago |
Experimental TTS for CPU inference using Tacotron2 and Squeezewave.
Initialize the submodules:
git submodule update --init --recursive
Install the python dependencies:
pip install -r requirements.txt
Copy your models into the directory. This was trained on 22khz tacotron2 and squeezewave models. Squeezewave is loaded using a state_dict so we can take advantage of the existing pretrained models provided by the paper's author while maintaining compatibility with the tweaked architecture to enable denoising without necessitating retraining the vocoder.
Run the project:
python synthesize.py