|
|
@ -20,10 +20,11 @@ Visit our [website] for audio samples. |
|
|
|
1. Download and extract the [LJ Speech dataset](https://keithito.com/LJ-Speech-Dataset/) |
|
|
|
2. Clone this repo: `git clone https://github.com/NVIDIA/tacotron2.git` |
|
|
|
3. CD into this repo: `cd tacotron2` |
|
|
|
4. Update .wav paths: `sed -i -- 's,DUMMY,ljs_dataset_folder/wavs,g' filelists/*.txt` |
|
|
|
4. Initialize submodule: `git submodule init; git submodule update` |
|
|
|
5. Update .wav paths: `sed -i -- 's,DUMMY,ljs_dataset_folder/wavs,g' filelists/*.txt` |
|
|
|
- Alternatively, set `load_mel_from_disk=True` in `hparams.py` and update mel-spectrogram paths |
|
|
|
5. Install [PyTorch 1.0] |
|
|
|
6. Install python requirements or build docker image |
|
|
|
6. Install [PyTorch 1.0] |
|
|
|
7. Install python requirements or build docker image |
|
|
|
- Install python requirements: `pip install -r requirements.txt` |
|
|
|
|
|
|
|
## Training |
|
|
|