Instructions to use speechbrain/tts-mstacotron2-libritts with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- speechbrain
How to use speechbrain/tts-mstacotron2-libritts with speechbrain:
# interface not specified in config.json
- Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -41,7 +41,7 @@ The following is an example of converting text-to-speech with the speaker voice
|
|
| 41 |
**Note:**
|
| 42 |
- The model generates speech at a rate of 22050 Hz, but it's important to note that the input signal, crucial for capturing speaker identities, must be sampled at 16kHz.
|
| 43 |
|
| 44 |
-
```
|
| 45 |
import torchaudio
|
| 46 |
from speechbrain.inference.TTS import MSTacotron2
|
| 47 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
@@ -66,7 +66,7 @@ torchaudio.save("synthesized_sample.wav", waveforms.squeeze(1).cpu(), 22050)
|
|
| 66 |
|
| 67 |
If you want to generate a random voice, you can use the following:
|
| 68 |
|
| 69 |
-
```
|
| 70 |
import torchaudio
|
| 71 |
from speechbrain.inference.TTS import MSTacotron2
|
| 72 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
@@ -92,7 +92,7 @@ torchaudio.save("synthesized_sample.wav", waveforms.squeeze(1).cpu(), 22050)
|
|
| 92 |
If you want to generate multiple sentences in one-shot, you can do it this way:
|
| 93 |
Note: The model internally reorders the input texts in the decreasing order of their lengths.
|
| 94 |
|
| 95 |
-
```
|
| 96 |
import torchaudio
|
| 97 |
from speechbrain.inference.TTS import MSTacotron2
|
| 98 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
|
| 41 |
**Note:**
|
| 42 |
- The model generates speech at a rate of 22050 Hz, but it's important to note that the input signal, crucial for capturing speaker identities, must be sampled at 16kHz.
|
| 43 |
|
| 44 |
+
```python
|
| 45 |
import torchaudio
|
| 46 |
from speechbrain.inference.TTS import MSTacotron2
|
| 47 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
|
| 66 |
|
| 67 |
If you want to generate a random voice, you can use the following:
|
| 68 |
|
| 69 |
+
```python
|
| 70 |
import torchaudio
|
| 71 |
from speechbrain.inference.TTS import MSTacotron2
|
| 72 |
from speechbrain.inference.vocoders import HIFIGAN
|
|
|
|
| 92 |
If you want to generate multiple sentences in one-shot, you can do it this way:
|
| 93 |
Note: The model internally reorders the input texts in the decreasing order of their lengths.
|
| 94 |
|
| 95 |
+
```python
|
| 96 |
import torchaudio
|
| 97 |
from speechbrain.inference.TTS import MSTacotron2
|
| 98 |
from speechbrain.inference.vocoders import HIFIGAN
|