mlx-community/Irodori-TTS-500M-v2-VoiceDesign-fp16

This model was converted to MLX format from Aratako/Irodori-TTS-500M-v2-VoiceDesign using mlx-audio version 0.4.3.

Refer to the original model card for more details on the model.

The VoiceDesign variant generates speech conditioned on a text description of the desired voice instead of a reference audio clip.

Use with mlx-audio

pip install -U mlx-audio

CLI Example

python -m mlx_audio.tts.generate \
  --model mlx-community/Irodori-TTS-500M-v2-VoiceDesign-fp16 \
  --text "こんにちは、テストです。" \
  --instruct "穏やかで落ち着いた女性の声。ゆっくりと話す。"

Python Example

from mlx_audio.tts.generate import generate_audio

generate_audio(
    model="mlx-community/Irodori-TTS-500M-v2-VoiceDesign-fp16",
    text="こんにちは、テストです。",  # "Hello, this is a test."
    instruct="穏やかで落ち着いた女性の声。ゆっくりと話す。",  # "A calm, composed female voice. Speaks slowly."
    file_prefix="output",
)
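
Because the voice is controlled by a text description, it can be handy to render the same sentence with several descriptions and compare the results. The following is a minimal sketch of that idea, not part of mlx-audio: the helper `render_voices`, the `VOICES` dictionary, the second voice description, and the `output_<name>` prefix scheme are all hypothetical names chosen for illustration. It assumes `generate_audio` accepts the same keyword arguments shown in the Python example above.

```python
from typing import Callable, Dict, List, Optional

MODEL_ID = "mlx-community/Irodori-TTS-500M-v2-VoiceDesign-fp16"

# Hypothetical voice descriptions, keyed by a short label used in the file prefix.
VOICES: Dict[str, str] = {
    # "A calm, composed female voice. Speaks slowly." (from the example above)
    "calm_female": "穏やかで落ち着いた女性の声。ゆっくりと話す。",
    # "A bright, energetic male voice." (illustrative, not from the model card)
    "bright_male": "明るく元気な男性の声。",
}


def render_voices(
    text: str,
    voices: Dict[str, str],
    generate_fn: Optional[Callable[..., object]] = None,
) -> List[str]:
    """Synthesize `text` once per voice description and return the file prefixes used."""
    if generate_fn is None:
        # Imported lazily so the helper can be exercised without mlx-audio installed.
        from mlx_audio.tts.generate import generate_audio as generate_fn

    prefixes: List[str] = []
    for name, description in voices.items():
        prefix = f"output_{name}"
        # Same keyword arguments as the single-call example above.
        generate_fn(
            model=MODEL_ID,
            text=text,
            instruct=description,
            file_prefix=prefix,
        )
        prefixes.append(prefix)
    return prefixes
```

Calling `render_voices("こんにちは、テストです。", VOICES)` would then run one generation per entry, with each output named after its label. Passing a different `generate_fn` is only meant for dry runs and testing.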