Is it possible to prompt the model with instructions, like Unmute?

by deathknight0 - opened Jun 29, 2025

Jun 29, 2025

Thank you for your release. Just wondering if this repo supports prompting the model natively - like what you implemented in Unmute.

vvolhejn

Kyutai org Jun 30, 2025

Hi, what would that mean for a speech-to-text? It is possible to feed in a prefix of audio + transcript which would condition the model to transcribe words in certain ways. For that you'd want to hack around in the Pytorch implementation, see here, we don't provide a high-level interface for that atm. But if you mean something like "transcribe every proper name in upper case", that is not possible.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment