Custom voices allows you to use your own voice with Text to Speech, letting you generate clips of your trained voice speaking any text you input. Each custom voice costs 300 credits to train.

Creating a custom voice

Select Record audio in the Generative Audio prompt box, then click Try it now.
Upload a voice sample between 2 to 5 minutes in length. The sample must be of your voice or someone whose voice you have explicit permission to clone. Ensure your recording has a good range of voice and tone, and reduce background noise as much as possible.
If you don't have a pre-recorded voice sample to upload, click View example for a script that you can record directly in the app.
Click Record Consent to proceed, and complete voice verification.
Give your voice a name for easy identification and click Submit.

Using your cloned voice

Input any text into the Generative Audio prompt box, then click on your custom voice to select it.

To view all your custom voices, you can click the Custom tab under the Voice menu. Hover over the name of your custom voice to delete, rename, or share it.

How to create a custom voice

Creating a custom voice

Using your cloned voice