Text to Audio
With Text to Audio you convert text into an audio file. This is useful for voice-overs, instructional videos, internal communications, training material, and scripts.
Starting from the dashboard
On the dashboard, select Text to Audio under the input field. The input field expands so you can comfortably enter longer scripts. You can then fill in the text and generate audio.
Settings
Use the settings button next to the input field to adjust the speech settings.
| Setting | Description |
|---|---|
| Model | Choose the text-to-speech model. |
| Language | Choose the language in which the text should be spoken. |
| Voice | Choose a voice suitable for the selected language. |
| System prompt | Provide instructions for pronunciation, tone, speed, accent, and special terms. |
| Style reference | Add extra cues about the desired speaking style. |
The voice list is filtered by the chosen language. If a voice is intended for only certain languages, you will see that language listed with the voice.
Pronunciation and style
The system prompt dictates how the voice should sound. For example, you can specify:
- that the speaker should sound like a native Dutch speaker;
- that words like AI, AI-Corporate, ChatGPT, OpenAI and Gemini may be pronounced in English;
- that Claude should be pronounced as a French name;
- or that the tone should be calm, warm, corporate, informal, low-energy or energetic.
When you choose another language, AI-Corporate adjusts the default instructions to that language.
Saving and restoring
You can save your settings to your account. AI-Corporate will remember, among other things, the model, language, voice, and system prompt. By Restore defaults you remove these saved preferences.
Result
After generation, the audio file appears directly in the chat. You can play it there with the audio player and download it with the download button.
During generation, the input form is temporarily disabled to prevent multiple audio generations from running simultaneously.