Guide to Generating Speech

Learn how to generate speech in getimg.ai: write a script, pick a voice, and direct tone and pace with inline cues for natural text to speech.

Step 1. Write your script

Log into getimg.ai and open Create speech in the sidebar, under the Audio group. Type your script in the prompt box, exactly as it should be read. You can write in languages other than English.

You can also direct the delivery from inside the script. Add a cue in brackets before a line to set its tone and pace:

[gentle, emotional] I never thought we would make it this far.
[bright, upbeat] Let's get started.

Because the cues live in the script, you can change the tone, pace, and emotion as often as you like, so one read can move from calm to energetic and back.

Step 2. Choose a voice

Pick a voice from the list. There are plenty to choose from, and each has a name and a short descriptor for its character, such as soft, smooth, upbeat, or warm. Choose the one that fits your script.

Step 3. Generate

Click the arrow to generate. When the audio appears in the gallery, you can play it, download it as a .wav file, or add it to a folder. If the script changes, edit the text and generate again instead of re-recording.

Speech is priced per 1,000 characters, so a read costs what its length costs.

Frequently Asked Questions

Was this guide helpful?

Guide to Generating Speech

Step 1. Write your script

Step 2. Choose a voice

Step 3. Generate

Frequently Asked Questions

How do I control tone and emotion?

Can I generate speech in other languages?

How is speech priced?

What format does the audio download in?

What if my script changes?

Can I use the audio commercially?