Render text to speech with timing information and subtitles

This method takes text input and generates an audio clip, along with word timing data. Word timing output includes word level timing in a json file, an srt file, and vtt file.

Body Params
integer
enum
required
string
required
length between 1 and 1000
string
enum

Which model to use. Note that not all voices are available on all models.

Allowed:
library_ids
array of strings
library_ids
audio_configs
object
Headers
boolean

Enables limited SSML translation for input text

Responses

401

API key is missing or invalid

429

Rate limit has been exceeded

Language
Credentials
Header
Response
Click Try It! to start a request and see the response here! Or choose an example:
*/*