Render text to speech with timing information and subtitles

This method takes text input and generates an audio clip, along with word timing data. Word timing output includes word level timing in a json file, an srt file, and vtt file.

Language
Credentials
Header
Click Try It! to start a request and see the response here!