The avatar request will return information about the available voices in our text-to-speech API, such as speaker_ID, accent, avatar ID, and characteristics. The endpoint will assist your workflow in the following ways:

  1. Voice Selection: Programmatically select the most suitable voice for your application based on criteria such as ID, accent, and speaking style.
  2. Customization: Tailor the voice selection to match the specific needs or preferences of your users.
  3. Flexibility: Easily experiment with different voices and styles, allowing for creative and innovative applications.
  4. Efficiency: Quickly retrieve the latest information about available voices without the need for manual configuration, saving time and effort.

How do you use the endpoint?

Request

Make a GET request to this URL, including your API key in the header:

https://api.wellsaidlabs.com/v1/tts/avatars

An example curl command:

curl --location 'https://api.wellsaidlabs.com/v1/tts/avatars' \
--header 'X-API-KEY: <api-key-here>'

Response

Below is an example of a successful response showing one avatar with the following structure:

{
	"avatars": [
		{
			"name": "Alana B.",
			"id": 3,
			"style": "Narration",
			"accent_type": "United States",
			"characteristics": [
				"Clear",
				"Crisp",
				"Focused",
				"Informative",
				"Strong"
			],
			"otherTags": [],
			"preview_audio": "https://media.graphassets.com/Ogq9S0cXRsOyUkYRjxWe",
			"locale": "en_US",
			"language": "English",
			"language_variant": "United States",
			"source": null
    }
  ]
}

Details of response

Fieldtypenotes
namestringThe name of the voice
idintegerthe ID is unique to the voice and voice style. For example, Wade C. conversational style is ID 46 and Wade C. narration style is ID 30
stylestringAvailable styles include: Narration, promo, conversational, character and custom
NOTE: custom styles are used for our non-English language voices
accent_typestringIndicates if there is a specific regional accent, such as United States β€” Appalachia
characteristicsarrayIndicates subjective characteristics of the voice, such as crisp, informative, friendly, etc.
preview_audiostringReturns a sample of the voice if the sample is available. If no sample is available, the result is null
localestringShows a combination of language and the language variant, i.e. en_UK is English, United Kingdom
languagestringIndicates the language that the voice speaks
language_variantstringIndicates the country where the language is spoken, i.e. United Kingdom for English or Mexico for Spanish.
sourcestringThis field should be null
otherTagsarrayOther tags might include New or Featu

πŸ“˜

While not frequent, the details of an avatar may change, i.e. a new ID may be available for an avatar or a characteristic may change.