Text-to-Speech Models

Generate natural-sounding speech from text with Assisters TTS, our advanced voice synthesis model.

Assisters TTS v1

Model IDstring

assisters-tts-v1

Our state-of-the-art text-to-speech model with 300+ natural voices in 100+ languages.

Specification	Value
Model ID	`assisters-tts-v1`
Voices	300+
Languages	100+
Max Input	4,096 characters
Price	$0.01 / 1,000 characters
Latency	~100ms first audio

Capabilities

Natural Voices: Human-like speech with proper intonation
Multilingual: 100+ languages with native accents
Voice Variety: 300+ unique voices (male, female, various ages)
Streaming: Real-time audio streaming
Multiple Formats: MP3, WAV, OGG, FLAC output

Example Usage

from openai import OpenAI

client = OpenAI(
    base_url="https://api.assisters.dev/v1",
    api_key="your-api-key"
)

response = client.audio.speech.create(
    model="assisters-tts-v1",
    voice="alloy",
    input="Hello! Welcome to Assisters. I'm excited to help you build amazing applications."
)

# Save audio file
response.stream_to_file("output.mp3")

With Different Voices

# Female voice
response = client.audio.speech.create(
    model="assisters-tts-v1",
    voice="nova",
    input="This is a friendly female voice."
)

# Male voice
response = client.audio.speech.create(
    model="assisters-tts-v1",
    voice="onyx",
    input="This is a deep male voice."
)

Streaming Audio

from openai import OpenAI
import pyaudio

client = OpenAI(
    base_url="https://api.assisters.dev/v1",
    api_key="your-api-key"
)

# Stream audio in real-time
with client.audio.speech.with_streaming_response.create(
    model="assisters-tts-v1",
    voice="alloy",
    input="This text is being converted to speech in real-time!"
) as response:
    for chunk in response.iter_bytes():
        # Play or process audio chunks
        audio_player.write(chunk)

Available Voices

Voice	Description	Best For
`alloy`	Neutral, balanced	General purpose
`echo`	Warm, friendly	Customer service
`fable`	Expressive, storytelling	Audiobooks
`onyx`	Deep, authoritative	Professional content
`nova`	Bright, energetic	Marketing, tutorials
`shimmer`	Soft, gentle	Meditation, wellness

300+ additional voices are available. See the voice gallery for the complete list with audio samples.

Parameters

Parameter	Type	Default	Description
`input`	string	required	Text to convert (max 4096 chars)
`model`	string	required	Model ID (`assisters-tts-v1`)
`voice`	string	required	Voice ID to use
`response_format`	string	"mp3"	Audio format
`speed`	float	1.0	Speed multiplier (0.25-4.0)