Voices - Boson

Choose a preset voice, clone a voice from reference audio, or create a reusable custom voice.

Preset voices

Voice	Style	Sample input
`chloe`	A friendly and clear female voice with an engaging, informative tone and a standard American accent.	`<\|emotion:amusement\|>Hi, I'm Chloe Adams! <\|emotion:enthusiasm\|>I love showing you the little tricks that make everything click so much faster. <\|prosody:pause\|> Ready? Let's jump right in and make this easy and fun.`
`eleanor`	A calm, articulate female voice with a clear, professional Standard American English accent, perfect for educational or explanatory content.	`<\|emotion:contentment\|>Hello, I'm Eleanor Reed. I love making tricky ideas feel simple and easy to follow. <\|prosody:speed_slow\|>We'll take it one step at a time, no rushing. <\|emotion:awe\|>And honestly, there's something wonderful about the moment when it all clicks.`
`jake`	A male speaker with an energetic and slightly dramatic tone, conveying passion and enthusiasm, especially about sports.	`<\|emotion:enthusiasm\|>What's up, I'm Jake Rivers! <\|emotion:elation\|>I live for the big moments, the buzzer-beaters, the comebacks, the whole roller coaster. <\|sfx:laughter\|>Haha, win or lose, I'm all in, and trust me, I bring the energy every single time!`
`marcus`	A male speaker with an enthusiastic, confident, and slightly professorial American delivery.	`<\|emotion:enthusiasm\|>Hello, I'm Marcus Webb, and I genuinely get excited about ideas. <\|emotion:elation\|>Give me any topic and I'll happily walk you through it like it's the most fascinating thing in the world. <\|sfx:laughter\|>Haha, fair warning, I do tend to ramble when something's interesting!`
`nora`	A female speaker with a calm, clear, and narrative voice, using standard American English.	`<\|emotion:contentment\|>Hi there, I'm Nora Vance. I have a soft spot for stories <\|prosody:pause\|> the kind that pull you in slowly. <\|prosody:speed_slow\|>So settle in, and let me paint you a little picture of wherever we're headed.`
`oliver`	A calm, articulate male voice with a thoughtful American accent, well-suited for explanatory or reflective content.	`<\|emotion:contemplation\|>Hi, I'm Oliver Grant. I tend to think out loud, slowly working my way toward what really matters. <\|prosody:pause\|> Self-awareness, meaning, the quiet questions, that's where I like to linger. <\|emotion:contentment\|>Pull up a chair and let's reflect together.`

Use voice to select a preset speaker.

curl https://api.boson.ai/v1/audio/speech \
  -H "Authorization: Bearer $BOSON_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "higgs-audio-v3-tts",
    "input": "Hello, this is a test.",
    "voice": "berlinda"
  }' \
  --output out.mp3

Reference voice

Clone a voice instantly with ref_audio and ref_text.

curl https://api.boson.ai/v1/audio/speech \
  -H "Authorization: Bearer $BOSON_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "higgs-audio-v3-tts",
    "input": "Hello, this is a test.",
    "ref_audio": "https://docs.boson.ai/public/audio/sample.mp3",
    "ref_text": "Same voice, same words, and uh, a completely different presence. I was built for chat native voice, real-time, expressive, and controllable."
  }' \
  --output out.mp3

ref_audio can be a URL or the base64-encoded bytes of a local audio file. Supported formats are wav, mp3, opus, pcm and flac. We recommend 5-30 seconds of clean speech, with no music or background voices. Although ref_text is optional, we recommend providing a verbatim transcript of the reference audio, including filler words.

You must own the right to clone the voice.

Custom voices

Custom voices work like reference voices, but you can reuse the returned voice ID instead of sending ref_audio on every request. First, create a voice ID from reference audio and text. Then pass that ID to voice, just like a preset voice.

​Preset voices

​Reference voice

​Custom voices

Preset voices

Reference voice

Custom voices