Higgs models
Higgs Audio v3 TTS
Natural, expressive text-to-speech in 100 languages, with controllable style for real-time voice applications.
Higgs Avatar v1
Avatar video generation from a single still image and a driving voice, built for real-time interaction.
Higgs Audio v3 Instruct
Audio-in LLM with voice-agent reflexes: understands speech and text, follows instructions, calls tools, handles interruptions, and returns grounded text output.