Tech »  Topic »  Guide to prompting Gemini 3.1 Flash TTS (text-to-speech)

Guide to prompting Gemini 3.1 Flash TTS (text-to-speech)


Today, Gemini 3.1 Flash TTS, our latest text-to-speech model, is available on Google AI Studio and Vertex AI. It delivers precise controllability and expressivity, empowering developers and enterprises to build advanced AI-speech applications.

The new TTS model introduces a high level of controllability by allowing you to steer the delivery using 200+ audio tags. We'll share how to get strong results from the model, whether you are building accessible gaming soundtracks, banking systems, or audiobooks. Learn more about the model here

What you will learn:

  1. Model overview

  2. Voice style instructions

  3. The core prompting framework for audio tags

  4. Directing expression and pacing

  5. Use cases: accessibility and inclusive design, creative and entertainment, enterprise use cases

1. Model overview

Gemini 3.1 Flash TTS is available on Google AI Studio and Vertex AI in public preview. 

The model delivers high-fidelity speech and precise control across 70+ languages. These core optimizations bring ...


Copyright of this story solely belongs to google cloudblog . To see the full text click HERE