Google ships Gemini 3.5 Live Translate for real-time voice conversations

Google’s latest model turns spoken language into spoken language while keeping the speaker’s tone, pace, and pitch intact.

Google announced Gemini 3.5 Live Translate, a voice-to-voice translation system that converts speech in one language to speech in another with what the company describes as near-zero added delay. The output retains the original speaker’s tone, pacing, and pitch rather than flattening them into a generic synthetic voice. Every translated segment carries a SynthID watermark.

The feature appears in three Google products at launch: Google AI Studio, the Google Translate app, and Google Meet. In Google Meet it can run during live calls, converting one participant’s speech for listeners on the other side of the language barrier. The same model powers the updated Google Translate mobile experience and a new endpoint inside AI Studio for developers.

Prior translation pipelines at Google split the work into separate speech recognition, text translation, and text-to-speech steps. Each hand-off introduced latency and stripped away vocal characteristics. Gemini 3.5 Live Translate collapses the pipeline into a single model pass trained to map audio directly to audio while preserving prosody.
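
The difference between the two designs can be illustrated with a toy model. The sketch below is not Google’s implementation and calls no real API: the `Signal` type, the `stage` helper, and every latency figure are hypothetical placeholders chosen only to show two structural points, namely that per-stage delays add up in a cascade, and that prosody discarded at the text step cannot be recovered by later stages.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Signal:
    """Toy stand-in for audio or text moving through a pipeline."""
    content: str
    prosody: Optional[str]  # None once vocal characteristics are lost
    elapsed_ms: int = 0     # accumulated delay, in made-up units


Stage = Callable[[Signal], Signal]


def stage(latency_ms: int, keeps_prosody: bool) -> Stage:
    """Build a hypothetical stage that adds delay and may drop prosody."""
    def run(sig: Signal) -> Signal:
        return Signal(
            content=sig.content,
            prosody=sig.prosody if keeps_prosody else None,
            elapsed_ms=sig.elapsed_ms + latency_ms,
        )
    return run


def run_pipeline(stages: List[Stage], sig: Signal) -> Signal:
    """Pass the signal through each stage in order."""
    for step in stages:
        sig = step(sig)
    return sig


# Placeholder latencies: illustrative only, not measurements of any product.
cascaded = [
    stage(300, keeps_prosody=False),  # speech recognition: audio becomes plain text
    stage(200, keeps_prosody=True),   # text translation: nothing left to preserve
    stage(400, keeps_prosody=True),   # text-to-speech: generic voice
]
direct = [
    stage(500, keeps_prosody=True),   # single audio-to-audio pass
]

source = Signal(content="hello", prosody="rising pitch, fast pace")
cascaded_out = run_pipeline(cascaded, source)
direct_out = run_pipeline(direct, source)

print(cascaded_out)
print(direct_out)
```

In this toy, the cascade ends with `prosody=None` because the recognition step reduced the speech to text, while the single pass carries the prosody field through unchanged. Real latencies and quality would have to come from measurement, which Google has not yet published.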

No independent benchmarks or third-party reviews have been published yet. Google’s own announcements do not include accuracy figures for different language pairs or acoustic conditions.

Why it matters

For product teams that already route calls or meetings through Google infrastructure, the change removes a visible seam. Meetings that once required a separate interpreter or a slow caption-and-read-aloud workflow can now run with continuous spoken translation. The SynthID watermark gives enterprises a way to identify translated audio as machine-generated, which matters when the output may be stored or forwarded.

The move also tightens Google’s hold on the conversation layer of its cloud services. Teams evaluating competing translation APIs will now weigh whether the convenience of staying inside Meet or Translate outweighs any accuracy or cost advantages elsewhere.
