
Gemini Live voices change in cadence and tone after recent updates

Photo: Gemini - Primakov / Shutterstock.com

Gemini Live users are noticing changes to the voice options of Google's artificial intelligence assistant. The changes include variations in speech cadence and tone, and even a mixing of regional accents during real-time interactions. These changes frequently occur after model updates, such as the recent version 3.1 Flash Live, and affect the experience of personalized conversations.

Many reports indicate that the voice previews in the app do not match the actual sound heard when using the Live feature. The Capella option, which uses a female British accent, has changed more noticeably since its initial release. Other regional voices also exhibit similar consistency issues.

Changes in the cadence and tone of voices

Changes in speech cadence are among the most common complaints from users of the different voice options on Gemini Live. Speech slows down in various settings, while high-pitched tones are noticeably reduced. In some cases, responses alternate between Australian accents and more neutral American variants during ongoing conversations.

These adjustments occur gradually after resetting the application: the selected accent holds for a short period before shifting into a hybrid version. The experience can be uncomfortable for those who expect consistency in interactions. Users who have longer conversations notice these transitions more frequently.

Photo: Gemini - mundissima / Shutterstock.com

Comparison between the preview and actual use of the feature

The audio preview available in the Gemini Live settings often differs from the results obtained in active conversation sessions. This difference especially affects personalized voices, which lose their original characteristics over time. Reports accumulated in recent months point to a progressive deterioration in several of the available options.

  • The slower cadence impacts the natural flow of responses.
  • High tones are softened, changing the personality of the voice.
  • Mixes of accents occur unpredictably in dialogues.
  • Resetting the app temporarily and partially restores the initial behavior.

These observations come in a context of frequent updates to Google’s AI models, which aim to improve overall performance but generate side effects on voices.

Audio artifacts in Gemini Live sessions

Sound artifacts such as pops and hiss appear sporadically during use of Gemini Live. These noises are not directly linked to the voice changes, but they are another recurring complaint on the company’s support forums. Their occurrence varies depending on the voice option selected and is not always repeated identically.

Many users are able to reproduce the problem in targeted tests, while others observe artifacts only under specific conditions. Audio quality remains stable in quick voice commands and in Android Auto in vehicles. This difference suggests that the problem is concentrated in longer conversation sessions or in certain contexts of use.

Behavior in different interaction scenarios

Gemini Live’s voices remain more stable when the assistant is activated for brief commands or simple voice controls. During deeper conversational interactions, however, changes in cadence and tone become more apparent. In cars, the feature accessed via Android Auto also better preserves the original characteristics of the selected options.

Google has received inquiries about these behaviors, but so far it has not officially acknowledged them or confirmed that fixes are in progress. Users continue to test the different available voices, including Capella, to identify which ones show the least variation over time.

Available options and in-app adjustments

Gemini Live offers multiple customizable voices with distinct accents and tones, such as options that simulate British, neutral American, and other regional variations. Users can change their selection directly in the app’s settings to find the voice that best suits their preferences. Changing voices does not always solve cadence problems permanently.

  • Options include voices with higher or lower pitch characteristics.
  • Some better preserve the accent chosen in initial sessions.
  • Resetting the app may temporarily restore expected behavior.
  • Model updates influence the overall performance of voices.

These features allow for greater customization, but the reported inconsistencies highlight the need for adjustments by the company responsible for development.

Evolution of voices in Gemini Live over time

Over the past few months, several voice options for Gemini Live have undergone modifications that alter aspects such as speech speed and the mixing of accents. These changes coincide with improvements in other aspects of the AI models, including response speed and contextual understanding. The feature continues to evolve, with updates that aim to make interactions more fluid.

Users who rely on specific voices for daily tasks or accessibility see direct impacts on usability. Consistency between the audio preview and the actual execution remains a point of attention for those who use the assistant in prolonged conversations. Google continues to improve the system, based on feedback received about the performance of the voices.