Google officially announced the launch of the voice mode Gemini Live, which will soon be available in the Gemini mobile application. This marks another escalation in the competition between Google and OpenAI in the field of AI voice assistants.
Gemini Live is a brand-new mobile conversation experience that supports natural-language communication and responds with human-like voices and rhythms. It offers 10 voice options, supports hands-free use, and lets users interrupt or change topics at any time. The English version is currently available on Android devices, with an iOS version and support for more languages to be released in the coming weeks.
Compared with OpenAI, Google has advantages in launch speed and potential user scale. Gemini Live will be available to over 3 billion Android users and 2.2 billion iOS users worldwide. However, Gemini Live ran into two minor hiccups during the live demonstration, indicating that its functionality still needs improvement.
Google stated that Gemini redefines the AI assistant: it can integrate with multiple Google applications and tools to complete various tasks. Further extensions, including Keep, Tasks, and others, will be introduced in the future.
On Android, users can access Gemini by long-pressing the power button or by using voice activation. It can understand on-screen content and interact with the application currently in use. Google has also introduced a new model, Gemini 1.5 Flash, to improve response speed and quality.
Additionally, Google launched Pixel Studio, an AI image generation application based on Imagen 3.
Overall, Google is pushing forward the development of AI assistants, attempting to gain an edge in the competition with OpenAI and Apple.