News (EN)

Google reveals technology for Gemini to control applications on Android without relying on the cloud

Google
Google - daily_creativity/ shutterstock.com

The Google presented significant advances for the Android 17 ecosystem, focused on the autonomy of artificial intelligence agents within the operating system. The new tools, called AppFunctions and Intelligent UI automation, allow assistants like Gemini to perform complex operations in third-party software without the need for manual touches on the screen. Essa evolution aims to transform the cell phone into an active assistance platform, where the system understands and executes the user’s contextual demands in a fluid and integrated way.

The technology giant’s strategy prioritizes executing processes directly on the device’s hardware, avoiding the constant sending of information to remote servers. Essa local processing approach seeks to mitigate privacy risks and ensure that sensitive data remains under the control of the device owner. Além of security, the “on-device” architecture drastically reduces latency, providing almost instantaneous responses to voice commands and automated interactions.

Application developers will have access to easy resources to connect their products to the system’s virtual assistants. The intention is to create an environment where navigation between different services occurs invisibly, eliminating barriers between installed apps. The official statement was directed to the programming community, detailing how these implementations can be carried out in the next updates.

Technical operation and integration via AppFunctions

The core functionality, called AppFunctions, introduces a new platform API accompanied by a specific Jetpack library. Esse feature allows software creators to expose functionality and data from their applications for direct access by artificial intelligence. The system is thus able to skip traditional visual navigation steps, going straight to executing the requested task.

By adopting this framework, AI gains the ability to interpret intentions and manipulate the application in the background, without necessarily opening it in the main interface. The architecture was designed to be implemented quickly:

– Não requires a complete rewrite of the original code of existing applications.

– Permite internal searches in media apps and order management on service platforms.

– Garante interoperability between different services installed on the smartphone.

– Economiza system resources and battery when preventing full graphical rendering of the app.

Initial tests with Gemini already demonstrate the tool’s versatility in real scenarios. The ability to connect user intentions with concrete actions within applications promises to change the dynamics of smartphone use.

Experience on Samsung and Pixel devices

The first practical demonstrations of the new technologies are taking place on high-performance hardware, with emphasis on the Galaxy S26 series and the Pixel 10 models. The user can verbally request to organize or view photos of specific events, and the assistant performs the task autonomously.

These tests reinforce the close collaboration between Google and partner manufacturers to standardize AI experiences in the market. The OneUI 8.5 interface, present on the evaluated Samsung devices, already offers native support for these system calls, indicating a trend for the future of Android. The expectation is that this integration will become the industry standard with the consolidation of Android 17.

For users of the Pixel 10 line, the focus is on operational efficiency and eliminating intermediate menus. Interaction with stored content becomes more direct, reinforcing the proposal for an operating system less dependent on the traditional tactile interface.

Visual automation and data security

For applications that have not yet adopted dedicated APIs, Google has introduced the Intelligent UI automation feature. Trata is a framework capable of visually interpreting the screen and performing actions such as clicking, scrolling and filling in fields. Essa solution allows AI to operate software autonomously, based only on reading visual elements, functioning as a bridge to universalize artificial intelligence access to the app ecosystem.

Security remains a fundamental pillar in the development of these technologies. Executing commands locally minimizes exposure to cyberattacks that could occur when transmitting data to the cloud. The operating system will include granular permission layers, ensuring that the user has full control over which applications can be manipulated by intelligent agents, preventing sensitive actions without explicit consent.

To Top