Apps¶

Environment: a Family of Apps¶

One of Agents Research Environments’ core principles is “apps.” As explained in the previous section, environments are based on a major concept: applications. To train agents to solve daily and tedious tasks for users, it is essential to get as close as possible to the real deployment environment of these agents. A first env we introduce primarily aims at training agents capable of evolving in environments similar to those of operating systems such as iOS, Android, Meta RayBan OS, QuestOS, etc. It is therefore natural to design this environment by integrating applications, much like a mobile phone.

Definition: An application, in the platform, is a coherent set of APIs. Its purpose is to interact with a data source. For example, a Mails app will have tools such as send_email or delete_email, as well as some database of emails. These applications are “stateful,” meaning that the result of two consecutive calls to the same API with the same arguments is not guaranteed to be identical, as it depends on the internal state of the application. This design aims to train agents to reason by taking into account a global context.

Applications¶

Some apps in the initial environment play a crucial role in the execution of scenarios. We give more details below about the most important ones, giving a glimpse of the possibilities offered by the platform. Thanks to the high flexibility of the system, it is also possible to integrate MCP interfaces ( MCPApp - Model Context Protocol) and any other interface as long as they are wrapped to match the platform’s standards. For instance, some recent efforts have integrated SQL databases to test agents’ compatibility with different search strategies.

Agent-User Interface

The Agent-User Interface (AUI) app is a required component in every environment, serving as the communication bridge between users and AI agents. Unlike traditional chat models where interaction happens through direct messaging, our system models user-agent communication through a dedicated ARE application.

This design requires the agent to make specific tool calls when reporting information to the user, rather than simply generating responses. When users send messages to the agent through the AUI, their content is automatically injected into the agent’s memory for processing. We highlight that this is the current implementation choice and that better alternatives may exist, like using the Notification System to surface user messages to the agent (see Notifications).

This tool-based communication model provides several advantages:

It allows precise control over when the agent should communicate with the user versus when it should continue working silently on tasks
It enables users to interact with the agent even while the agent is actively engaged in completing other objectives

The AUI thus creates a more realistic interaction pattern that mirrors how people actually work with digital assistants in complex, multi-tasking environments.

Note

Code Pointer: More information can be found in the AUI App (see apps/agent_user_interface.py).

System App

Some applications operate at a higher abstraction level than conventional data sources. For instance, the System app offers agents essential tooling for temporal operations, including:

Retrieving the current simulation time
Waiting for specified durations
Suspending execution until new events arrive in the system

The waiting APIs represent a particularly sophisticated feature set that directly interfaces with the simulation’s core execution model. These APIs possess the unique capability to modify environment execution flow, allowing agents to pause their operations in a controlled manner that integrates seamlessly with the event-driven architecture.

Crucially, these waiting mechanisms serve critical evaluation purposes by enabling the acceleration of long-horizon task execution, allowing researchers to efficiently evaluate agent performance across extended temporal scenarios without requiring real-time execution delays.

This system-level functionality demonstrates how the platform not only supports application-specific interactions but also fundamental simulation control mechanisms that enable complex agent behaviors and efficient evaluation methodologies.

Note

Code Pointer: More information can be found in the System App (see apps/system.py).

Next Steps

Keep reading the Foundations guide to learn more about Events.
Check the technical details of Apps in Apps API.