Task
A Task is a directory containing everything an agent needs to complete a goal.instruction.md
Natural language description of what the agent should accomplish. This becomes part of the agent’s prompt.
task.toml
Configuration including Docker image, GUI mode, timeouts, and resource limits.
Learn more about Tasks
See how to create and configure tasks
Gateway
The Gateway is the abstraction layer over LLM providers. It routes requests to the appropriate backend based on the model identifier.| Model Pattern | Provider |
|---|---|
gemini/* or contains gemini | Google Gemini |
claude-* | Anthropic Direct |
bedrock/* or contains anthropic. | AWS Bedrock |
openai/* or computer-use-preview | OpenAI |
Model Configuration
See all supported models and how to configure them
Environment
The Environment provisions isolated containers for agent execution. Helios supports two providers:Docker (Local)
Run containers locally using Docker. Supports headless and GUI modes with VNC.
Daytona (Cloud)
Run in cloud sandboxes for scalable, distributed execution.
Execution Modes
| Mode | Description | Use Case |
|---|---|---|
| Headless | No GUI, bash and editor tools only | CLI tasks, scripting, file manipulation |
| GUI | Full desktop with VNC | Browser automation, desktop apps, visual tasks |
Environment Configuration
Learn about Docker images, GUI mode, and cloud providers
Tools
Tools are the interfaces agents use to interact with the environment.bash
bash
Execute shell commands inside the container. Available in all modes.
editor
editor
Make structured file edits. Useful for multi-line changes where shell commands would be brittle.
- View file contents
- Insert or replace text
- Create new files
computer
computer
Mouse and keyboard interactions. Only available in GUI mode.
| Action | Description |
|---|---|
left_click | Click at coordinates |
right_click | Right-click |
double_click | Double-click |
triple_click | Select line |
type | Type text |
key | Press key (e.g., Return, ctrl+c) |
scroll | Scroll up/down |
screenshot | Capture screen |
Tools Reference
Complete tool documentation and examples
Verifier
The Verifier runstests/test.sh inside the container after the agent completes (or times out) and checks the outcome.
Reward Values
| Value | Meaning |
|---|---|
1 | Pass |
0 | Fail |
0.0-1.0 | Partial credit |
How It Works
/logs/verifier/reward.txt, which Helios reads to determine success or failure.
Verification Guide
Write robust verification scripts
Web Viewer
The Web Viewer provides real-time observability into agent execution.What You Can See
- Live VNC view of the desktop (GUI mode)
- Screenshots at each step
- Tool calls and their outputs
- LLM messages and responses
- Verification results
Web Viewer Guide
Learn how to use the web viewer