Agent YAML Format
An agent config is a YAML file with two fields. The agent's command runs via bash -c in the workspace directory.
The agent should read INSTRUCTION.md and modify the workspace.
Examples
Claude Code
Gemini CLI
Shell Script
Python Agent
The Filesystem Contract
When your agent runs, the workspace contains:

| File | Description |
|---|---|
| INSTRUCTION.md | The task description (always present) |
| Source files | Workspace template files (if the benchmark provides them) |
- Read INSTRUCTION.md to understand the task
- Create or modify files in the current directory
- Exit when done (zero or non-zero exit code)
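The steps listed above can be sketched as a minimal Python agent. This is an illustration under stated assumptions, not the benchmark's reference agent: the function name run_agent and the output file NOTES.md are hypothetical, and a real agent would act on the task rather than echo its first line.

```python
import sys
from pathlib import Path


def run_agent(workspace: Path) -> int:
    """Follow the filesystem contract inside `workspace` and return an exit code."""
    instruction_file = workspace / "INSTRUCTION.md"
    if not instruction_file.is_file():
        # The contract says this file is always present; fail loudly otherwise.
        print("INSTRUCTION.md not found", file=sys.stderr)
        return 1

    # Step 1: read the task description.
    task = instruction_file.read_text(encoding="utf-8").strip()

    # Step 2: create or modify files in the workspace.
    # NOTES.md is a hypothetical output used only for this sketch.
    first_line = task.splitlines()[0] if task else "(empty task)"
    notes = workspace / "NOTES.md"
    notes.write_text(f"Task received: {first_line}\n", encoding="utf-8")

    # Step 3: signal completion through the exit code.
    return 0
```

Because the agent is started in the workspace directory, a script built on this sketch would typically end with sys.exit(run_agent(Path.cwd())).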
Optional: Agent Journal
If your agent writes a journal.yaml file to the workspace, the benchmark parses it for efficiency metrics.
Optional: Trajectory Reference
If your agent writes a trajectory-ref.txt file containing a URI or path to trace data,
the benchmark records it in the trial result for later analysis.
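As a rough sketch, an agent could emit the reference file like this. The helper name write_trajectory_ref is hypothetical, and writing the reference as a single line with a trailing newline is an assumption; the text above only says the file contains a URI or path.

```python
from pathlib import Path


def write_trajectory_ref(workspace: Path, reference: str) -> Path:
    """Write a URI or path to trace data into trajectory-ref.txt."""
    ref_file = workspace / "trajectory-ref.txt"
    # Assumption: one reference on a single line, newline-terminated.
    ref_file.write_text(reference.strip() + "\n", encoding="utf-8")
    return ref_file
```

Calling write_trajectory_ref(Path.cwd(), "traces/run-001.json") just before the agent exits would leave the file in the workspace; the path shown is a made-up example.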