INFORMATIVE · ACTIVE
Truth Source: Repository schemas and tests are authoritative.
FLOW-03: Single Agent – With Tools
Source of Truth: tests/golden/flows/flow-03-single-agent-with-tools/
Purpose
This flow covers tool integration via the `agent_role` field and validates protocol semantics for tool-enabled workflows in which steps invoke external tools.
Scope
This evaluation scenario validates:
- Plan steps with `agent_role` indicating tool usage
- Trace recording of tool invocation events and results
- Tool outputs captured in trace events
- Protocol-level tool role semantics
Non-Goals
This scenario does NOT evaluate:
- Basic 2-step flows (see FLOW-01)
- Large plan volumetrics (see FLOW-02)
- LLM enrichment (see FLOW-04)
- Multi-round approval (see FLOW-05)
L2 Modules Exercised
| Module | Role in Flow |
|---|---|
| Context | Establishes problem domain for tool-enabled workflows |
| Plan | Defines steps with agent_role for tool execution |
| Trace | Records tool invocation events and results |
Key Protocol Fields
Plan (Tool Steps)
- `steps[]`: Mix of tool-based and agent logic steps
- Each tool step:
  - `step_id`: UUID v4
  - `description`: Task description
  - `agent_role`: Role/tool indicator (e.g., "curl_executor", "jq_processor")
  - `dependencies`: Optional sequencing
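As an illustration of the step fields listed above, here is a minimal Python sketch of a plan fragment and a field check. Only the field names `step_id`, `description`, `agent_role`, and `dependencies` come from this document. The step descriptions, the `check_step` helper, and the overall dictionary shape are assumptions made for the example; the repository schemas remain authoritative.

```python
import uuid

# Hypothetical plan fragment. The descriptions and the exact nesting are
# assumptions; only the four field names are taken from the document.
fetch_id = str(uuid.uuid4())
extract_id = str(uuid.uuid4())
plan = {
    "steps": [
        {"step_id": fetch_id, "description": "Fetch JSON from an endpoint",
         "agent_role": "curl_executor"},
        {"step_id": extract_id, "description": "Extract fields from the response",
         "agent_role": "jq_processor", "dependencies": [fetch_id]},
        {"step_id": str(uuid.uuid4()), "description": "Summarize the result",
         "agent_role": "agent", "dependencies": [extract_id]},
    ]
}

def is_uuid_v4(value):
    """Return True if value parses as a version-4 UUID string."""
    try:
        return uuid.UUID(value).version == 4
    except (ValueError, AttributeError, TypeError):
        return False

def check_step(step):
    """Return a list of problems found in one step (empty list means OK)."""
    problems = []
    if not is_uuid_v4(step.get("step_id", "")):
        problems.append("step_id is not a UUID v4")
    for field in ("description", "agent_role"):
        value = step.get(field)
        if not isinstance(value, str) or not value:
            problems.append(f"{field} must be a non-empty string")
    # dependencies is optional; when present it should be a list of step ids.
    if "dependencies" in step and not isinstance(step["dependencies"], list):
        problems.append("dependencies must be a list")
    return problems
```

Calling `check_step` on each entry of `plan["steps"]` returns an empty list for the fragment above.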
Tool Roles (via agent_role)
- `agent_role: "curl_executor"` → HTTP/curl tool
- `agent_role: "jq_processor"` → JSON processing
- `agent_role: "agent"` → Standard agent logic
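The role mapping above can be expressed as a small lookup. This is a sketch, not the repository's implementation: the three role names are the examples given in this document, while the `TOOL_ROLES` table and the helper names are hypothetical.

```python
# Example role names from the document; the table itself is illustrative
# and is not claimed to be an exhaustive or official list.
TOOL_ROLES = {
    "curl_executor": "HTTP/curl tool",
    "jq_processor": "JSON processing",
}
AGENT_ROLE = "agent"

def describe_role(agent_role):
    """Map an agent_role value to a human-readable meaning."""
    if agent_role == AGENT_ROLE:
        return "standard agent logic"
    return TOOL_ROLES.get(agent_role, "unknown role")

def is_tool_step(step):
    """A step counts as a tool step when its agent_role names a tool."""
    return step.get("agent_role") in TOOL_ROLES
```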
Trace (Tool Events)
- Tool invocation events (type, timestamp, step reference)
- Tool result events (output summary, errors)
- Deterministic outputs for golden tests
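To make the trace items above concrete, the following Python sketch builds an invocation event and a result event and serializes them deterministically. The event type names (`tool_invocation`, `tool_result`), the key names, the pinned timestamp, and the sample step id are all assumptions for illustration; the actual event shape is defined by the repository schemas.

```python
import json

# Assumption: golden fixtures pin timestamps instead of reading the clock,
# so repeated runs serialize to identical bytes.
FIXED_TS = "2025-01-01T00:00:00Z"

def tool_events(step_id, agent_role, output_summary, error=None):
    """Return an invocation event followed by its result event."""
    invocation = {
        "type": "tool_invocation",   # hypothetical event type name
        "timestamp": FIXED_TS,
        "step_id": step_id,          # step reference
        "agent_role": agent_role,
    }
    result = {
        "type": "tool_result",       # hypothetical event type name
        "timestamp": FIXED_TS,
        "step_id": step_id,
        "output_summary": output_summary,
        "error": error,
    }
    return [invocation, result]

def canonical(events):
    """Serialize with sorted keys and fixed separators for stable comparison."""
    return json.dumps(events, sort_keys=True, separators=(",", ":"))

# Sample step id chosen for the example only.
SAMPLE_STEP_ID = "3f2b8c1e-6d4a-4f7b-9a2e-1c5d7e8f9a0b"
first = canonical(tool_events(SAMPLE_STEP_ID, "curl_executor", "HTTP 200, 2 items"))
second = canonical(tool_events(SAMPLE_STEP_ID, "curl_executor", "HTTP 200, 2 items"))
```

Because nothing in the events depends on wall-clock time or dictionary ordering, `first` and `second` compare equal, which is the property a golden comparison relies on.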
Integration Dimensions (L3/L4)
Tool Integration (via Runtime)
- Tools are runtime capabilities, not L2 protocol constructs
- `agent_role` provides protocol-level indication of tool usage
- Actual tool execution happens at L4 (Runtime Agent Execution layer)
- For golden tests: tool outputs are pre-determined
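One way to picture the split described above is a runtime-side registry keyed by `agent_role`, with the plan carrying nothing but the role string. The sketch below is an assumption about how a golden-test runtime might stub tools; `GOLDEN_HANDLERS`, `execute_step`, and the canned output strings are hypothetical and do not describe the actual L4 runtime.

```python
# Hypothetical golden-test registry: each handler ignores the network and
# returns a pre-determined string, so outputs never vary between runs.
GOLDEN_HANDLERS = {
    "curl_executor": lambda step: '{"items": [1, 2]}',
    "jq_processor": lambda step: "2",
}

def execute_step(step, handlers):
    """Dispatch a step to the runtime capability registered for its role.

    The plan only names the role; which callable serves that role is a
    runtime decision, which is why the registry is passed in separately.
    """
    role = step["agent_role"]
    handler = handlers.get(role)
    if handler is None:
        raise LookupError(f"no runtime capability registered for role {role!r}")
    return handler(step)
```

Swapping `GOLDEN_HANDLERS` for a registry of real tool wrappers would change execution without touching the plan, which is the point of keeping tools out of the L2 protocol.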
Evidence
| Type | Location | Status |
|---|---|---|
| Golden Flow | tests/golden/flows/flow-03-single-agent-with-tools/ | ✅ Passed |
| Input Fixtures | tests/golden/flows/flow-03-single-agent-with-tools/input/ | Available |
| Expected Fixtures | tests/golden/flows/flow-03-single-agent-with-tools/expected/ | Available |
Expected Behavior
- Tool steps execute via `agent_role` mechanism
- Tool outputs captured in trace events
- No `tool_name` or `parameters` fields (these are runtime concerns)
- Context remains immutable throughout
- Trace correctly records tool invocation sequence
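Two of the expectations above lend themselves to simple checks: the absence of `tool_name` and `parameters` on plan steps, and context immutability. The Python sketch below shows one possible form. The helper names and the dictionary shapes are assumptions; the field names `tool_name` and `parameters` are the only parts taken from this document.

```python
import copy

# Field names the document says must not appear on plan steps.
FORBIDDEN_STEP_FIELDS = ("tool_name", "parameters")

def forbidden_fields(plan):
    """Return (step_id, field) pairs for runtime-only fields found in the plan."""
    return [
        (step.get("step_id"), field)
        for step in plan.get("steps", [])
        for field in FORBIDDEN_STEP_FIELDS
        if field in step
    ]

def context_unchanged(context, run):
    """Call run(context) and report whether the context still equals its snapshot."""
    snapshot = copy.deepcopy(context)
    run(context)
    return context == snapshot
```

A flow run passes the first check when `forbidden_fields(plan)` is empty, and the second when `context_unchanged` returns `True` for the function that executes the flow.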
Invariants Tested
- Steps with tool roles have valid `agent_role` values
- Tool invocations produce trace events
- Output determinism for golden test validation
- No schema violations from tool-related fields
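The link between tool steps and trace events can be sketched as a check over a plan and its trace. This is a hedged illustration only: the event type name `tool_invocation`, the `step_id` key on events, the `TOOL_ROLES` set, and the dependency-order rule are assumptions, not the golden test harness itself.

```python
# Example tool role names from this document; treated here as the valid set.
TOOL_ROLES = {"curl_executor", "jq_processor"}

def check_invariants(plan, trace_events):
    """Return problems: tool steps without an invocation event, or steps
    invoked before one of their dependencies."""
    order = [e["step_id"] for e in trace_events
             if e.get("type") == "tool_invocation"]  # hypothetical type name
    position = {step_id: index for index, step_id in enumerate(order)}
    problems = []
    for step in plan["steps"]:
        if step["agent_role"] not in TOOL_ROLES:
            continue  # agent logic steps are out of scope for this check
        step_id = step["step_id"]
        if step_id not in position:
            problems.append(f"{step_id}: no tool invocation event")
            continue
        for dep in step.get("dependencies", []):
            if dep in position and position[dep] > position[step_id]:
                problems.append(f"{step_id}: invoked before dependency {dep}")
    return problems
```

An empty result means every tool step in the plan has a matching invocation event and the recorded order respects the declared dependencies.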
Document Status: Informative (Evaluation Scenario)
Source of Truth: tests/golden/flows/flow-03-single-agent-with-tools/README.md