Skip to main content
INFORMATIVEACTIVEDocumentation Governance

FLOW-03: Single Agent – With Tools

Source of Truth: tests/golden/flows/flow-03-single-agent-with-tools/

Purpose

Tool integration via agent_role field. Validates protocol semantics for tool-enabled workflows where steps invoke external tools.

Scope

This evaluation scenario validates:

  • Plan steps with agent_role indicating tool usage
  • Trace recording of tool invocation events and results
  • Tool outputs captured in trace events
  • Protocol-level tool role semantics

Non-Goals

This scenario does NOT evaluate:

  • Basic 2-step flows (see FLOW-01)
  • Large plan volumetrics (see FLOW-02)
  • LLM enrichment (see FLOW-04)
  • Multi-round approval (see FLOW-05)

L2 Modules Exercised

ModuleRole in Flow
ContextEstablishes problem domain for tool-enabled workflows
PlanDefines steps with agent_role for tool execution
TraceRecords tool invocation events and results

Key Protocol Fields

Plan (Tool Steps)

  • steps[]: Mix of tool-based and agent logic steps
  • Each tool step:
    • step_id: UUID v4
    • description: Task description
    • agent_role: Role/tool indicator (e.g., "curl_executor", "jq_processor")
    • dependencies: Optional sequencing

Tool Roles (via agent_role)

  • agent_role: "curl_executor" → HTTP/curl tool
  • agent_role: "jq_processor" → JSON processing
  • agent_role: "agent" → Standard agent logic

Trace (Tool Events)

  • Tool invocation events (type, timestamp, step reference)
  • Tool result events (output summary, errors)
  • Deterministic outputs for golden tests

Integration Dimensions (L3/L4)

Tool Integration (via Runtime)

  • Tools are runtime capabilities, not L2 protocol constructs
  • agent_role provides protocol-level indication of tool usage
  • Actual tool execution happens at L4 (Runtime Agent Execution layer)
  • For golden tests: tool outputs are pre-determined

Evidence

TypeLocationStatus
Golden Flowtests/golden/flows/flow-03-single-agent-with-tools/✅ Passed
Input Fixturestests/golden/flows/flow-03-single-agent-with-tools/input/Available
Expected Fixturestests/golden/flows/flow-03-single-agent-with-tools/expected/Available

Expected Behavior

  • Tool steps execute via agent_role mechanism
  • Tool outputs captured in trace events
  • No tool_name or parameters fields (these are runtime concerns)
  • Context remains immutable throughout
  • Trace correctly records tool invocation sequence

Invariants Tested

  • Steps with tool roles have valid agent_role values
  • Tool invocations produce trace events
  • Output determinism for golden test validation
  • No schema violations from tool-related fields

Document Status: Informative (Evaluation Scenario)
Source of Truth: tests/golden/flows/flow-03-single-agent-with-tools/README.md