INFORMATIVE | DRAFT | Documentation Governance

Conformance & Evaluation

"Can I prove what the agents did, why they did it, and whether the system stayed within approved lifecycle guarantees?"

MPLP's answer is: evidence-first lifecycle governance, not "trust the prompt" or "trust best practices".

2. What This Section Is NOT

This section does not provide:

  • Certification, endorsement, or compliance badges
  • Legal advice or regulatory determinations
  • A mandatory runtime, SDK, or framework adapter
  • "One true" implementation pattern for agents

MPLP is vendor-neutral by design. Conformance is judged by evidence, not by brand names or frameworks.

3. How to Use This Section

| Your Goal | Start Here |
| --- | --- |
| Understand conformance model | Conformance Model |
| Understand evidence structure | Evidence Model |
| See evaluation axes | Evaluation Dimensions |
| Understand result semantics | Results & Status |
| See future plans | Roadmap |
| Practical evaluation steps | Conformance Guide |
| Validate with Golden Flows | Golden Flows |

4. Core Principles

4.1 Evidence-First

Conformance is judged by exported evidence, not runtime inspection:

| Evidence Type | Purpose |
| --- | --- |
| Plan | Intent declaration |
| Confirm | Governance gate record |
| Trace | Execution history |
| Snapshot | State checkpoint |
| Manifest | Configuration declaration |
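
As a minimal sketch of how an evaluator might take inventory of an export, the snippet below lists which of the five evidence types from the table above are absent. The bundle layout (a dict keyed by lowercase evidence type name) and the function name `missing_evidence_types` are assumptions made for illustration; they are not defined by MPLP.

```python
# Evidence types and purposes, copied from the table above.
EVIDENCE_PURPOSES = {
    "plan": "Intent declaration",
    "confirm": "Governance gate record",
    "trace": "Execution history",
    "snapshot": "State checkpoint",
    "manifest": "Configuration declaration",
}


def missing_evidence_types(bundle: dict) -> list[str]:
    """Return evidence types that are absent or empty in an exported bundle.

    Assumption: the bundle is a dict keyed by lowercase evidence type name.
    """
    return [name for name in EVIDENCE_PURPOSES if not bundle.get(name)]


# Hypothetical export: confirm and snapshot are absent, manifest is empty.
bundle = {"plan": [{"id": "plan-1"}], "trace": [{"id": "trace-1"}], "manifest": {}}
print(missing_evidence_types(bundle))  # ['confirm', 'snapshot', 'manifest']
```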

4.2 Vendor Neutral

Evidence does not depend on proprietary runtime features to be interpretable. Any conformant runtime MUST produce evidence that:

  • Uses MPLP JSON Schemas
  • Is self-describing (contains protocol version)
  • Is replayable (trace can reconstruct timeline)
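
The self-describing and replayable properties could be checked roughly as follows. This is a sketch under stated assumptions: the `meta.protocolVersion` path is the one named in this section's checklist, while the `timestamp` and `id` fields on trace segments, the ISO 8601 timestamp format, and the sample version string are hypothetical. Schema validation against the MPLP JSON Schemas is not shown.

```python
from datetime import datetime


def is_self_describing(artifact: dict) -> bool:
    """True if the artifact declares a non-empty protocol version string."""
    meta = artifact.get("meta") or {}
    version = meta.get("protocolVersion")
    return isinstance(version, str) and version != ""


def reconstruct_timeline(segments: list[dict]) -> list[dict]:
    """Order trace segments by timestamp; raise ValueError if one has none.

    Assumption: each segment carries an ISO 8601 "timestamp" string.
    """
    def sort_key(segment: dict) -> datetime:
        ts = segment.get("timestamp")
        if ts is None:
            raise ValueError(f"segment {segment.get('id')!r} has no timestamp")
        return datetime.fromisoformat(ts)

    return sorted(segments, key=sort_key)


# Hypothetical trace exported out of order.
trace = [
    {"id": "seg-2", "timestamp": "2025-01-01T10:05:00+00:00"},
    {"id": "seg-1", "timestamp": "2025-01-01T10:00:00+00:00"},
]
print(is_self_describing({"meta": {"protocolVersion": "1.0.0"}}))  # True
print([s["id"] for s in reconstruct_timeline(trace)])  # ['seg-1', 'seg-2']
```

A segment without a timestamp raises `ValueError` here, on the reasoning that a timeline cannot be reconstructed from it.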

4.3 Non-Certification

MPLP does not provide:

  • Certification programs
  • Compliance badges
  • Runtime endorsements

Conformance is a categorical outcome with three possible values (conformant / non-conformant / incomplete-evidence), not a score or certification level.
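
One way to model the three outcomes named above is an enumeration plus a rule for combining per-check results. The combining rule is an assumption for illustration and is not taken from the MPLP text: a failed check gives non-conformant, otherwise any missing evidence gives incomplete-evidence, and an evaluation with no checks at all is also treated as incomplete-evidence.

```python
from enum import Enum


class Outcome(Enum):
    CONFORMANT = "conformant"
    NON_CONFORMANT = "non-conformant"
    INCOMPLETE_EVIDENCE = "incomplete-evidence"


def overall_outcome(check_results: list[Outcome]) -> Outcome:
    """Fold per-check outcomes into one overall outcome.

    Assumed precedence (hypothetical, not defined by MPLP):
    non-conformant > incomplete-evidence > conformant.
    """
    if Outcome.NON_CONFORMANT in check_results:
        return Outcome.NON_CONFORMANT
    if not check_results or Outcome.INCOMPLETE_EVIDENCE in check_results:
        return Outcome.INCOMPLETE_EVIDENCE
    return Outcome.CONFORMANT


results = [Outcome.CONFORMANT, Outcome.INCOMPLETE_EVIDENCE]
print(overall_outcome(results).value)  # incomplete-evidence
```

No numeric score is computed at any point; the result is always exactly one of the three values.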

5. Minimal Conformance Checklist

Use this as a quick sanity check before deeper evaluation:

| # | Check | Evidence Source |
| --- | --- | --- |
| 1 | Protocol version declared | meta.protocolVersion in artifacts |
| 2 | Plan → Confirm → Trace chain exists | Linked IDs across objects |
| 3 | High-risk actions gated | Confirm objects for gated steps |
| 4 | Replayability | Trace segments with timestamps |
| 5 | Bounded failure | Recovery events or safe-stop records |
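
Checks 2 and 3 in the table above could be approximated as below. All field names (`id`, `planId`, `confirmId`, `stepId`, `steps`, `highRisk`) are hypothetical placeholders chosen for this sketch; the actual linkage fields are defined by the MPLP JSON Schemas and may differ.

```python
def unlinked_traces(plans: list[dict], confirms: list[dict], traces: list[dict]) -> list[str]:
    """Return IDs of traces that do not link to a plan through a confirm.

    Assumption: confirms reference plans via "planId" and traces reference
    confirms via "confirmId".
    """
    plan_ids = {p["id"] for p in plans}
    valid_confirm_ids = {c["id"] for c in confirms if c.get("planId") in plan_ids}
    return [t["id"] for t in traces if t.get("confirmId") not in valid_confirm_ids]


def ungated_high_risk_steps(plans: list[dict], confirms: list[dict]) -> list[str]:
    """Return IDs of high-risk plan steps that have no Confirm object.

    Assumption: steps are flagged with "highRisk" and confirms name the
    gated step via "stepId".
    """
    gated = {c.get("stepId") for c in confirms}
    return [
        step["id"]
        for plan in plans
        for step in plan.get("steps", [])
        if step.get("highRisk") and step["id"] not in gated
    ]


# Hypothetical evidence: one dangling trace and one ungated high-risk step.
plans = [{"id": "plan-1", "steps": [
    {"id": "step-1", "highRisk": False},
    {"id": "step-2", "highRisk": True},
]}]
confirms = [{"id": "confirm-1", "planId": "plan-1", "stepId": "step-1"}]
traces = [
    {"id": "trace-1", "confirmId": "confirm-1"},
    {"id": "trace-2", "confirmId": "confirm-9"},
]
print(unlinked_traces(plans, confirms, traces))  # ['trace-2']
print(ungated_high_risk_steps(plans, confirms))  # ['step-2']
```

Both functions return the offending IDs rather than a boolean, so an evaluator can cite the specific evidence gap in a report.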

6. Conformance Documents

| Document | Purpose |
| --- | --- |
| Conformance Model | Conformance classes (L1/L2/L3) and outcomes |
| Evidence Model | What constitutes valid evidence |
| Evaluation Dimensions | 6 axes for judging conformance |
| Results & Status | Outcome semantics and reporting |
| Roadmap | Future plans and boundaries |
| Conformance Guide | Practical evaluation workflow |
| Conformance Checklist | Vendor self-verification template |

Document Status: Informative (Navigation Entry Point)
Scope: Evidence-based conformance evaluation
Exclusions: Certification, legal compliance, vendor endorsement