
[!FROZEN] MPLP Protocol v1.0.0 Frozen Specification

  • Freeze Date: 2025-12-03
  • Status: FROZEN (no breaking changes permitted)
  • Governance: MPLP Protocol Governance Committee (MPGC)
  • License: Apache-2.0
  • Note: Any normative change requires a new protocol version.

Monitoring Guide

1. Purpose

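This guide defines observability standards for MPLP production systems: the metrics to collect, the health checks to expose, logging conventions, alert conditions, and recommended dashboard panels.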

2. Key Metrics

2.1 Runtime Metrics

| Metric | Description | Alert Threshold |
| --- | --- | --- |
| mplp_plan_execution_duration_ms | Plan execution time | > 30 s (30,000 ms) |
| mplp_step_failure_rate | Step failure percentage | > 5% |
| mplp_llm_token_usage | LLM token consumption | Budget-based |
| mplp_psg_operations_per_sec | PSG read/write ops | > 1000/s |
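
As an illustration of how these metrics might be collected, the following is a minimal in-memory sketch. The `RuntimeMetrics` class and its method names are hypothetical and not part of the MPLP specification; only the metric names come from the table above. A production system would more likely export these values through its existing metrics backend, and `mplp_psg_operations_per_sec` is omitted here for brevity.

```python
import time
from contextlib import contextmanager
from dataclasses import dataclass, field


@dataclass
class RuntimeMetrics:
    """Hypothetical in-memory recorder for the runtime metrics listed above."""

    plan_durations_ms: list = field(default_factory=list)
    steps_total: int = 0
    steps_failed: int = 0
    llm_tokens: int = 0

    @contextmanager
    def time_plan(self):
        """Time one plan execution and store its duration in milliseconds."""
        start = time.perf_counter()
        try:
            yield
        finally:
            elapsed_ms = (time.perf_counter() - start) * 1000.0
            self.plan_durations_ms.append(elapsed_ms)

    def record_step(self, ok: bool) -> None:
        """Count one executed step and whether it failed."""
        self.steps_total += 1
        if not ok:
            self.steps_failed += 1

    def record_tokens(self, count: int) -> None:
        """Add LLM tokens consumed by one call."""
        self.llm_tokens += count

    def snapshot(self) -> dict:
        """Return current values keyed by the metric names from the table."""
        if self.steps_total:
            failure_rate = 100.0 * self.steps_failed / self.steps_total
        else:
            failure_rate = 0.0
        return {
            "mplp_plan_execution_duration_ms": list(self.plan_durations_ms),
            "mplp_step_failure_rate": failure_rate,  # percentage, 0-100
            "mplp_llm_token_usage": self.llm_tokens,
        }
```

A caller would wrap each plan run in `with metrics.time_plan():` and call `record_step` and `record_tokens` as steps complete.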

2.2 Health Checks

| Check | Endpoint | Expected |
| --- | --- | --- |
| Runtime alive | /health | 200 OK |
| Schema validation | /health/schema | 200 OK |
| LLM connectivity | /health/llm | 200 OK |
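
One way to probe these endpoints is sketched below using only the Python standard library. The function names, the `fetch` parameter, and the base URL in the usage note are assumptions for illustration; only the three endpoint paths and the expected 200 status come from the table above.

```python
from urllib.error import HTTPError, URLError
from urllib.request import urlopen

# Endpoint paths from the health check table; the dictionary keys are illustrative.
HEALTH_ENDPOINTS = {
    "runtime": "/health",
    "schema": "/health/schema",
    "llm": "/health/llm",
}


def http_status(url: str, timeout: float = 5.0) -> int:
    """Return the HTTP status of a GET request, or 0 if the host is unreachable."""
    try:
        with urlopen(url, timeout=timeout) as resp:
            return resp.status
    except HTTPError as err:
        # Non-2xx responses raise HTTPError; the status code is still meaningful.
        return err.code
    except (URLError, OSError):
        return 0


def check_health(base_url: str, fetch=http_status) -> dict:
    """Map each check name to True if its endpoint answered 200 OK."""
    base = base_url.rstrip("/")
    return {
        name: fetch(base + path) == 200
        for name, path in HEALTH_ENDPOINTS.items()
    }
```

For example, `check_health("http://localhost:8080")` (a hypothetical address) would return a dictionary such as `{"runtime": True, "schema": True, "llm": False}` when the LLM check fails. The `fetch` parameter exists so the probe can be replaced with a stub in tests.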

3. Logging

3.1 Log Levels

| Level | Usage |
| --- | --- |
| ERROR | Failures requiring attention |
| WARN | Degraded performance |
| INFO | Normal operations |
| DEBUG | Troubleshooting (dev only) |

3.2 Trace Correlation

All logs should include:

  • trace_id
  • context_id
  • plan_id
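
The correlation fields above can be attached to every log line with Python's standard `logging` module, as in the sketch below. The JSON layout, the logger name `mplp.runtime`, and the helper names are assumptions made for illustration; the specification only requires that the three identifiers be present.

```python
import json
import logging

# Field names required by the trace correlation list above.
CORRELATION_FIELDS = ("trace_id", "context_id", "plan_id")


class CorrelationFormatter(logging.Formatter):
    """Render each record as one JSON object that carries the correlation IDs."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "message": record.getMessage(),
        }
        for name in CORRELATION_FIELDS:
            # A missing field is emitted as null so gaps are visible downstream.
            payload[name] = getattr(record, name, None)
        return json.dumps(payload)


def get_logger(trace_id: str, context_id: str, plan_id: str) -> logging.LoggerAdapter:
    """Return a logger adapter that injects the IDs into every record."""
    base = logging.getLogger("mplp.runtime")  # hypothetical logger name
    extra = {
        "trace_id": trace_id,
        "context_id": context_id,
        "plan_id": plan_id,
    }
    return logging.LoggerAdapter(base, extra)
```

To use it, attach a handler whose formatter is `CorrelationFormatter()` to the base logger, then log through the adapter returned by `get_logger` for the duration of a plan.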

4. Alerts

| Alert | Condition | Action |
| --- | --- | --- |
| High Latency | p99 > 30s | Scale or investigate |
| Error Rate | > 5% | Check logs |
| Token Budget | > 80% | Review usage |
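
The alert conditions above could be evaluated as in the following sketch. It assumes that latency is measured per plan execution in milliseconds, that "Error Rate" refers to the step failure percentage, and that p99 is computed with the nearest-rank method; none of these choices is fixed by this guide, and the function names are hypothetical.

```python
import math


def percentile(values, pct: int):
    """Nearest-rank percentile: the smallest value with at least pct% at or below it."""
    if not values:
        raise ValueError("percentile of empty sequence")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def evaluate_alerts(durations_ms, steps_failed, steps_total, tokens_used, token_budget):
    """Return (alert, action) pairs for every condition that currently fires."""
    alerts = []
    # High Latency: p99 above 30 s, expressed here in milliseconds.
    if durations_ms and percentile(durations_ms, 99) > 30_000:
        alerts.append(("High Latency", "Scale or investigate"))
    # Error Rate: more than 5% of steps failed.
    if steps_total and 100.0 * steps_failed / steps_total > 5.0:
        alerts.append(("Error Rate", "Check logs"))
    # Token Budget: more than 80% of the configured budget consumed.
    if token_budget and 100.0 * tokens_used / token_budget > 80.0:
        alerts.append(("Token Budget", "Review usage"))
    return alerts
```

All three comparisons are strict, so a value exactly at a threshold (for example, a 5% error rate) does not raise an alert.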

5. Dashboards

Recommended dashboard panels:

  • Plan execution success/failure rate
  • Step latency histogram
  • LLM token usage over time
  • Error trends

© 2025 Bangshi Beijing Network Technology Limited Company. Licensed under the Apache License, Version 2.0.