Skip to content

SWARM — Open-Source Multi-Agent AI Safety Framework

Self-Modification Governance Implementation Checklist

swarm-ai-safety/swarm

Self-Modification Governance Implementation Checklist¶

This checklist translates the governance architecture in self-modification-governance-byline.md into concrete engineering work items.

Scope and assumptions¶

Runtime: SWARM orchestrator + governance engine.
Goal: ship a minimally auditable two-gate modification loop before adding full compositional simulation and advanced rollout automation.
Current calibration proxy mapping:
tau_min proxy -> governance.refinery_p_threshold
K_max proxy -> governance.memory_write_rate_limit_per_epoch

Phase 0: Hardening prerequisites¶

Define trust boundaries in code ownership:
immutable governance policy surfaces
mutable agent/runtime surfaces
Add signed policy bundle loading path (hash + signer + version).
Add policy-hash and artifact-hash fields to run metadata.
Add failure mode: any attestation mismatch blocks promotion.

Phase 1: Byline provenance foundation¶

Phase 2: Gate 1 (`tau_min`) implementation¶

Phase 3: Gate 2 (`K_max`) implementation¶

Phase 4: Deterministic risk-tier classifier¶

Phase 5: Promotion workflow and rollout safety¶

Phase 6: Calibration and reproducibility¶

Phase 7: Release criteria¶

Byline completeness >= 99.9% for modification events.
Deterministic replay success >= 95% on sampled events.
Mean rollback latency < 10 minutes in fault-injection tests.
No unresolved critical governance incident older than 24 hours.
Documentation updated:
architecture doc
operator runbook
calibration instructions

Current calibration snapshot¶

Latest run (seeded sweep):

Artifacts:
runs/20260214-020518_tau_k_calibration/runs.csv
runs/20260214-020518_tau_k_calibration/summary.json
runs/20260214-020518_tau_k_calibration/recommendation.json
Recommended values from that run:
tau_min = 0.55
K_max = 6

Reproduce:

python scripts/calibrate_tau_k_memory.py