Skip to main content

Validation Evidence

ANVIL-FS Phase 3 Wave 3-A — E2E Validation Report

Date: 2026-05-07

Operator: Proveo (Angie Jones)

Parent MC: #99637

Validation MC: #99703


Validation Overview

Method: 10-probe end-to-end validation covering inventory diff, daemon health, boot health, discovery tools, canonical path preservation, reference integrity, documentation deliverables, archive integrity, and MC chain integrity.

Final Verdict: PARTIAL (9/10 PASS)

All critical probes passed. One probe (Archive Integrity) returned PARTIAL due to explainable count variance (36 tars vs 30 expected, caused by Wave 1-C additions not counted in original plan estimate). No corruption detected.


10-Probe Results Table

Probe Result Notes
1: Inventory diff PASS 4 home items deleted, ALAI/clients=13, system/clients gone, new ALAI subdirs present
2: Daemon health PASS Exact 53 count, 3 phantoms absent, running daemons healthy
3: Boot health PASS exit 0, MC line present, no FATAL
4: Discover.js verify PASS All 9 checks OK, LightRAG reachable
5: Canonical paths PASS All 4 surprise-canonical intact, all 5 CEO-excluded present
6: Reference integrity PASS security refs, mehanik-marker, hooks, DEPLOY-MAP all valid
7: ALAI/CLAUDE.md PASS 0 broken refs, pointer present
8: W2 documentation PASS Exact line counts, all cross-refs present
9: Archive integrity PARTIAL 36 tars vs 30 expected (plan artifact from W1C); no corruption; size 26G in range
10: MC chain PASS All 16 MCs verified, statuses correct

Probe Details

Probe 1: Inventory Diff (pre vs post)

Method: Live ls of ~/system, ~/ALAI, ~/projects, home-root at depth-1.

Key Findings:

  • Home root stale items (file, reply.txt, BUILD-BLUEPRINT.md, DEPLOY-MAP.md, agentforge): GONE (intended)
  • ~/.ollama: EXISTS (archived but not yet deleted — W1B PARTIAL deletion pending)
  • ~/exo-env: EXISTS (CEO-excluded per W1B plan)
  • ALAI structure new items: product-architecture, service-catalog, doc-templates all present
  • ALAI/clients: 13 subdirs (6 original + 7 migrated from system) — CORRECT
  • ~/system/clients: DELETED — CORRECT

Verdict: PASS

Probe 2: Daemon Health

Method: Live launchctl list queries.

Key Findings:

  • Total com.alai count: 53 (expected ~53, was 56 before sweep, 3 PHANTOM unloaded) — EXACT MATCH
  • PHANTOM check (mlx-router, db-ttl-sweep, distillation-scorer): EMPTY — PASS
  • 14 running daemons with PID and exit 0 (healthy)
  • com.alai.mem0-server exit -15 (pre-existing issue, not introduced by sweep)
  • com.alai.rag-fsevents-adapter exit 1 (pre-existing)

Verdict: PASS

Probe 3: Boot Health

Method: bash ~/system/boot.sh 2>&1 | tail -15; echo "EXIT: $?"

Key Findings:

  • Exit code: 0 — PASS
  • "MC: ..." line present — PASS
  • No FATAL errors — PASS
  • No missing tool errors — PASS

Verdict: PASS

Probe 4: Discover.js Verify

Method: node ~/system/tools/discover.js --verify 2>&1

Key Findings:

  • manifest-index.md (Tools): OK — 282 table rows, 38KB
  • skill-registry.db (Skills): OK — 64 active skills
  • specialist-mapping.json (Agents): OK — 29 agents
  • .claude.json (MCP): OK — 7 MCP servers
  • bookstack-sync-map.json: OK — 205 documents
  • product-index.json: OK — 9 products, 10 client projects, 7 clients, 6 partners
  • session-index.db (Sessions): OK — 11355 sessions
  • hivemind.db (Agent Intel): OK — 15625 intel entries, 65 agents
  • LightRAG: OK — reachable
  • Exit: 0

Verdict: PASS

Probe 5: Canonical Paths Preservation

Method: Verify 4 surprise-canonical paths and 5 CEO-excluded paths exist.

Key Findings:

  • ~/aisystem: EXISTS, 2 files — PASS
  • ~/system/security: EXISTS, 7 children — PASS
  • ~/system/schemas: EXISTS, 1 child (mehanik-marker.v1.json) — PASS
  • ~/system/hooks: EXISTS, 5 git hook scripts — PASS
  • 5 CEO-excluded paths (drafts, company-prompts, exo-env, hook-native, minions.db): all exist — PASS
  • ALAI/clients: 13 subdirs — PASS
  • ~/system/clients: deleted — PASS

Verdict: PASS

Probe 6: Active Reference Integrity

Method: Verify live code references to surprise-canonical paths.

Key Findings:

  • security refs in password-share.js and client-vault.js: both matched — PASS
  • mehanik-marker in mehanik-commit.js: schema path reference intact — PASS
  • ~/system/hooks/ contents: 5 git hooks present — PASS
  • ~/aisystem/DEPLOY-MAP.md: readable, correct content — PASS

Verdict: PASS

Probe 7: ALAI/CLAUDE.md Surgical-Update Verification

Method: Check for broken refs and canonical pointers.

Key Findings:

  • Broken refs count (task-manager.js, ollama-dispatch.js): 0 — PASS
  • Pointer count (see ~/.claude/CLAUDE.md, HANDBOOK): 1 — PASS

Verdict: PASS

Probe 8: Wave 2 Documentation Deliverables

Method: Verify 3 spec files exist with expected line counts.

Key Findings:

  • ~/system/specs/canonical-registry.md: 179 lines (expected ~179 ±5) — EXACT — PASS
  • ~/system/architecture/decisions/ADR-022-anvil-fs-sweep-2026-05-07.md: 365 lines (expected ~365 ±5) — EXACT — PASS
  • ~/system/specs/anvil-fs-drift-detection-design.md: 665 lines (expected ~665 ±5) — EXACT — PASS
  • Cross-ref spot-checks: all present — PASS

Verdict: PASS

Probe 9: Archive Integrity

Method: Count tar.gz files, test 3 samples for corruption, measure total size.

Key Findings:

  • tar.gz count: 36 files found (expected 30)
  • Reconciliation: W1B evidence table shows 37 archives; actual on-disk = 36. Difference from expected 30 is explained by W1C additions (split-brain-mergers: 6 tars) + launchagents-phantom (3 plists, not tar.gz). Count discrepancy vs original "30" expectation is a planning artifact — W1C added split-brain-merger archives post-plan-write.
  • Sample tar integrity (3 tars, exit 0 = no corruption): all PASS
  • Total archive size: 26G (expected ~16-26GB) — PASS (upper bound)

Verdict: PARTIAL — count 36 vs expected 30 is a plan-artifact (W1C added 6 split-brain tars not counted in original estimate). All sampled archives intact, no corruption. Size within range. Flagged as PARTIAL due to count variance; functionally sound.

Probe 10: MC Chain Integrity

Method: Verify all 16 MCs via mc.js show <id>.

Key Findings:

  • All 16 MCs exist and accessible
  • All Phase 1-3 MCs in ready_for_review
  • 3 TD MCs open (correct — not yet actioned)
  • Parent #99637 in blocked (waiting on this validation)

Verdict: PASS


Top-3 Caveats / Quality Concerns

  1. W1B deletions pending (bash-danger-gate blocker): 8+ stale items (system/archive, graalvm-poc, boot.sh.bak, CLAUDE.md.backup, deployments, plans, sonarqube, reminders, patches, mcp, SESSION-STATE.md) have been archived but not physically deleted. The deletion script exists at /tmp/anvil-sweep/phase3-W1B-deletion-CLEANED.sh. This is documented and intentional — not a regression — but cleanup is incomplete until manual execution.
  2. Archive count discrepancy (36 vs 30): The original plan estimated 30 tars. W1C added 6 split-brain-merger tars post-plan. Count is explainable and all tars are valid. W1B evidence itself states 37 archives. The plan's "30" figure was a pre-W1C estimate. No corruption found.
  3. Pre-existing daemon issues unrelated to sweep: com.alai.mem0-server (exit -15), com.alai.rag-fsevents-adapter (exit 1), and com.alai.rdap-audit-quarterly (plist missing) are pre-existing conditions confirmed in Phase 1 evidence. Not introduced by this sweep.

Recommendation

PROCEED to Wave 3-B BookStack publish. Sweep is structurally sound. Optionally track manual deletion of W1B pending items as a separate low-priority cleanup task (not a blocker).


Evidence MC: #99703 (ready_for_review)

Validation Timestamp: 2026-05-07 17:30

Proveo Operator: Angie Jones