V4.3 addresses the gaps V4.2 surfaces: no data on whether the new commands are being used, no way to verify the squad is installed correctly, and learnings trapped per-project instead of flowing across projects. Three themes shipped in parallel — Observability, Integrity, Cross-Project Memory — plus plugin packaging to reduce install friction. Phase 4 (monorepo-aware /fleet) is deferred pending Phase 1 telemetry signal. The guiding rule: before adding more surface area, know what’s installed, what’s working, and what’s being used.
-
hooks/squad-telemetry.js — PostToolUse event recorder
Records squad events to ~/.claude/projects/<sanitized-cwd>/memory/squad-metrics.jsonl. Captured events: verdict (APPROVE / REVISE / BLOCK from Nando, CONFIRM / CHALLENGE from Emily), auto-fix-applied (per-worker SUCCESS or abort-reason from /review-auto), auto-fix-round (via chore(auto-fix): round N commit detection), fleet-shard-complete (via worktree removal detection), command-invoked for /handoff (.claude/handoff.md writes) and /ui-iterate (ui-iterations report writes). Every event carries ts, session_id, project, cwd. Fail-open on any I/O error — the hook never blocks tool use.
-
/squad-metrics — summarize usage from telemetry
Aggregates across all squad-metrics.jsonl files with period filter (week / month / all) and per-project filter. Produces invocation counts, verdict distribution, /review-auto rounds-to-APPROVE histogram + worker success rates, /fleet shard completion counts, top projects by invocation + REVISE rate. Interprets the numbers in a Signals section: closed-loop effectiveness (percent of REVISEs closed in 1 auto-fix round), command adoption rates, verdict churn signals. --json flag for structured output; absent telemetry shows a clear “nothing to summarize yet” with wiring instructions.
-
/squad-health — installation integrity audit
Checklist-style audit of the 29 canonical agent files, 21+ commands, 3 hooks, 2 project-rules, 4 squad-patterns, templates, settings.json validity, hook wiring, project-local .review-squad/ state, version drift against upstream SHA, and gh auth active account. --fix auto-repairs safe issues (creates missing dirs, adds .review-squad/ to .gitignore). Never mutates settings.json — always directs user to manual fix for hook wiring drift. Default output is failure-focused; --verbose includes passing checks.
-
.claude-plugin/plugin.json — plugin manifest
Declares version 4.2.0, author, homepage, keywords, and component directories (agents, commands, hooks, templates, project-rules, squad-patterns). install.destinations map each source directory to its target under ~/.claude/. install.preserve protects custom-* agents from being overwritten. install.post-install documents which hooks need manual wiring into settings.json PostToolUse.
-
.claude-plugin/marketplace.json — marketplace entry
Exposes the repo as Corye-CIC-Review-Squad marketplace with a single plugin entry pointing at github:Corye-CIC/Review_Squad. Users add via claude plugin add + claude plugin install instead of manual curl + copy.
-
README Installation — Option A (plugin) + Option B (manual)
Plugin install is the recommended path for fresh setups: two commands install the entire squad. Manual setup remains documented as Option B for users who want per-file control. Both paths converge on the same hook-wiring step (settings.json PostToolUse).
-
New
squad-patterns/ directory (4 seeded files)
Patterns that recur across 2+ projects get promoted from per-project .review-squad/<project>/agent-notes/ to the global ~/.claude/squad-patterns/ library. Seeded from review data across campaign-management (288 learning lines / 27 rounds), llama.cpp (54 / 243), and SubAgents (173 / 808) projects — total 515 labeled learning lines. Tier A (domain): three classes identified — type safety / schema mismatch, unimplemented-but-promised features, null-safety + error-path gaps — each with citations, severity classification, plan/implement/review-time checks. Tier B (tooling): agent mode-specificity, cold-start edges, output routing gaps, prescriptive-vs-descriptive rule enforcement.
-
verdict-trends.md — distribution patterns by project type
Schema-heavy projects run BLOCK 35% / REVISE 45% / APPROVE 20%; low-level systems projects run BLOCK 25% / REVISE 30% / APPROVE 45%; agent-tooling projects run high REVISE churn with low BLOCK rate. Anomaly patterns included: first-pass APPROVE on schema change is suspicious (reviewer likely missed consumers); REVISE loop that doesn’t converge is design-level, should de-escalate to /consult; BLOCK on agent tooling is rare and load-bearing — avoid /review-auto.
-
auto-fix-patterns.md — classifier decision flow
Defines which finding classes route to NIT (95% worker success — formatting, unused imports, missing returns), MUST-FIX-SAFE (80% success — null guards, missing await, type narrowing), MUST-FIX-RISKY (always surface — Jared-flagged, multi-file, reviewer uncertainty), BLOCKER (always surface with recurring-blocker class reference). Decision flow encoded for /review-auto Step 2. Includes a telemetry-driven refinement plan: success-rate thresholds that promote / demote classes, triggered by /squad-learnings (future) reading accumulated metrics.
-
PM Cory agent updates — global pattern awareness
All three PM Cory review-cycle files updated.
pm-cory-early.md (discuss/research/plan): loads context from both .review-squad/<project>/ AND ~/.claude/squad-patterns/; cites matching patterns by name during planning so issues are preempted. pm-cory-consult.md: checks recurring-blockers.md and tooling-patterns.md at consult start. pm-cory-review.md: references matching global patterns in the review output, strengthening the receipt for challenges and teaching the pattern simultaneously.
-
/update-reviewsquad extended sync coverage
Adds squad-patterns/*.md to both first-run and incremental sync modes, with destination ~/.claude/squad-patterns/. Includes hooks/squad-telemetry.js in the tracked hook list so the new telemetry hook is pulled on every update. Without these additions, fresh installs would miss the pattern library and the telemetry recorder.
-
Decision gate: 3+ real
/fleet runs per month triggers the build
Phase 4 — extending /fleet to handle phases spanning multiple workspaces in a monorepo (pnpm workspaces, turbo.json, nx.json, lerna) — was designed but NOT shipped. The decision gate: build only if Phase 1 telemetry shows /fleet used on at least 3 refactors per month. Rationale: /fleet is the most complex existing command and the riskiest to extend; the community pattern (5-30 shards with worktree isolation) is aspirational but may not match actual usage. Reassess in 4–6 weeks once squad-metrics.jsonl has data.
-
Parked items
Explicitly not building in V4.3: cost estimator (waiting for Anthropic stable token-cost API), SDK-backed command rewrites (markdown templates still scaling fine), multi-account squad variants (complexity without clear payoff).