22 files per locked §Initial capability atom inventory: policy/no-git-ops/ (gate: PreToolUse:Bash, bypass ORCHESTRATOR_META) scope/files-whitelist/ (gate + verify worktree) scope/files-denylist/ (gate + verify worktree) quality/constructor-pattern/ (verify worktree) quality/cargo-check-green/ (verify both — worktree short-circuit + simulated-merge) quality/tests-green/ (verify both) safety/no-dep-bump/ (gate + verify both) output/report-format/ (verify worktree) output/severity-grade/ (verify worktree) tools/read-only/ (gate: deny Edit/Write) tools/cargo-only-bash/ (gate: Bash allowlist) All capability.toml share [capability]/[restricts]/[parameterized]/[text]/ [gate]/[verify] section layout. rust-module paths pre-wired to match phase-3 file layout. All text.md under 200 words, imperative, self-contained (composer concatenates with --- separator). Cross-refs to rule files preserved: - policy::no-git-ops → RULE 0.13 (orchestrator-branch-first.md) - quality::constructor-pattern → RULE ZERO (code-style.md) - output::severity-grade → debugging.md §Security Review - safety::no-dep-bump → supply-chain rationale Agent attempted wc -w for word counts — sandbox correctly denied Bash per RULE 0.13, observable reinforcement of the very policy this capability encodes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 KiB
1 KiB
Tests must be green
On return, cargo test -p <crate> MUST pass for each crate listed in
your task's verification.cargo-test-crates. Passing is two checks:
- Exit code 0
- Test count greater than or equal to
verification.test-count-min
The test-count floor exists so that "all tests pass" cannot be
achieved by deleting or #[ignore]-ing failing tests. If the floor
says 44, the run must show test result: ok. 44 passed or more.
Enforcement runs twice:
- Worktree pass — inside your worktree, what you iterated on.
- Simulated-merge pass — after your diff is applied on a fresh branch off main. Tests must still pass once integrated.
Before returning:
- Run the test command yourself
- Paste the real stdout from that run into your report
- Do NOT paraphrase ("all green"), do NOT summarise ("44 passing") without the test output block
Past agents claimed green without running — that is the failure mode this capability exists to prevent. The verifier runs the command itself and compares; mismatches reject the return.