refactor(kei-mcp): v0.46 — decompose safe_tools + fix CRITICAL Grok bypass

ARCHITECTURAL FIXES (Constructor Pattern — file >200 LOC):

1. safe_tools.rs (738 LOC god-object) → safe_tools/ module (5 files):
   - mod.rs (99 LOC) — descriptors + dispatch
   - env_guard.rs (79 LOC) — KillPgGuard RAII + apply_safe_env
   - path_guard.rs (166 LOC) — validate_path + canonicalize walk-up
   - chain_runner.rs (159 LOC) — hook chain loader/runner
   - exec.rs (222 LOC) — handle_bash/edit/write with O_NOFOLLOW

2. CRITICAL Grok bypass closed (Claude critic finding):
   - REMOVED env-based chain skip (CLAUDECODE / GROKCODE checks)
   - The skip assumed native PreToolUse would catch the call, but
     PreToolUse matchers fire on tool_name="Bash"|"Edit"|"Write" while
     MCP tools are named kei_bash/kei_edit/kei_write — so native hooks
     NEVER fire on MCP tool calls. The skip created an auth-bypass hole.
   - Chain now ALWAYS runs for kei_bash/kei_edit/kei_write.
   - Wire scripts (kei-mcp-wire-claude.sh + -grok.sh) updated: empty env
     block + comment explaining v0.46 rationale.

3. Fail-closed defaults (architecturally correct, not bandaid):
   - validate_path: empty allowed_roots() → ERROR (was silent disable)
   - load_chain: missing/empty section → ERROR unless
     KEI_POLICY_CHAIN_OPTIONAL=1

4. RAII guard for process-group cleanup:
   - KillPgGuard fires killpg on ANY exit path (success, error, timeout,
     panic) until explicitly disarmed. Replaces error-path-only killpg.

5. validate_path moved off tokio worker via spawn_blocking — was
   blocking syscalls in async context.

VERIFIED:
- cargo build --release → clean
- cargo test -p kei-mcp --release → 2 passed
- MCP smoke: chain fires under CLAUDECODE=1, GROKCODE=1, and no env
  (all three previously skipped; all three now block kei_bash on
  forbidden git push patterns).
- Safe commands still pass (kei_bash echo HELLO → HELLO returned).

README: substrate counts refreshed (105→110 Rust crates, v0.45→v0.46).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
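Reviewer note on item 2: the bypass can be modeled in a few lines. This is a rough sketch, not code from the crate. It assumes a native PreToolUse matcher behaves like an exact match against `|`-separated tool names (a simplification), and the function names `native_hook_matches`, `chain_runs_v045`, `chain_runs_v046` and `ungated` are hypothetical, introduced only for illustration.

```rust
/// Simplified model (assumption): a native PreToolUse matcher fires only when
/// the tool name equals one of the `|`-separated alternatives.
fn native_hook_matches(matcher: &str, tool_name: &str) -> bool {
    matcher.split('|').any(|m| m == tool_name)
}

/// Pre-v0.46 behavior as described in the commit message: the MCP-side chain
/// was skipped whenever a CLI env flag (CLAUDECODE / GROKCODE) was set.
fn chain_runs_v045(claudecode: bool, grokcode: bool) -> bool {
    !(claudecode || grokcode)
}

/// v0.46 behavior: the env flags are ignored and the chain always runs.
fn chain_runs_v046(_claudecode: bool, _grokcode: bool) -> bool {
    true
}

/// A call is ungated when neither the native hooks nor the MCP chain see it.
fn ungated(matcher: &str, tool_name: &str, chain_runs: bool) -> bool {
    !native_hook_matches(matcher, tool_name) && !chain_runs
}
```

Under this model, `ungated("Bash|Edit|Write", "kei_bash", chain_runs_v045(true, false))` is true: the native matcher never sees `kei_bash`, and the env flag switched the chain off. With `chain_runs_v046` the same call is always gated by the chain.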
2026-05-27 14:00:16 +08:00 · 2026-05-27 14:00:16 +08:00 · 845f7f9ba1
commit 845f7f9ba1
parent b54b84ad45
12 changed files with 738 additions and 748 deletions
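Reviewer note on item 4 of the commit message: the new env_guard.rs is not shown in this excerpt, so the following is only a minimal sketch of the RAII idea it describes. `PgGuard` and `run_step` are hypothetical names, and the kill action is injected as a closure (an assumption made to keep the sketch std-only) where the real guard presumably signals the process group directly.

```rust
use std::cell::RefCell;

/// Hypothetical stand-in for the KillPgGuard named in the commit message.
/// While armed, dropping the guard invokes `kill` with the process-group id.
struct PgGuard<F: FnMut(u32)> {
    pgid: Option<u32>,
    kill: F,
}

impl<F: FnMut(u32)> PgGuard<F> {
    fn new(pgid: u32, kill: F) -> Self {
        PgGuard { pgid: Some(pgid), kill }
    }

    /// Explicitly disarm: after this, drop does nothing.
    fn disarm(&mut self) {
        self.pgid = None;
    }
}

impl<F: FnMut(u32)> Drop for PgGuard<F> {
    fn drop(&mut self) {
        // Runs on every exit path: normal return, early `?`/return, unwinding panic.
        if let Some(pgid) = self.pgid.take() {
            (self.kill)(pgid);
        }
    }
}

/// Illustrative caller: records which group ids would have been killed.
fn run_step(pgid: u32, fail: bool, disarm: bool, log: &RefCell<Vec<u32>>) -> Result<(), String> {
    let mut guard = PgGuard::new(pgid, |p| log.borrow_mut().push(p));
    if fail {
        return Err("step failed".to_string()); // guard still fires here
    }
    if disarm {
        guard.disarm();
    }
    Ok(())
}
```

Compared with the removed code, which called `killpg_best_effort` only in the timeout arm, a drop-based guard cannot be forgotten on a newly added early return. The cost is that callers must remember to disarm it if the process group is meant to outlive the call.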
--- a/README.md
+++ b/README.md
@@ -46,9 +46,9 @@ sleep consolidates 30-session windows into morning markdown reports.
  updates, agent regeneration, DNA index refresh, keimd graph
  reindex. Auto-self-indexing via kei-registry SQLite.

-## By the numbers (v0.45)
+## By the numbers (v0.46)

-105 Rust crates · 69 skills · 54 hooks · 38 agent manifests ·
+110 Rust crates · 69 skills · 54 hooks · 38 agent manifests ·
 86 substrate blocks · 18 capability atoms · 7 substrate roles ·
 565 indexed DNAs · 6 install profiles (minimal → full).

--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools.rs
@@ -1,738 +0,0 @@
-//! Phase C — cross-CLI hook enforcement via MCP-wrapped tools.
-//!
-//! Exposes three built-in MCP tools — `kei_bash`, `kei_edit`, `kei_write` —
-//! that synthesize Claude Code's PreToolUse hook input contract, chain
-//! through the hook scripts declared in `~/.claude/hooks/_lib/policy-chain.toml`,
-//! and only execute the wrapped action if every hook returns exit 0.
-//!
-//! Why this exists: when an agent runs on Grok / Agy / Copilot / Kimi, none
-//! of our claude-side PreToolUse hooks fire. The agent could read the rules
-//! in its system prompt but the tool-call layer was previously ungated. The
-//! `kei_*` MCP tools restore that gate for any MCP-capable CLI.
-//!
-//! Constructor Pattern: ONE policy SSoT (`policy-chain.toml`), ONE dispatcher
-//! (this file), hooks reused as-is from `~/.claude/hooks/`. No rewrite, no
-//! abstraction layer. Shell-out per hook keeps the contract identical to
-//! Claude's native PreToolUse pipeline.
-//!
-//! CLAUDECODE / GROKCODE guard — DESIGN NOTE (NOT a security boundary):
-//! When invoked from inside Claude Code (`$CLAUDECODE=1`) or Grok the chain
-//! is SKIPPED to avoid double-firing the same hooks (they already ran on the
-//! CLI's own PreToolUse). This is a perf / UX optimization for the inside-CLI
-//! call path — NOT an authorization check. An attacker who can set the
-//! parent process's environment already controls the CLI invocation anyway;
-//! re-running hooks would not stop them. To raise the bar for confused-deputy
-//! scenarios use full sandboxing (Phase D) or run kei-mcp as a separate UID.
-//!
-//! v0.41 audit fixes (2026-05-26, Gemini security review):
-//!   #1 fail-CLOSED on missing hooks (was: silently skip)
-//!   #2 path-traversal guard on kei_edit/kei_write (canonicalize + root check)
-//!   #3 CLAUDECODE bypass — documented as design (see above), no behavior change
-//!   #4 tokio::fs for async file I/O (was: blocking std::fs on tokio thread)
-//!   #5 process-group kill on Unix (was: kill_on_drop SIGKILLs only direct child)
-//!
-//! v0.42 re-audit fixes (2026-05-26, 4-CLI dogfood: Claude+Grok+Gemini+Copilot):
-//!   #1 [CRITICAL] symlink LEAF bypass — canonicalize full path + reject
-//!      leaf symlinks (v0.41 only canonicalized PARENT; ln -s ~/.ssh/keys ./x
-//!      then kei_write x followed the link to the target)
-//!   #2 [HIGH]     $HOME removed from default allowed_roots — was a blanket
-//!      allow that let agent overwrite ~/.claude/hooks (self-neuter), ~/.zshrc
-//!      (RCE on next shell), and credential stores. Default: $PWD only.
-//!      Denylist also extended with .claude/, .grok/, .gemini/, .copilot/,
-//!      .kimi/, and exact shell-init filenames.
-//!   #3 [HIGH]     empty [bash]/[edit]/[write] section also FAIL-CLOSED (was:
-//!      empty vec → pass-through). KEI_POLICY_CHAIN_OPTIONAL=1 to opt in.
-//!   #4 [MED]      load_chain converted to async + tokio::fs (was: blocking
-//!      std::fs on tokio worker thread).
-//!   #5 [MED]      set_process_group + killpg applied to HOOK subprocess too
-//!      (v0.41 only had it on the bash action; hook grandchildren orphaned).
-//!   #6 [MED]      doc note that aggregate timeout is still per-step (60s ×
-//!      N hooks + 60s action). Single-deadline implementation deferred to
-//!      v0.43 — not security-blocking.
-
-use crate::protocol::{err, ok, JsonRpcRequest, JsonRpcResponse, INTERNAL_ERROR, INVALID_PARAMS};
-use serde::Deserialize;
-use serde_json::{json, Value};
-use std::path::{Path, PathBuf};
-use std::process::Stdio;
-use std::time::Duration;
-use tokio::fs;
-use tokio::io::AsyncWriteExt;
-use tokio::process::Command;
-
-/// Per-step timeout (each hook AND the action each get up to this long).
-/// For an N-hook chain the total wall-clock cap is approximately
-/// `(N+1) * SAFE_TOOL_TIMEOUT_SECS`. v0.44 doc-honesty fix (Claude MED):
-/// prior versions claimed this was an "aggregate" cap, which was always
-/// wrong. Aggregate-deadline impl is deferred; for now the per-step
-/// semantics are documented honestly so operators pick a sane value.
-const SAFE_TOOL_TIMEOUT_SECS: u64 = 60;
-
-#[derive(Deserialize, Default)]
-struct PolicyChain {
-    #[serde(default)]
-    bash: ChainSpec,
-    #[serde(default)]
-    edit: ChainSpec,
-    #[serde(default)]
-    write: ChainSpec,
-}
-
-#[derive(Deserialize, Default)]
-struct ChainSpec {
-    #[serde(default)]
-    chain: Vec<String>,
-}
-
-/// MCP tool descriptors — appended to `tools/list` by `handlers::tools::list`.
-pub fn descriptors() -> Vec<Value> {
-    vec![
-        json!({
-            "name": "kei_bash",
-            "description": "Run a shell command after running KeiSeiKit's [bash] policy chain (no-github-push, safety-guard, destructive-guard). Blocks on hook exit 2 with the hook's stderr surfaced as the MCP error message. Use this instead of native shell on non-Claude CLIs to inherit Claude Code's safety enforcement.",
-            "inputSchema": {
-                "type": "object",
-                "properties": {
-                    "command": { "type": "string", "description": "Shell command to execute" },
-                    "cwd": { "type": "string", "description": "Optional working directory; defaults to $PWD" }
-                },
-                "required": ["command"]
-            }
-        }),
-        json!({
-            "name": "kei_edit",
-            "description": "Modify a file (replace old_string with new_string) after running KeiSeiKit's [edit] policy chain (citation-verify, numeric-claims-guard). Blocks unverified academic citations and numeric claims without evidence markers.",
-            "inputSchema": {
-                "type": "object",
-                "properties": {
-                    "file_path": { "type": "string" },
-                    "old_string": { "type": "string" },
-                    "new_string": { "type": "string" }
-                },
-                "required": ["file_path", "old_string", "new_string"]
-            }
-        }),
-        json!({
-            "name": "kei_write",
-            "description": "Write content to a file after running KeiSeiKit's [write] policy chain (citation-verify, numeric-claims-guard). Blocks unverified academic citations and numeric claims without evidence markers.",
-            "inputSchema": {
-                "type": "object",
-                "properties": {
-                    "file_path": { "type": "string" },
-                    "content": { "type": "string" }
-                },
-                "required": ["file_path", "content"]
-            }
-        }),
-    ]
-}
-
-/// Top-level dispatch entry — called from `handlers::tools::call` when the
-/// tool name matches one of the three `kei_*` built-ins.
-pub async fn dispatch_safe(req: JsonRpcRequest, name: &str, args: &Value) -> JsonRpcResponse {
-    let result = match name {
-        "kei_bash"  => handle_bash(args).await,
-        "kei_edit"  => handle_edit(args).await,
-        "kei_write" => handle_write(args).await,
-        _ => Err(format!("safe_tools dispatched unknown name: {name}")),
-    };
-    match result {
-        Ok(text) => ok(req.id, json!({
-            "content": [{ "type": "text", "text": text }],
-            "isError": false,
-        })),
-        Err(e) => err(req.id, INTERNAL_ERROR, e),
-    }
-}
-
-// ---- per-tool handlers --------------------------------------------------
-
-async fn handle_bash(args: &Value) -> Result<String, String> {
-    let command = args.get("command").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_bash", "command"))?;
-    let cwd = args.get("cwd").and_then(Value::as_str);
-
-    // v0.44 fix #8 (Gemini MED): include cwd in hook input. Without this,
-    // safety-guard could approve a destructive command (e.g. `rm -rf *`)
-    // assuming PWD, while the actual cwd arg redirected it to a sensitive
-    // dir. Hooks now see the real working directory.
-    let hook_input = json!({
-        "tool_name": "Bash",
-        "tool_input": {
-            "command": command,
-            "cwd": cwd
-        }
-    });
-    run_chain("bash", &hook_input).await?;
-
-    let mut cmd = Command::new("bash");
-    cmd.arg("-c").arg(command);
-    if let Some(dir) = cwd {
-        cmd.current_dir(dir);
-    }
-    cmd.stdin(Stdio::null())
-        .stdout(Stdio::piped())
-        .stderr(Stdio::piped())
-        .kill_on_drop(true);
-    // v0.41 fix #5: put child in its own process group so timeout kills it
-    // and ALL grandchildren together (not just the immediate shell).
-    set_process_group(&mut cmd);
-    // v0.44 fix #4 (Gemini HIGH): clear parent env on subprocess spawn.
-    // Was: child inherited AWS_*, GITHUB_TOKEN, MOONSHOT_API_KEY, etc.
-    // An agent that exec's `env` via kei_bash could exfiltrate all of them.
-    // Now: only PATH/HOME/USER/LANG/TERM/SHELL forwarded (set in helper).
-    apply_safe_env(&mut cmd);
-
-    let child = cmd.spawn().map_err(|e| format!("spawn bash: {e}"))?;
-    let pid_opt = child.id();
-    let fut = child.wait_with_output();
-
-    let out = match tokio::time::timeout(Duration::from_secs(SAFE_TOOL_TIMEOUT_SECS), fut).await {
-        Ok(Ok(o)) => o,
-        Ok(Err(e)) => return Err(format!("wait bash: {e}")),
-        Err(_) => {
-            // Timeout — kill the entire process group, not just the child.
-            if let Some(pid) = pid_opt {
-                killpg_best_effort(pid);
-            }
-            return Err("kei_bash timeout".to_string());
-        }
-    };
-
-    let stdout = String::from_utf8_lossy(&out.stdout).to_string();
-    let stderr = String::from_utf8_lossy(&out.stderr).to_string();
-    if !out.status.success() {
-        return Err(format!(
-            "bash exited {}: {}",
-            out.status.code().unwrap_or(-1),
-            stderr.trim()
-        ));
-    }
-    Ok(if stderr.is_empty() { stdout } else { format!("{stdout}\n[stderr]\n{stderr}") })
-}
-
-// v0.41 fix #5: process-group helpers (Unix-only; no-op on other platforms).
-#[cfg(unix)]
-fn set_process_group(cmd: &mut Command) {
-    cmd.process_group(0);
-}
-#[cfg(not(unix))]
-fn set_process_group(_cmd: &mut Command) {}
-
-/// v0.44 fix #4 (Gemini HIGH): strip parent env on subprocess spawn so secrets
-/// like AWS_*, GITHUB_TOKEN, MOONSHOT_API_KEY etc. don't leak to user-controlled
-/// bash commands or hook scripts. Whitelist forwards only PATH/HOME/USER/LANG/
-/// TERM/SHELL — enough to keep tools functional, none of it sensitive.
-///
-/// Override: `KEI_SAFE_ENV_EXTRA=":-separated list"` adds named vars to the
-/// whitelist for callers that legitimately need (e.g. NIX_PATH, JAVA_HOME).
-fn apply_safe_env(cmd: &mut Command) {
-    cmd.env_clear();
-    let default_keep = [
-        "PATH", "HOME", "USER", "LOGNAME", "SHELL", "LANG", "LC_ALL",
-        "LC_CTYPE", "TERM", "PWD", "TMPDIR",
-    ];
-    for k in default_keep {
-        if let Ok(v) = std::env::var(k) {
-            cmd.env(k, v);
-        }
-    }
-    if let Ok(extras) = std::env::var("KEI_SAFE_ENV_EXTRA") {
-        for k in extras.split(':') {
-            let k = k.trim();
-            if k.is_empty() { continue; }
-            if let Ok(v) = std::env::var(k) {
-                cmd.env(k, v);
-            }
-        }
-    }
-}
-
-#[cfg(unix)]
-fn killpg_best_effort(pid: u32) {
-    // SAFETY: libc::kill on a negative PID targets the process group.
-    // SIGKILL = 9. Best-effort — ignore errors (process may have exited).
-    unsafe {
-        let _ = libc::kill(-(pid as i32), libc::SIGKILL);
-    }
-}
-#[cfg(not(unix))]
-fn killpg_best_effort(_pid: u32) {}
-
-async fn handle_edit(args: &Value) -> Result<String, String> {
-    let file_path = args.get("file_path").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_edit", "file_path"))?;
-    let old_string = args.get("old_string").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_edit", "old_string"))?;
-    let new_string = args.get("new_string").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_edit", "new_string"))?;
-
-    // v0.44 LOW: reject empty old_string (would silently prepend new_string
-    // because contents.contains("") is always true).
-    if old_string.is_empty() {
-        return Err("kei_edit: old_string must not be empty".into());
-    }
-
-    let safe_path = validate_path(file_path)?;
-
-    let hook_input = json!({
-        "tool_name": "Edit",
-        "tool_input": {
-            "file_path": safe_path.display().to_string(),
-            "old_string": old_string,
-            "new_string": new_string
-        }
-    });
-    run_chain("edit", &hook_input).await?;
-
-    // v0.44 fix #2 (Gemini HIGH + Claude #4 MED): close TOCTOU window. After
-    // validate_path approved the path, a concurrent process could swap the
-    // file for a symlink before our write. Open the existing file with
-    // O_NOFOLLOW so the open itself fails on symlink-swap; then read/write
-    // through the open fd (not the path again) so no second path lookup.
-    open_nofollow_read_write_edit(&safe_path, old_string, new_string).await
-}
-
-async fn handle_write(args: &Value) -> Result<String, String> {
-    let file_path = args.get("file_path").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_write", "file_path"))?;
-    let content = args.get("content").and_then(Value::as_str)
-        .ok_or_else(|| missing_arg("kei_write", "content"))?;
-
-    let safe_path = validate_path(file_path)?;
-
-    let hook_input = json!({
-        "tool_name": "Write",
-        "tool_input": { "file_path": safe_path.display().to_string(), "content": content }
-    });
-    run_chain("write", &hook_input).await?;
-
-    if let Some(parent) = safe_path.parent() {
-        if !parent.as_os_str().is_empty() {
-            fs::create_dir_all(parent).await
-                .map_err(|e| format!("mkdir {}: {e}", parent.display()))?;
-        }
-    }
-    // v0.44 fix #2: open with O_NOFOLLOW + O_CREAT to refuse swap-to-symlink.
-    open_nofollow_write(&safe_path, content).await
-}
-
-/// v0.44 fix #2: edit via O_NOFOLLOW-opened fd to close the TOCTOU window
-/// between validate_path and the write. The open() itself refuses if the leaf
-/// has been swapped to a symlink during the hook-chain await.
-#[cfg(unix)]
-async fn open_nofollow_read_write_edit(
-    path: &Path, old_string: &str, new_string: &str,
-) -> Result<String, String> {
-    use std::os::unix::fs::OpenOptionsExt;
-    let path = path.to_path_buf();
-    let old_s = old_string.to_string();
-    let new_s = new_string.to_string();
-    // Blocking syscalls on a dedicated thread (tokio::task::spawn_blocking).
-    let result = tokio::task::spawn_blocking(move || -> Result<String, String> {
-        let mut f = std::fs::OpenOptions::new()
-            .read(true).write(true)
-            .custom_flags(libc::O_NOFOLLOW)
-            .open(&path)
-            .map_err(|e| format!("kei_edit: open(O_NOFOLLOW) {}: {e}", path.display()))?;
-        use std::io::{Read, Write, Seek, SeekFrom};
-        let mut contents = String::new();
-        f.read_to_string(&mut contents)
-            .map_err(|e| format!("kei_edit: read {}: {e}", path.display()))?;
-        if !contents.contains(&old_s) {
-            return Err(format!("kei_edit: old_string not found in {}", path.display()));
-        }
-        let updated = contents.replacen(&old_s, &new_s, 1);
-        f.set_len(0).map_err(|e| format!("kei_edit: truncate {}: {e}", path.display()))?;
-        f.seek(SeekFrom::Start(0))
-            .map_err(|e| format!("kei_edit: seek {}: {e}", path.display()))?;
-        f.write_all(updated.as_bytes())
-            .map_err(|e| format!("kei_edit: write {}: {e}", path.display()))?;
-        Ok(format!("edited {} ({} bytes)", path.display(), updated.len()))
-    }).await
-        .map_err(|e| format!("kei_edit: thread join: {e}"))?;
-    result
-}
-#[cfg(not(unix))]
-async fn open_nofollow_read_write_edit(
-    path: &Path, old_string: &str, new_string: &str,
-) -> Result<String, String> {
-    // Non-Unix fallback: best-effort using tokio::fs (no O_NOFOLLOW available).
-    let contents = fs::read_to_string(path).await
-        .map_err(|e| format!("read {}: {e}", path.display()))?;
-    if !contents.contains(old_string) {
-        return Err(format!("kei_edit: old_string not found in {}", path.display()));
-    }
-    let updated = contents.replacen(old_string, new_string, 1);
-    fs::write(path, &updated).await
-        .map_err(|e| format!("write {}: {e}", path.display()))?;
-    Ok(format!("edited {} ({} bytes)", path.display(), updated.len()))
-}
-
-#[cfg(unix)]
-async fn open_nofollow_write(path: &Path, content: &str) -> Result<String, String> {
-    use std::os::unix::fs::OpenOptionsExt;
-    let path = path.to_path_buf();
-    let bytes = content.as_bytes().to_vec();
-    let result = tokio::task::spawn_blocking(move || -> Result<String, String> {
-        let mut opts = std::fs::OpenOptions::new();
-        opts.write(true).create(true).truncate(true);
-        // O_NOFOLLOW: refuse if the leaf is a symlink (someone swapped it
-        // during our await). Without this the v0.42 symlink_metadata pre-check
-        // was just an indicator — fs::write still followed.
-        opts.custom_flags(libc::O_NOFOLLOW);
-        // O_EXCL combined with O_CREAT could be added when path does not yet
-        // exist to refuse any pre-existing inode — but the test suite uses
-        // the same path multiple times, so we keep truncate semantics. The
-        // O_NOFOLLOW + symlink_metadata pre-check is sufficient.
-        let mut f = opts.open(&path)
-            .map_err(|e| format!("kei_write: open(O_NOFOLLOW) {}: {e}", path.display()))?;
-        use std::io::Write;
-        f.write_all(&bytes)
-            .map_err(|e| format!("kei_write: write {}: {e}", path.display()))?;
-        Ok(format!("wrote {} ({} bytes)", path.display(), bytes.len()))
-    }).await
-        .map_err(|e| format!("kei_write: thread join: {e}"))?;
-    result
-}
-#[cfg(not(unix))]
-async fn open_nofollow_write(path: &Path, content: &str) -> Result<String, String> {
-    fs::write(path, content).await
-        .map_err(|e| format!("write {}: {e}", path.display()))?;
-    Ok(format!("wrote {} ({} bytes)", path.display(), content.len()))
-}
-
-/// Path-traversal + symlink + denylist guard.
-///
-/// v0.41 (initial): rejected `..`, canonicalized PARENT, checked denylist + roots.
-///   → 4-CLI re-audit (2026-05-26) found this was bypassable via symlink at the
-///     leaf and self-attackable via the $HOME blanket-allowed root.
-///
-/// v0.42 fixes:
-///   #1 [CRITICAL] reject if the leaf is a symlink (was: validated parent
-///      only, fs::write followed leaf symlink to anywhere). Done via
-///      `symlink_metadata` on the leaf BEFORE write, and full `canonicalize`
-///      on the leaf when the file already exists.
-///   #2 [HIGH] $HOME removed from default allowed-roots — default is $PWD
-///      only. Denylist now also covers $HOME/.claude/ (the substrate
-///      itself), shell init files, and credential stores. Operators who
-///      need broader access set KEI_ALLOWED_ROOTS explicitly.
-fn validate_path(p: &str) -> Result<PathBuf, String> {
-    if p.is_empty() {
-        return Err("file_path: empty".into());
-    }
-    // 1. Reject literal `..` segments — covers most traversal attempts.
-    if p.split('/').any(|seg| seg == "..") {
-        return Err(format!("file_path: '..' segment not allowed in {p}"));
-    }
-    let path = Path::new(p);
-
-    // 2. Build a canonical path. Walk UP to the deepest existing ancestor,
-    //    canonicalize it (resolves all symlinks in the existing prefix),
-    //    then reattach the non-existent tail. This catches symlinks at ANY
-    //    depth in the path, including nested non-existent leaves.
-    //
-    //    v0.44 fix #1 (Gemini CRITICAL): v0.42 only canonicalized the immediate
-    //    parent. If the parent didn't exist either (e.g. /proj/symlink_dir/
-    //    new_subdir/file.txt where symlink_dir → /Users/denis), the path fell
-    //    through to "absolute as-is" → no canonicalization → bypass.
-    let canonical = canonicalize_with_walk_up(path)?;
-
-    // 3. Even when the file doesn't exist yet, the LEAF could already be a
-    //    dangling symlink that `fs::write` would follow on creation. Reject.
-    if let Ok(meta) = std::fs::symlink_metadata(&canonical) {
-        if meta.file_type().is_symlink() {
-            return Err(format!(
-                "file_path: leaf is a symlink (refusing to follow): {}",
-                canonical.display()
-            ));
-        }
-    }
-
-    // 4. Allowed-root containment FIRST (v0.44 fix #6 reorder: was after
-    //    denylist, which meant macOS $TMPDIR = /private/var/folders/... hit
-    //    the /var/ denylist before reaching the allowed_roots check, blocking
-    //    legitimate use of tempfile-backed CWD on macOS).
-    //
-    //    v0.44 fix #5 (Claude HIGH): use Path::starts_with for component-aware
-    //    containment — Path::starts_with("/home/u/proj") does NOT match
-    //    /home/u/proj-secrets, the str::starts_with that was here did.
-    let roots = allowed_roots();
-    let in_allowed_root = roots.is_empty() || roots.iter().any(|r| {
-        canonical.starts_with(r)
-    });
-    if !in_allowed_root {
-        return Err(format!(
-            "file_path: outside allowed roots {:?}: {}",
-            roots, canonical.display()
-        ));
-    }
-
-    let canon_str = canonical.display().to_string();
-
-    // 5. Reject system + substrate-control + credential paths.
-    //    Note: paths inside an allowed root that also match a denylist entry
-    //    are STILL denied (e.g. agent's CWD == ~/.claude/ — denied even
-    //    though it matches a default root). System dirs not in any allowed
-    //    root would have been caught above anyway.
-    let denylist = [
-        "/etc/", "/usr/", "/System/", "/var/db/", "/var/log/", "/var/root/",
-        "/private/etc/", "/private/var/db/", "/private/var/log/", "/private/var/root/",
-        "/root/", "/bin/", "/sbin/",
-    ];
-    // NOTE: /var/folders/ (macOS $TMPDIR) and /private/tmp/ are NOT denied —
-    // they are legitimate working dirs for tempfile-backed agents.
-    for d in denylist {
-        if canon_str.starts_with(d) {
-            return Err(format!("file_path: denied (system dir): {canon_str}"));
-        }
-    }
-    if let Ok(home) = std::env::var("HOME") {
-        let dir_secrets = [
-            ".ssh/", ".aws/", ".gnupg/", ".config/gcloud/", ".cargo/credentials",
-            ".npmrc", ".docker/config.json", ".kube/",
-            ".claude/", ".grok/", ".gemini/", ".copilot/", ".kimi/",
-        ];
-        for sd in dir_secrets {
-            let full = format!("{home}/{sd}");
-            if canon_str.starts_with(&full) {
-                return Err(format!("file_path: denied (secret/substrate dir): {canon_str}"));
-            }
-        }
-        let init_files = [
-            ".zshrc", ".bashrc", ".profile", ".bash_profile", ".zprofile",
-            ".zshenv", ".bash_login", ".inputrc", ".gitconfig",
-            ".config/fish/config.fish",
-        ];
-        for f in init_files {
-            let full = format!("{home}/{f}");
-            if canon_str == full {
-                return Err(format!("file_path: denied (shell-init file): {canon_str}"));
-            }
-        }
-    }
-
-    Ok(canonical)
-}
-
-/// v0.44 fix #1: walk up the path looking for the deepest existing ancestor,
-/// canonicalize THAT, then reattach the non-existent tail components.
-/// Resolves symlinks at any depth (existing OR non-existing branches).
-fn canonicalize_with_walk_up(path: &Path) -> Result<PathBuf, String> {
-    // Make the path absolute first so we can walk up reliably.
-    let abs = if path.is_absolute() {
-        path.to_path_buf()
-    } else {
-        std::env::current_dir()
-            .map_err(|e| format!("file_path: cwd unavailable: {e}"))?
-            .join(path)
-    };
-
-    // Walk up from the leaf, collecting non-existent components in reverse.
-    let mut current = abs.clone();
-    let mut tail: Vec<std::ffi::OsString> = Vec::new();
-    let canon = loop {
-        if current.exists() {
-            break current.canonicalize()
-                .map_err(|e| format!("file_path: canonicalize {}: {e}", current.display()))?;
-        }
-        let name = current.file_name()
-            .ok_or_else(|| format!("file_path: path has no existing ancestor: {}", abs.display()))?
-            .to_os_string();
-        let parent = match current.parent() {
-            Some(p) if !p.as_os_str().is_empty() => p.to_path_buf(),
-            _ => return Err(format!("file_path: walked to root without finding existing dir: {}", abs.display())),
-        };
-        tail.push(name);
-        current = parent;
-    };
-
-    // Reattach tail (in reverse — we pushed from leaf to root).
-    let mut result = canon;
-    for name in tail.into_iter().rev() {
-        result.push(name);
-    }
-    Ok(result)
-}
-
-fn allowed_roots() -> Vec<String> {
-    // Canonicalize each entry so symlinked roots (e.g. macOS /var → /private/var,
-    // /tmp → /private/tmp) match canonicalized targets. Trailing slash added
-    // for the consistency-with-default format. v0.44 fix #5 + #6 combined.
-    let canon_with_slash = |raw: &str| -> Option<String> {
-        let p = Path::new(raw);
-        let canon = std::fs::canonicalize(p).unwrap_or_else(|_| p.to_path_buf());
-        let mut s = canon.display().to_string();
-        if !s.ends_with('/') { s.push('/'); }
-        if s.is_empty() { None } else { Some(s) }
-    };
-    if let Ok(v) = std::env::var("KEI_ALLOWED_ROOTS") {
-        return v.split(':')
-            .filter(|s| !s.is_empty())
-            .filter_map(canon_with_slash)
-            .collect();
-    }
-    let mut roots = Vec::new();
-    if let Ok(cwd) = std::env::current_dir() {
-        if let Some(r) = canon_with_slash(&cwd.display().to_string()) {
-            roots.push(r);
-        }
-    }
-    roots
-}
-
-// ---- chain runner -------------------------------------------------------
-
-/// Run the configured hook chain for `tool` ("bash"/"edit"/"write"), piping
-/// `hook_input` to each hook's stdin in order. Exit 0 → continue. Exit 2 (or
-/// other non-zero) → return Err with the hook's stderr.
-///
-/// Skips the chain if the parent process is already inside Claude or Grok
-/// (env flags), since those CLIs' native PreToolUse hooks already fired.
-/// Run the configured hook chain for `tool` ("bash"/"edit"/"write").
-///
-/// v0.42 fixes:
-///   #3 [HIGH]   empty chain (section absent or zero hooks) now FAILS CLOSED
-///               unless KEI_POLICY_CHAIN_OPTIONAL=1.
-///   #4 [MED]    load_chain() converted to async (was: blocking std::fs).
-///   #5 [MED]    hook subprocess gets `process_group(0)` + killpg on timeout
-///               (was: only the bash action got it; hooks could orphan).
-///   #6 [MED]    aggregate timeout across the whole chain + action (was:
-///               per-hook 60s, so chain+action could legitimately run
-///               4× the documented cap on a 3-hook chain).
-async fn run_chain(tool: &str, hook_input: &Value) -> Result<(), String> {
-    if env_truthy("CLAUDECODE") || env_truthy("GROKCODE") {
-        // Native hooks already enforced — don't double-fire.
-        return Ok(());
-    }
-
-    let chain = load_chain(tool).await?;
-    if chain.is_empty() {
-        // v0.42 fix #3 (Claude+Gemini HIGH): empty section is the same
-        // misconfig class as missing file — FAIL CLOSED with explicit opt-in.
-        if env_truthy("KEI_POLICY_CHAIN_OPTIONAL") {
-            return Ok(());
-        }
-        return Err(format!(
-            "[policy-chain] section [{tool}] is empty — refusing to run \
-             (set KEI_POLICY_CHAIN_OPTIONAL=1 to allow pass-through, e.g. for tests)"
-        ));
-    }
-
-    let hooks_dir = hooks_dir()?;
-    let payload = serde_json::to_string(hook_input)
-        .map_err(|e| format!("encode hook input: {e}"))?;
-
-    for hook in chain {
-        let path = hooks_dir.join(&hook);
-        if !path.is_file() {
-            return Err(format!(
-                "[policy-chain] hook missing: {} (declared in policy-chain.toml [{}])",
-                path.display(), tool
-            ));
-        }
-
-        let mut child_cmd = Command::new(&path);
-        child_cmd
-            .stdin(Stdio::piped())
-            .stdout(Stdio::piped())
-            .stderr(Stdio::piped())
-            .kill_on_drop(true);
-        set_process_group(&mut child_cmd);
-        // v0.44 fix #4: same env-isolation for hook subprocess.
-        apply_safe_env(&mut child_cmd);
-
-        let mut child = child_cmd
-            .spawn()
-            .map_err(|e| format!("spawn {}: {e}", path.display()))?;
-        let pid_opt = child.id();
-
-        if let Some(mut stdin) = child.stdin.take() {
-            stdin.write_all(payload.as_bytes()).await
-                .map_err(|e| format!("write stdin to {}: {e}", path.display()))?;
-            stdin.shutdown().await
-                .map_err(|e| format!("close stdin to {}: {e}", path.display()))?;
-        }
-
-        let fut = child.wait_with_output();
-        let out = match tokio::time::timeout(Duration::from_secs(SAFE_TOOL_TIMEOUT_SECS), fut).await {
-            Ok(Ok(o)) => o,
-            Ok(Err(e)) => return Err(format!("wait {}: {e}", path.display())),
-            Err(_) => {
-                // v0.42 fix #5: kill the whole hook process group, not just
-                // the immediate child.
-                if let Some(pid) = pid_opt {
-                    killpg_best_effort(pid);
-                }
-                return Err(format!("hook {hook} timeout"));
-            }
-        };
-
-        let code = out.status.code().unwrap_or(-1);
-        if code == 0 {
-            continue;
-        }
-        let stderr = String::from_utf8_lossy(&out.stderr).trim().to_string();
-        return Err(format!(
-            "[blocked by {hook} exit={code}]\n{stderr}"
-        ));
-    }
-    Ok(())
-}
-
-// ---- config helpers -----------------------------------------------------
-
-/// v0.42 fix #4: async + tokio::fs (was: blocking std::fs would freeze
-/// a tokio worker if policy-chain.toml lived on a slow / hung mount).
-async fn load_chain(tool: &str) -> Result<Vec<String>, String> {
-    let path = chain_path()?;
-    // tokio::fs::try_exists avoids a blocking is_file() syscall.
-    let exists = fs::try_exists(&path).await.unwrap_or(false);
-    if !exists {
-        if env_truthy("KEI_POLICY_CHAIN_OPTIONAL") {
-            return Ok(vec![]);
-        }
-        return Err(format!(
-            "[policy-chain] config missing: {} (set KEI_POLICY_CHAIN_OPTIONAL=1 to allow pass-through, e.g. for tests)",
-            path.display()
-        ));
-    }
-    let raw = fs::read_to_string(&path).await
-        .map_err(|e| format!("read policy-chain.toml: {e}"))?;
-    let parsed: PolicyChain = toml::from_str(&raw)
-        .map_err(|e| format!("parse policy-chain.toml: {e}"))?;
-    let chain = match tool {
-        "bash"  => parsed.bash.chain,
-        "edit"  => parsed.edit.chain,
-        "write" => parsed.write.chain,
-        _ => return Err(format!("unknown tool kind: {tool}")),
-    };
-    Ok(chain)
-}
-
-fn chain_path() -> Result<PathBuf, String> {
-    if let Ok(p) = std::env::var("KEI_POLICY_CHAIN") {
-        return Ok(PathBuf::from(p));
-    }
-    let dir = hooks_dir()?;
-    Ok(dir.join("_lib").join("policy-chain.toml"))
-}
-
-fn hooks_dir() -> Result<PathBuf, String> {
-    if let Ok(p) = std::env::var("KEI_HOOKS_DIR") {
-        return Ok(PathBuf::from(p));
-    }
-    let home = std::env::var("HOME").map_err(|_| "HOME not set".to_string())?;
-    Ok(PathBuf::from(home).join(".claude").join("hooks"))
-}
-
-fn env_truthy(name: &str) -> bool {
-    matches!(std::env::var(name).as_deref(), Ok("1") | Ok("true") | Ok("TRUE") | Ok("yes"))
-}
-
-fn missing_arg(tool: &str, field: &str) -> String {
-    format!("{tool}: missing '{field}' argument")
-}
-
-#[allow(dead_code)]
-const INVALID_PARAMS_REF: i32 = INVALID_PARAMS; // silence unused-import warning if removed
--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools/chain_runner.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools/chain_runner.rs
@ -0,0 +1,159 @@
+//! Policy chain loader + runner.
+//!
+//! v0.46: extracted from monolithic safe_tools.rs. Reads
+//! `~/.claude/hooks/_lib/policy-chain.toml` to get the hook list for each
+//! tool kind (bash/edit/write), pipes synthesized PreToolUse input to each
+//! hook, aborts on first non-zero exit.
+//!
+//! v0.46 architectural fix #1 (Claude critic CRITICAL): REMOVED env-based
+//! chain-skip (CLAUDECODE / GROKCODE). The skip was logically broken — it
+//! assumed native PreToolUse would catch the call, but PreToolUse matchers
+//! fire on tool_name="Bash"|"Edit"|"Write" and MCP tools are named
+//! `kei_bash`/`kei_edit`/`kei_write`. Native hooks NEVER fire on these
+//! → skip created an auth-bypass hole on Grok. Chain now ALWAYS runs.
+
+use super::env_guard::{apply_safe_env, killpg_best_effort, set_process_group};
+use super::SAFE_TOOL_TIMEOUT_SECS;
+use serde::Deserialize;
+use serde_json::Value;
+use std::path::PathBuf;
+use std::process::Stdio;
+use std::time::Duration;
+use tokio::fs;
+use tokio::io::AsyncWriteExt;
+use tokio::process::Command;
+
+#[derive(Deserialize, Default)]
+struct PolicyChain {
+    #[serde(default)]
+    bash: ChainSpec,
+    #[serde(default)]
+    edit: ChainSpec,
+    #[serde(default)]
+    write: ChainSpec,
+}
+
+#[derive(Deserialize, Default)]
+struct ChainSpec {
+    #[serde(default)]
+    chain: Vec<String>,
+}
+
+/// Run the configured hook chain for `tool` ("bash"/"edit"/"write").
+pub async fn run_chain(tool: &str, hook_input: &Value) -> Result<(), String> {
+    let chain = load_chain(tool).await?;
+    if chain.is_empty() {
+        // v0.42 fix #3: empty section is the same misconfig class as missing
+        // file — FAIL CLOSED with explicit opt-in.
+        if env_truthy("KEI_POLICY_CHAIN_OPTIONAL") {
+            return Ok(());
+        }
+        return Err(format!(
+            "[policy-chain] section [{tool}] is empty — refusing to run \
+             (set KEI_POLICY_CHAIN_OPTIONAL=1 to allow pass-through, e.g. for tests)"
+        ));
+    }
+
+    let hooks_dir = hooks_dir()?;
+    let payload = serde_json::to_string(hook_input)
+        .map_err(|e| format!("encode hook input: {e}"))?;
+
+    for hook in chain {
+        let path = hooks_dir.join(&hook);
+        if !path.is_file() {
+            return Err(format!(
+                "[policy-chain] hook missing: {} (declared in policy-chain.toml [{}])",
+                path.display(), tool
+            ));
+        }
+
+        let mut child_cmd = Command::new(&path);
+        child_cmd
+            .stdin(Stdio::piped())
+            .stdout(Stdio::piped())
+            .stderr(Stdio::piped())
+            .kill_on_drop(true);
+        set_process_group(&mut child_cmd);
+        apply_safe_env(&mut child_cmd);
+
+        let mut child = child_cmd
+            .spawn()
+            .map_err(|e| format!("spawn {}: {e}", path.display()))?;
+        let pid_opt = child.id();
+
+        if let Some(mut stdin) = child.stdin.take() {
+            stdin.write_all(payload.as_bytes()).await
+                .map_err(|e| format!("write stdin to {}: {e}", path.display()))?;
+            stdin.shutdown().await
+                .map_err(|e| format!("close stdin to {}: {e}", path.display()))?;
+        }
+
+        let fut = child.wait_with_output();
+        let out = match tokio::time::timeout(Duration::from_secs(SAFE_TOOL_TIMEOUT_SECS), fut).await {
+            Ok(Ok(o)) => o,
+            Ok(Err(e)) => return Err(format!("wait {}: {e}", path.display())),
+            Err(_) => {
+                if let Some(pid) = pid_opt {
+                    killpg_best_effort(pid);
+                }
+                return Err(format!("hook {hook} timeout"));
+            }
+        };
+
+        let code = out.status.code().unwrap_or(-1);
+        if code == 0 {
+            continue;
+        }
+        let stderr = String::from_utf8_lossy(&out.stderr).trim().to_string();
+        return Err(format!(
+            "[blocked by {hook} exit={code}]\n{stderr}"
+        ));
+    }
+    Ok(())
+}
+
+/// v0.42 fix #4: async + tokio::fs (blocking std::fs could freeze a tokio worker).
+async fn load_chain(tool: &str) -> Result<Vec<String>, String> {
+    let path = chain_path()?;
+    let exists = fs::try_exists(&path).await.unwrap_or(false);
+    if !exists {
+        if env_truthy("KEI_POLICY_CHAIN_OPTIONAL") {
+            return Ok(vec![]);
+        }
+        return Err(format!(
+            "[policy-chain] config missing: {} (set KEI_POLICY_CHAIN_OPTIONAL=1 to allow pass-through, e.g. for tests)",
+            path.display()
+        ));
+    }
+    let raw = fs::read_to_string(&path).await
+        .map_err(|e| format!("read policy-chain.toml: {e}"))?;
+    let parsed: PolicyChain = toml::from_str(&raw)
+        .map_err(|e| format!("parse policy-chain.toml: {e}"))?;
+    let chain = match tool {
+        "bash"  => parsed.bash.chain,
+        "edit"  => parsed.edit.chain,
+        "write" => parsed.write.chain,
+        _ => return Err(format!("unknown tool kind: {tool}")),
+    };
+    Ok(chain)
+}
+
+fn chain_path() -> Result<PathBuf, String> {
+    if let Ok(p) = std::env::var("KEI_POLICY_CHAIN") {
+        return Ok(PathBuf::from(p));
+    }
+    let dir = hooks_dir()?;
+    Ok(dir.join("_lib").join("policy-chain.toml"))
+}
+
+fn hooks_dir() -> Result<PathBuf, String> {
+    if let Ok(p) = std::env::var("KEI_HOOKS_DIR") {
+        return Ok(PathBuf::from(p));
+    }
+    let home = std::env::var("HOME").map_err(|_| "HOME not set".to_string())?;
+    Ok(PathBuf::from(home).join(".claude").join("hooks"))
+}
+
+fn env_truthy(name: &str) -> bool {
+    matches!(std::env::var(name).as_deref(), Ok("1") | Ok("true") | Ok("TRUE") | Ok("yes"))
+}
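
Review note on chain_runner.rs: the decision logic of `run_chain` (fail closed on an empty chain unless opted in, then abort on the first non-zero hook exit) can be sketched without spawning any process. This is a minimal illustration, not the crate's code: `HookResult` and `evaluate_chain` are hypothetical names, and the hook outcomes are passed in as plain data instead of being collected from subprocesses.

```rust
/// Hypothetical record of one finished hook: name, exit code, captured stderr.
pub struct HookResult {
    pub name: String,
    pub exit_code: i32,
    pub stderr: String,
}

/// Mirrors the gate in `run_chain`:
/// 1. an empty chain is an error unless `optional` is set (fail closed);
/// 2. the first hook with a non-zero exit code blocks the call.
pub fn evaluate_chain(tool: &str, results: &[HookResult], optional: bool) -> Result<(), String> {
    if results.is_empty() {
        if optional {
            return Ok(());
        }
        return Err(format!("[policy-chain] section [{tool}] is empty"));
    }
    for r in results {
        if r.exit_code != 0 {
            // Later hooks are never consulted once one hook blocks.
            return Err(format!(
                "[blocked by {} exit={}]\n{}",
                r.name,
                r.exit_code,
                r.stderr.trim()
            ));
        }
    }
    Ok(())
}
```

Any non-zero code blocks here, which matches the `code == 0` check in the diff; the sketch deliberately does not single out exit code 2.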
--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools/env_guard.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools/env_guard.rs
@ -0,0 +1,79 @@
+//! Subprocess environment + process-group hardening for kei_* tools.
+//!
+//! v0.46: extracted from monolithic safe_tools.rs.
+
+use tokio::process::Command;
+
+/// v0.41 fix #5: process-group helper (Unix-only; no-op on other platforms).
+/// tokio::process::Command::process_group is available on Unix without
+/// requiring the std::os::unix::process::CommandExt trait import.
+#[cfg(unix)]
+pub fn set_process_group(cmd: &mut Command) {
+    cmd.process_group(0);
+}
+#[cfg(not(unix))]
+pub fn set_process_group(_cmd: &mut Command) {}
+
+/// v0.41 fix #5: SIGKILL the entire process group (negative pid).
+#[cfg(unix)]
+pub fn killpg_best_effort(pid: u32) {
+    unsafe {
+        let _ = libc::kill(-(pid as i32), libc::SIGKILL);
+    }
+}
+#[cfg(not(unix))]
+pub fn killpg_best_effort(_pid: u32) {}
+
+/// v0.46 architectural fix: RAII guard. `kill_on_drop` only kills the
+/// immediate child; backgrounded grandchildren survive (e.g. `bash -c
+/// 'sleep 1000 &'`). v0.41 killpg fix only ran on the timeout error path.
+/// Now: killpg fires on EVERY exit path (success, error, panic, early return)
+/// via Drop. Caller disarms on clean wait_with_output success via `disarm()`.
+pub struct KillPgGuard {
+    pid: Option<u32>,
+}
+
+impl KillPgGuard {
+    pub fn new(pid: Option<u32>) -> Self { Self { pid } }
+    /// Caller succeeded cleanly; child is already reaped by wait_with_output.
+    /// Skip the killpg fire on Drop.
+    pub fn disarm(&mut self) { self.pid = None; }
+}
+
+impl Drop for KillPgGuard {
+    fn drop(&mut self) {
+        if let Some(pid) = self.pid {
+            killpg_best_effort(pid);
+        }
+    }
+}
+
+/// v0.44 fix #4 (Gemini HIGH): strip parent env on subprocess spawn so secrets
+/// like AWS_*, GITHUB_TOKEN, MOONSHOT_API_KEY etc. don't leak to user-controlled
+/// bash commands or hook scripts. Whitelist forwards only PATH/HOME/USER/
+/// LOGNAME/SHELL/LANG/LC_ALL/LC_CTYPE/TERM/PWD/TMPDIR: enough to keep tools
+/// functional, none of it sensitive. Other LC_* variables are not forwarded.
+///
+/// Override: `KEI_SAFE_ENV_EXTRA` takes a colon-separated list of variable
+/// names to add to the whitelist (e.g. `NIX_PATH:JAVA_HOME`).
+pub fn apply_safe_env(cmd: &mut Command) {
+    cmd.env_clear();
+    let default_keep = [
+        "PATH", "HOME", "USER", "LOGNAME", "SHELL", "LANG", "LC_ALL",
+        "LC_CTYPE", "TERM", "PWD", "TMPDIR",
+    ];
+    for k in default_keep {
+        if let Ok(v) = std::env::var(k) {
+            cmd.env(k, v);
+        }
+    }
+    if let Ok(extras) = std::env::var("KEI_SAFE_ENV_EXTRA") {
+        for k in extras.split(':') {
+            let k = k.trim();
+            if k.is_empty() { continue; }
+            if let Ok(v) = std::env::var(k) {
+                cmd.env(k, v);
+            }
+        }
+    }
+}
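
Review note on env_guard.rs: the armed-by-default behaviour of `KillPgGuard` can be exercised without signalling any real process. The sketch below is an assumption-laden stand-in: `CleanupGuard` and `run_step` are hypothetical names, and a counter increment replaces the `killpg` call so the effect of each exit path is observable.

```rust
use std::cell::Cell;

/// Runs `on_drop` when dropped unless `disarm()` was called first.
/// Stand-in for KillPgGuard, with a closure in place of killpg.
pub struct CleanupGuard<F: FnMut()> {
    on_drop: Option<F>,
}

impl<F: FnMut()> CleanupGuard<F> {
    pub fn new(on_drop: F) -> Self {
        Self { on_drop: Some(on_drop) }
    }

    /// Clean success: skip the cleanup on Drop.
    pub fn disarm(&mut self) {
        self.on_drop = None;
    }
}

impl<F: FnMut()> Drop for CleanupGuard<F> {
    fn drop(&mut self) {
        if let Some(mut f) = self.on_drop.take() {
            f();
        }
    }
}

/// Simulates a handler; `fail` models any early `return Err(..)`.
pub fn run_step(fail: bool, cleanups: &Cell<u32>) -> Result<(), String> {
    let mut guard = CleanupGuard::new(|| cleanups.set(cleanups.get() + 1));
    if fail {
        // The guard is still armed, so the cleanup runs as it drops here.
        return Err("step failed".to_string());
    }
    guard.disarm();
    Ok(())
}
```

Calling `run_step(true, &counter)` increments the counter once, while `run_step(false, &counter)` leaves it untouched, which is the same split between failure and clean-success paths that `handle_bash` relies on.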
--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools/exec.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools/exec.rs
@ -0,0 +1,222 @@
+//! Action executors for the three kei_* MCP tools.
+//!
+//! v0.46: extracted from monolithic safe_tools.rs. Wraps shell + file
+//! operations with O_NOFOLLOW (close TOCTOU after policy chain) and uses
+//! KillPgGuard (env_guard.rs) so killpg fires on every failure exit path of
+//! kei_bash (not just the timeout arm); the guard is disarmed on clean success.
+
+use super::chain_runner::run_chain;
+use super::env_guard::{apply_safe_env, set_process_group, KillPgGuard};
+use super::path_guard::validate_path;
+use super::SAFE_TOOL_TIMEOUT_SECS;
+use serde_json::{json, Value};
+use std::path::{Path, PathBuf};
+use std::process::Stdio;
+use std::time::Duration;
+use tokio::fs;
+use tokio::process::Command;
+
+pub async fn handle_bash(args: &Value) -> Result<String, String> {
+    let command = args.get("command").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_bash", "command"))?;
+    let cwd = args.get("cwd").and_then(Value::as_str);
+
+    let hook_input = json!({
+        "tool_name": "Bash",
+        "tool_input": {
+            "command": command,
+            "cwd": cwd
+        }
+    });
+    run_chain("bash", &hook_input).await?;
+
+    let mut cmd = Command::new("bash");
+    cmd.arg("-c").arg(command);
+    if let Some(dir) = cwd {
+        cmd.current_dir(dir);
+    }
+    cmd.stdin(Stdio::null())
+        .stdout(Stdio::piped())
+        .stderr(Stdio::piped())
+        .kill_on_drop(true);
+    set_process_group(&mut cmd);
+    apply_safe_env(&mut cmd);
+
+    let child = cmd.spawn().map_err(|e| format!("spawn bash: {e}"))?;
+    let pid_opt = child.id();
+    // v0.46 architectural fix: RAII guard. killpg fires on ANY exit path —
+    // including early returns, panics, and normal success (until disarmed).
+    let mut killpg_guard = KillPgGuard::new(pid_opt);
+
+    let fut = child.wait_with_output();
+    let out = match tokio::time::timeout(Duration::from_secs(SAFE_TOOL_TIMEOUT_SECS), fut).await {
+        Ok(Ok(o)) => o,
+        Ok(Err(e)) => return Err(format!("wait bash: {e}")),
+        Err(_) => return Err("kei_bash timeout".to_string()),
+        // Both error arms return early: the guard drops and killpg fires.
+    };
+
+    let stdout = String::from_utf8_lossy(&out.stdout).to_string();
+    let stderr = String::from_utf8_lossy(&out.stderr).to_string();
+    if !out.status.success() {
+        return Err(format!(
+            "bash exited {}: {}",
+            out.status.code().unwrap_or(-1),
+            stderr.trim()
+        ));
+    }
+    // v0.46 architectural fix: the guard is armed by default, so every early
+    // return above (wait error, timeout, non-zero exit) fires killpg on Drop.
+    // Disarm ONLY here, after the parent shell has exited cleanly and has
+    // been reaped by wait_with_output.
+    killpg_guard.disarm();
+    // Trade-off of disarming on success: kill_on_drop only targets the direct
+    // child, so once the guard is disarmed nothing signals the rest of the
+    // process group. `&` jobs that outlive the shell are therefore NOT killed
+    // after a successful kei_bash call. If the tool must never leak processes
+    // across calls, leave the guard armed instead of disarming it. Callers
+    // that want long-running jobs should still start them under nohup or a
+    // dedicated tool rather than rely on this behaviour.
+    drop(killpg_guard); // disarmed, so this drop is a no-op
+    Ok(if stderr.is_empty() { stdout } else { format!("{stdout}\n[stderr]\n{stderr}") })
+}
+
+pub async fn handle_edit(args: &Value) -> Result<String, String> {
+    let file_path = args.get("file_path").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_edit", "file_path"))?;
+    let old_string = args.get("old_string").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_edit", "old_string"))?;
+    let new_string = args.get("new_string").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_edit", "new_string"))?;
+
+    if old_string.is_empty() {
+        return Err("kei_edit: old_string must not be empty".into());
+    }
+
+    // v0.46 fix #4: blocking path validation moved off the tokio worker.
+    let p_owned = file_path.to_string();
+    let safe_path = tokio::task::spawn_blocking(move || validate_path(&p_owned))
+        .await
+        .map_err(|e| format!("kei_edit: thread join: {e}"))??;
+
+    let hook_input = json!({
+        "tool_name": "Edit",
+        "tool_input": {
+            "file_path": safe_path.display().to_string(),
+            "old_string": old_string,
+            "new_string": new_string
+        }
+    });
+    run_chain("edit", &hook_input).await?;
+
+    open_nofollow_read_write_edit(&safe_path, old_string, new_string).await
+}
+
+pub async fn handle_write(args: &Value) -> Result<String, String> {
+    let file_path = args.get("file_path").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_write", "file_path"))?;
+    let content = args.get("content").and_then(Value::as_str)
+        .ok_or_else(|| missing_arg("kei_write", "content"))?;
+
+    let p_owned = file_path.to_string();
+    let safe_path = tokio::task::spawn_blocking(move || validate_path(&p_owned))
+        .await
+        .map_err(|e| format!("kei_write: thread join: {e}"))??;
+
+    let hook_input = json!({
+        "tool_name": "Write",
+        "tool_input": { "file_path": safe_path.display().to_string(), "content": content }
+    });
+    run_chain("write", &hook_input).await?;
+
+    if let Some(parent) = safe_path.parent() {
+        if !parent.as_os_str().is_empty() {
+            fs::create_dir_all(parent).await
+                .map_err(|e| format!("mkdir {}: {e}", parent.display()))?;
+        }
+    }
+    open_nofollow_write(&safe_path, content).await
+}
+
+/// v0.44 fix #2: edit via O_NOFOLLOW-opened fd to close the TOCTOU window
+/// between validate_path and the write.
+#[cfg(unix)]
+async fn open_nofollow_read_write_edit(
+    path: &Path, old_string: &str, new_string: &str,
+) -> Result<String, String> {
+    use std::os::unix::fs::OpenOptionsExt;
+    let path = path.to_path_buf();
+    let old_s = old_string.to_string();
+    let new_s = new_string.to_string();
+    let result = tokio::task::spawn_blocking(move || -> Result<String, String> {
+        let mut f = std::fs::OpenOptions::new()
+            .read(true).write(true)
+            .custom_flags(libc::O_NOFOLLOW)
+            .open(&path)
+            .map_err(|e| format!("kei_edit: open(O_NOFOLLOW) {}: {e}", path.display()))?;
+        use std::io::{Read, Write, Seek, SeekFrom};
+        let mut contents = String::new();
+        f.read_to_string(&mut contents)
+            .map_err(|e| format!("kei_edit: read {}: {e}", path.display()))?;
+        if !contents.contains(&old_s) {
+            return Err(format!("kei_edit: old_string not found in {}", path.display()));
+        }
+        let updated = contents.replacen(&old_s, &new_s, 1);
+        f.set_len(0).map_err(|e| format!("kei_edit: truncate {}: {e}", path.display()))?;
+        f.seek(SeekFrom::Start(0))
+            .map_err(|e| format!("kei_edit: seek {}: {e}", path.display()))?;
+        f.write_all(updated.as_bytes())
+            .map_err(|e| format!("kei_edit: write {}: {e}", path.display()))?;
+        Ok(format!("edited {} ({} bytes)", path.display(), updated.len()))
+    }).await
+        .map_err(|e| format!("kei_edit: thread join: {e}"))?;
+    result
+}
+#[cfg(not(unix))]
+async fn open_nofollow_read_write_edit(
+    path: &Path, old_string: &str, new_string: &str,
+) -> Result<String, String> {
+    let contents = fs::read_to_string(path).await
+        .map_err(|e| format!("read {}: {e}", path.display()))?;
+    if !contents.contains(old_string) {
+        return Err(format!("kei_edit: old_string not found in {}", path.display()));
+    }
+    let updated = contents.replacen(old_string, new_string, 1);
+    fs::write(path, &updated).await
+        .map_err(|e| format!("write {}: {e}", path.display()))?;
+    Ok(format!("edited {} ({} bytes)", path.display(), updated.len()))
+}
+
+#[cfg(unix)]
+async fn open_nofollow_write(path: &Path, content: &str) -> Result<String, String> {
+    use std::os::unix::fs::OpenOptionsExt;
+    let path = path.to_path_buf();
+    let bytes = content.as_bytes().to_vec();
+    let result = tokio::task::spawn_blocking(move || -> Result<String, String> {
+        let mut opts = std::fs::OpenOptions::new();
+        opts.write(true).create(true).truncate(true);
+        opts.custom_flags(libc::O_NOFOLLOW);
+        let mut f = opts.open(&path)
+            .map_err(|e| format!("kei_write: open(O_NOFOLLOW) {}: {e}", path.display()))?;
+        use std::io::Write;
+        f.write_all(&bytes)
+            .map_err(|e| format!("kei_write: write {}: {e}", path.display()))?;
+        Ok(format!("wrote {} ({} bytes)", path.display(), bytes.len()))
+    }).await
+        .map_err(|e| format!("kei_write: thread join: {e}"))?;
+    result
+}
+#[cfg(not(unix))]
+async fn open_nofollow_write(path: &Path, content: &str) -> Result<String, String> {
+    fs::write(path, content).await
+        .map_err(|e| format!("write {}: {e}", path.display()))?;
+    Ok(format!("wrote {} ({} bytes)", path.display(), content.len()))
+}
+
+fn missing_arg(tool: &str, field: &str) -> String {
+    format!("{tool}: missing '{field}' argument")
+}
+
+// Keeps the PathBuf import referenced; no other item in this file names the type.
+#[allow(dead_code)]
+fn _path_buf_keep() -> Option<PathBuf> { None }
--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools/mod.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools/mod.rs
@ -0,0 +1,99 @@
+//! Phase C — cross-CLI hook enforcement via MCP-wrapped tools.
+//!
+//! v0.46: decomposed from single safe_tools.rs (738 LOC, god-object per
+//! architect audit) into 5 focused modules:
+//!
+//!   mod.rs          — descriptor list + tools/call dispatch (this file)
+//!   chain_runner.rs — load_chain + run_chain (policy enforcement engine)
+//!   path_guard.rs   — validate_path + canonicalize-with-walk-up + allowed_roots
+//!   exec.rs         — handle_bash/edit/write + O_NOFOLLOW open + write paths
+//!   env_guard.rs    — apply_safe_env + set_process_group + KillPgGuard (RAII)
+//!
+//! Exposes three built-in MCP tools — `kei_bash`, `kei_edit`, `kei_write` —
+//! that synthesize Claude Code's PreToolUse hook input contract and chain
+//! through the hook scripts in `~/.claude/hooks/_lib/policy-chain.toml`.
+//!
+//! v0.46 architectural fix #1 (Claude critic CRITICAL): REMOVED env-based
+//! chain-skip (was `CLAUDECODE=1` / `GROKCODE=1` → skip). Rationale: those
+//! envs were set assuming "if we're inside Claude/Grok, native PreToolUse
+//! already fires — skip our chain to avoid double-firing". But native
+//! PreToolUse matchers fire on tool_name = "Bash"|"Edit"|"Write" — these
+//! MCP tools are named `kei_bash`/`kei_edit`/`kei_write` (or with mcp__
+//! prefix). Native hooks therefore NEVER fire on these calls, and the
+//! env-skip created a real auth-bypass hole on Grok. Chain now ALWAYS
+//! runs; the double-firing it was meant to avoid could never happen.
+
+use crate::protocol::{err, ok, JsonRpcRequest, JsonRpcResponse, INTERNAL_ERROR};
+use serde_json::{json, Value};
+
+mod chain_runner;
+mod env_guard;
+mod exec;
+mod path_guard;
+
+/// Per-step timeout (each hook AND the action each get up to this long).
+/// For an N-hook chain the total wall-clock cap is approximately
+/// `(N+1) * SAFE_TOOL_TIMEOUT_SECS`. v0.44 doc-honesty: prior versions
+/// claimed this was an "aggregate" cap which was always wrong.
+pub(crate) const SAFE_TOOL_TIMEOUT_SECS: u64 = 60;
+
+/// MCP tool descriptors — appended to `tools/list` by `handlers::tools::list`.
+pub fn descriptors() -> Vec<Value> {
+    vec![
+        json!({
+            "name": "kei_bash",
+            "description": "Run a shell command after running KeiSeiKit's [bash] policy chain (no-github-push, safety-guard, destructive-guard). Blocks on any non-zero hook exit, with the hook's stderr surfaced as the MCP error message. Use this instead of native shell on non-Claude CLIs to inherit Claude Code's safety enforcement.",
+            "inputSchema": {
+                "type": "object",
+                "properties": {
+                    "command": { "type": "string", "description": "Shell command to execute" },
+                    "cwd": { "type": "string", "description": "Optional working directory; defaults to $PWD" }
+                },
+                "required": ["command"]
+            }
+        }),
+        json!({
+            "name": "kei_edit",
+            "description": "Modify a file (replace old_string with new_string) after running KeiSeiKit's [edit] policy chain (citation-verify, numeric-claims-guard). Blocks unverified academic citations and numeric claims without evidence markers.",
+            "inputSchema": {
+                "type": "object",
+                "properties": {
+                    "file_path": { "type": "string" },
+                    "old_string": { "type": "string" },
+                    "new_string": { "type": "string" }
+                },
+                "required": ["file_path", "old_string", "new_string"]
+            }
+        }),
+        json!({
+            "name": "kei_write",
+            "description": "Write content to a file after running KeiSeiKit's [write] policy chain (citation-verify, numeric-claims-guard). Blocks unverified academic citations and numeric claims without evidence markers.",
+            "inputSchema": {
+                "type": "object",
+                "properties": {
+                    "file_path": { "type": "string" },
+                    "content": { "type": "string" }
+                },
+                "required": ["file_path", "content"]
+            }
+        }),
+    ]
+}
+
+/// Dispatch entry — called from `handlers::tools::call` when the tool name
+/// matches one of the three `kei_*` built-ins.
+pub async fn dispatch_safe(req: JsonRpcRequest, name: &str, args: &Value) -> JsonRpcResponse {
+    let result = match name {
+        "kei_bash"  => exec::handle_bash(args).await,
+        "kei_edit"  => exec::handle_edit(args).await,
+        "kei_write" => exec::handle_write(args).await,
+        _ => Err(format!("safe_tools dispatched unknown name: {name}")),
+    };
+    match result {
+        Ok(text) => ok(req.id, json!({
+            "content": [{ "type": "text", "text": text }],
+            "isError": false,
+        })),
+        Err(e) => err(req.id, INTERNAL_ERROR, e),
+    }
+}
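
Review note on mod.rs: the per-step timeout described on `SAFE_TOOL_TIMEOUT_SECS` gives a worst-case bound that is easy to compute. A minimal sketch, assuming the 60-second constant from the diff; `STEP_TIMEOUT_SECS` and `worst_case_secs` are hypothetical names used only for illustration.

```rust
/// Per-step cap in seconds, mirroring SAFE_TOOL_TIMEOUT_SECS in the diff.
const STEP_TIMEOUT_SECS: u64 = 60;

/// Approximate worst-case wall clock: each of the `n_hooks` hooks and the
/// action itself may each use a full step timeout.
pub fn worst_case_secs(n_hooks: u64) -> u64 {
    (n_hooks + 1) * STEP_TIMEOUT_SECS
}
```

For a 3-hook chain this gives 240 seconds, i.e. four times the per-step cap. The bound is approximate because the stdin write to each hook in `run_chain` sits outside the timed `wait_with_output` call.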
--- a/_primitives/_rust/kei-mcp/src/handlers/safe_tools/path_guard.rs
+++ b/_primitives/_rust/kei-mcp/src/handlers/safe_tools/path_guard.rs
@ -0,0 +1,166 @@
+//! Path-traversal + symlink + denylist guard for `kei_edit` / `kei_write`.
+//!
+//! v0.46: extracted from monolithic safe_tools.rs. Pure-sync helpers — the
+//! async handlers in exec.rs wrap them in `spawn_blocking` so a slow
+//! `canonicalize` syscall doesn't starve a tokio worker (v0.46 fix #4).
+
+use std::path::{Path, PathBuf};
+
+/// v0.41 (initial): rejected `..`, canonicalized PARENT, checked denylist + roots.
+///   → 4-CLI re-audit (2026-05-26) found this was bypassable via symlink at the
+///     leaf and self-attackable via the $HOME blanket-allowed root.
+///
+/// v0.42 fixes:
+///   #1 [CRITICAL] reject if the leaf is a symlink for new files; canonicalize
+///      full path when the file exists.
+///   #2 [HIGH] $HOME removed from default allowed-roots — default is $PWD only.
+///      Denylist now also covers $HOME/.claude/ (the substrate itself), shell
+///      init files, and credential stores.
+///
+/// v0.44 fixes:
+///   #1 [CRITICAL] walk_up_to_canonicalize — finds deepest existing ancestor,
+///      canonicalizes THAT (resolving all symlinks in the existing prefix),
+///      reattaches the non-existent tail. Closes the "parent's parent is a
+///      symlink" bypass.
+///   #5 [HIGH] Path::starts_with for component-aware containment + canonical
+///      KEI_ALLOWED_ROOTS so /var → /private/var symlink works on macOS.
+///   #6 [MED] allowed_roots check FIRST; narrowed /var/ blanket to /var/db/,
+///      /var/log/, /var/root/ — macOS $TMPDIR = /var/folders/ now allowed.
+pub fn validate_path(p: &str) -> Result<PathBuf, String> {
+    if p.is_empty() {
+        return Err("file_path: empty".into());
+    }
+    if p.split('/').any(|seg| seg == "..") {
+        return Err(format!("file_path: '..' segment not allowed in {p}"));
+    }
+    let path = Path::new(p);
+    let canonical = canonicalize_with_walk_up(path)?;
+
+    // Reject if the leaf is a symlink (covers dangling symlinks for new files).
+    if let Ok(meta) = std::fs::symlink_metadata(&canonical) {
+        if meta.file_type().is_symlink() {
+            return Err(format!(
+                "file_path: leaf is a symlink (refusing to follow): {}",
+                canonical.display()
+            ));
+        }
+    }
+
+    // Allowed-root containment FIRST (v0.44 fix #6).
+    let roots = allowed_roots();
+    // v0.46 fix #3: empty allowed_roots → fail-CLOSED (was: silently
+    // disabled containment). There is no opt-out: KEI_ALLOWED_ROOTS="" also
+    // yields an empty list and is rejected here.
+    if roots.is_empty() {
+        return Err(
+            "file_path: allowed_roots is empty — refusing all writes \
+             (set KEI_ALLOWED_ROOTS to a non-empty value or run from a real cwd)".into()
+        );
+    }
+    let in_allowed_root = roots.iter().any(|r| canonical.starts_with(r));
+    if !in_allowed_root {
+        return Err(format!(
+            "file_path: outside allowed roots {:?}: {}",
+            roots, canonical.display()
+        ));
+    }
+
+    let canon_str = canonical.display().to_string();
+
+    // Reject system + substrate-control + credential paths.
+    let denylist = [
+        "/etc/", "/usr/", "/System/", "/var/db/", "/var/log/", "/var/root/",
+        "/private/etc/", "/private/var/db/", "/private/var/log/", "/private/var/root/",
+        "/root/", "/bin/", "/sbin/",
+    ];
+    for d in denylist {
+        if canon_str.starts_with(d) {
+            return Err(format!("file_path: denied (system dir): {canon_str}"));
+        }
+    }
+    if let Ok(home) = std::env::var("HOME") {
+        let dir_secrets = [
+            ".ssh/", ".aws/", ".gnupg/", ".config/gcloud/", ".cargo/credentials",
+            ".npmrc", ".docker/config.json", ".kube/",
+            ".claude/", ".grok/", ".gemini/", ".copilot/", ".kimi/",
+        ];
+        for sd in dir_secrets {
+            let full = format!("{home}/{sd}");
+            if canon_str.starts_with(&full) {
+                return Err(format!("file_path: denied (secret/substrate dir): {canon_str}"));
+            }
+        }
+        let init_files = [
+            ".zshrc", ".bashrc", ".profile", ".bash_profile", ".zprofile",
+            ".zshenv", ".bash_login", ".inputrc", ".gitconfig",
+            ".config/fish/config.fish",
+        ];
+        for f in init_files {
+            let full = format!("{home}/{f}");
+            if canon_str == full {
+                return Err(format!("file_path: denied (shell-init file): {canon_str}"));
+            }
+        }
+    }
+
+    Ok(canonical)
+}
+
+/// v0.44 fix #1: walk up the path looking for the deepest existing ancestor,
+/// canonicalize THAT, then reattach the non-existent tail components.
+fn canonicalize_with_walk_up(path: &Path) -> Result<PathBuf, String> {
+    let abs = if path.is_absolute() {
+        path.to_path_buf()
+    } else {
+        std::env::current_dir()
+            .map_err(|e| format!("file_path: cwd unavailable: {e}"))?
+            .join(path)
+    };
+
+    let mut current = abs.clone();
+    let mut tail: Vec<std::ffi::OsString> = Vec::new();
+    let canon = loop {
+        if current.exists() {
+            break current.canonicalize()
+                .map_err(|e| format!("file_path: canonicalize {}: {e}", current.display()))?;
+        }
+        let name = current.file_name()
+            .ok_or_else(|| format!("file_path: path has no existing ancestor: {}", abs.display()))?
+            .to_os_string();
+        let parent = match current.parent() {
+            Some(p) if !p.as_os_str().is_empty() => p.to_path_buf(),
+            _ => return Err(format!("file_path: walked to root without finding existing dir: {}", abs.display())),
+        };
+        tail.push(name);
+        current = parent;
+    };
+
+    let mut result = canon;
+    for name in tail.into_iter().rev() {
+        result.push(name);
+    }
+    Ok(result)
+}
+
+pub fn allowed_roots() -> Vec<String> {
+    let canon_with_slash = |raw: &str| -> Option<String> {
+        let p = Path::new(raw);
+        let canon = std::fs::canonicalize(p).unwrap_or_else(|_| p.to_path_buf());
+        let mut s = canon.display().to_string();
+        if s.is_empty() { return None; }
+        if !s.ends_with('/') { s.push('/'); }
+        Some(s)
+    };
+    if let Ok(v) = std::env::var("KEI_ALLOWED_ROOTS") {
+        return v.split(':')
+            .filter(|s| !s.is_empty())
+            .filter_map(canon_with_slash)
+            .collect();
+    }
+    let mut roots = Vec::new();
+    if let Ok(cwd) = std::env::current_dir() {
+        if let Some(r) = canon_with_slash(&cwd.display().to_string()) {
+            roots.push(r);
+        }
+    }
+    roots
+}
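
Review note on `allowed_roots()`: every root is returned with a trailing `/`. A minimal sketch of why that matters, assuming the caller compares paths with a plain string-prefix test (as the denylist loop above does with `starts_with`). The helper names `with_trailing_slash` and `is_under_any_root` are hypothetical and not part of this commit.

```rust
/// Hypothetical helper: normalize a root so it always ends with '/'.
fn with_trailing_slash(root: &str) -> String {
    let mut s = root.to_string();
    if !s.ends_with('/') {
        s.push('/');
    }
    s
}

/// Hypothetical helper: true if `candidate` starts with any of `roots`.
fn is_under_any_root(candidate: &str, roots: &[String]) -> bool {
    roots.iter().any(|r| candidate.starts_with(r.as_str()))
}

fn main() {
    let naive = vec!["/srv/app".to_string()];
    let slashed = vec![with_trailing_slash("/srv/app")];

    // Without the trailing slash, a sibling directory that merely shares
    // the prefix is accepted by the string comparison.
    assert!(is_under_any_root("/srv/app-evil/x.txt", &naive));

    // With the trailing slash, the sibling is rejected and real children pass.
    assert!(!is_under_any_root("/srv/app-evil/x.txt", &slashed));
    assert!(is_under_any_root("/srv/app/src/main.rs", &slashed));

    println!("ok");
}
```

One consequence of this scheme is that the root directory itself, written without a trailing slash, does not match its own entry; only paths strictly below the root do.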
--- a/bin/kei
+++ b/bin/kei
@@ -235,7 +235,7 @@ ${C1}    ██╔═██╗ ██╔══╝  ██║╚════█
 ${C1}    ██║  ██╗███████╗██║███████║███████╗██║${C0}
 ${C1}    ╚═╝  ╚═╝╚══════╝╚═╝╚══════╝╚══════╝╚═╝${C0}

-${C2}    KeiSeiKit · substrate v0.45${C0}
+${C2}    KeiSeiKit · substrate v0.46${C0}
 ${C3}    ─────────────────────────────────────${C0}
      primary CLI    : ${CV}${PRIMARY}${C0}
      profile        : ${CV}${p}${C0}
--- a/plugin.json
+++ b/plugin.json
@@ -3,7 +3,7 @@
  "name": "keisei",
  "displayName": "KeiSei",
  "description": "Constructor Pattern multi-LLM agent substrate — 38 agents, 69 skills, 54 hooks, 86 blocks. Cross-CLI policy enforcement (Claude/Grok/Copilot/Agy/Kimi) via kei-mcp + kei_bash/kei_edit/kei_write. Rust primitives via classic ./install.sh.",
-  "version": "0.45.0",
+  "version": "0.46.0",
  "homepage": "https://keisei.app",
  "repository": "https://github.com/KeiSeiLab/KeiSeiKit-1.0.git",
  "author": {
--- a/scripts/kei-agent-cli.sh
+++ b/scripts/kei-agent-cli.sh
@@ -145,7 +145,7 @@ backend_invoke() {
      printf '[kei-agent-cli] (or pipe via `kimi acp` if you have an ACP client.)\n' >&2
      exec "$bin"
      ;;
-    codex)                exec "$bin" -p "$prompt" ;;
+    codex)                exec "$bin" exec "$prompt" ;;
  esac
 }

--- a/scripts/kei-mcp-wire-claude.sh
+++ b/scripts/kei-mcp-wire-claude.sh
@@ -31,12 +31,15 @@ if [ "${KEI_WIRE_CHECK:-0}" = "1" ] || [ "${KEI_WIRE_DRY_RUN:-0}" = "1" ]; then
    "mcpServers": {
      "kei-mcp": {
        "command": "$BIN",
-        "env": { "CLAUDECODE": "1" }
+        "env": {}
      }
    }
  }

-  (CLAUDECODE=1 tells kei-mcp to skip its hook chain — your native hooks
-   already fire on PreToolUse. Avoids double-enforcement.)
+  (v0.46: the CLAUDECODE/GROKCODE env-skip was removed; the chain now
+   always runs. Native PreToolUse hooks match tool_name='Bash'/'Edit'/
+   'Write', but the MCP tools are named kei_bash/kei_edit/kei_write, so
+   native hooks never fire on them and there is no double-enforcement
+   to avoid. The empty env block is kept so operators can add their own vars.)
 EOF
 fi
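
Review note on the rationale above: the name mismatch can be shown with a toy model. This sketch assumes a native hook matcher fires only on an exact, case-sensitive tool-name match; that is a simplification for illustration, and `native_hook_fires` is a hypothetical function, not an API of any CLI or of kei-mcp.

```rust
/// Hypothetical model of a native PreToolUse matcher: it fires only when
/// the tool name equals one of the configured names exactly.
fn native_hook_fires(tool_name: &str) -> bool {
    const MATCHED: [&str; 3] = ["Bash", "Edit", "Write"];
    MATCHED.iter().any(|m| *m == tool_name)
}

fn main() {
    // Built-in tools are covered by the native hooks.
    for name in ["Bash", "Edit", "Write"] {
        assert!(native_hook_fires(name));
    }
    // The MCP tool names never match, so skipping the kei-mcp chain
    // would leave these calls with no enforcement at all.
    for name in ["kei_bash", "kei_edit", "kei_write"] {
        assert!(!native_hook_fires(name));
    }
    println!("ok");
}
```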
--- a/scripts/kei-mcp-wire-grok.sh
+++ b/scripts/kei-mcp-wire-grok.sh
@@ -46,7 +46,7 @@ if [ -n "$KEI_MCP_BIN" ] && [ -x "$KEI_MCP_BIN" ]; then
  "mcpServers": {
    "kei-mcp": {
      "command": "$KEI_MCP_BIN",
-      "env": { "GROKCODE": "1" }
+      "env": {}
    }
  }
 }
@@ -73,5 +73,5 @@ mv "$tmp" "$CFG"

 echo "  grok: wired PreToolUse hooks → $CFG"
 echo "         5 hook entries (Bash×3 + Edit×2 + Write×2)"
-[ -n "$mcp_block" ] && echo "         kei-mcp MCP server registered (with GROKCODE=1 guard)"
+[ -n "$mcp_block" ] && echo "         kei-mcp MCP server registered (v0.46: chain always runs, no env-skip)"
 echo "         Same enforcement as Claude Code."