Ecosystem Update - 2026-05-16
Highlights
- One safe harness Quick Win was implemented: pruned 134 nonexistent temp worktree trust entries from
~/.codex/config.toml;codex-runtime-doctornow reportserrors=0 warnings=0 - Current stable Codex is still
0.130.0;0.131.0-alpha.22is available only on the alpha channel, so no CLI upgrade was applied - Today’s strongest research signal is supply-chain and least-privilege pressure around agent skills, hooks, and tool authorization
Quick Wins (implemented today)
-
Stale temp trusted project prune Codex-mdRemove nonexistent
/private/.../Ttemp worktree entries from~/.codex/config.tomlwhile preserving real trusted project roots
New Tools, Skills & Patterns
-
Runtime doctor stale-project autofix Codex-mdOpenAI config reference, Claude toolkit config-health pattern - Add a bounded
codex-runtime-doctor --fix-stale-projectsor companion subcommand that previews and removes nonexistent temp project trust entries, with backups and TOML validation -
UserPromptSubmit secret-scan design hookOpenAI hooks guide - Codex supports
UserPromptSubmit, but prompt text is sensitive and global prompt telemetry is opt-in; design a no-log, block-only local scanner before wiring any hook -
PermissionRequest profile evaluation hookOpenAI hooks guide, OpenAI config reference - Useful for conservative/review profiles, but the current power-user default is
approval_policy = "never", so this needs profile-specific testing rather than global wiring -
MCP/skill registry audit intake mcp/skillrohitg00 toolkit ecosystem entries, awesome-claude-code - Sources such as TokRepo, Clarvia, and Not Human Search may improve discovery, but should feed an audit/intake report, not automatic MCP installation
-
SWE-Cycle eval adapterSWE-Cycle - Add a task-eval scenario that covers issue intake, environment setup, edit, verification, and closeout, matching the existing autonomy harness rather than only final patch success
Research Worth Reading
-
Do Coding Agents Understand Least-Privilege Authorization?- Directly relevant to permission profiles, subagent sandboxing, and default authority boundaries
-
Exploiting LLM Agent Supply Chains via Payload-less Skills- Reinforces strict outside-skill audit and argues against wholesale skill/plugin imports
-
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections- Relevant to Browser/Computer Use sessions and any web-exposed MCP workflow
-
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search- Supports the existing
rg-first posture and suggests benchmarking how tool output presentation affects retrieval quality -
GroupMemBench
-
Latency-Quality Routing for Functionally Equivalent Tools in LLM Agents- Candidate pattern for future MCP/tool-provider routing, likely via autoconfig rather than a new router
Considered, Not Adopting
Items reviewed and explicitly declined this cycle, with the reason. Curation discipline matters more than coverage.
-
Upgrade to
0.131.0-alpha.22— - rejected as an automatic Quick Win because npmlatestremains0.130.0; alpha release watch stays in the queue -
Enable
features.plugin_hooksblindly — - rejected because plugin-bundled hooks are opt-in and require trust review before execution -
Add Claude
PostCompacthook parity — - rejected for automatic implementation because Codex currently hasPreCompactlocally; no supported CodexPostCompactevent is present in the official hooks guide - Wholesale import from Claude toolkits or skill catalogs — - rejected because Codex-owned surfaces must not depend on Claude-owned layouts or unaudited outside skills
-
Enable
prevent_idle_sleep— - rejected as a Quick Win because it is experimental and can affect machine power behavior -
Prompt logging through
UserPromptSubmit— - rejected because global prompt telemetry is opt-in and the local policy forbids logging user prompts by default