| name | codex-auto |
| description | Use Codex CLI Full Auto for unattended, sandboxed prototyping and refactors. |
| allowed-tools | Read, Write, Edit, Bash, Glob, Grep, Task, TodoWrite |
| model | sonnet |
| x-version | 3.2.0 |
| x-category | platforms |
| x-vcl-compliance | v3.1.1 |
| x-cognitive-frames | HON, MOR, COM, CLS, EVD, ASP, SPC |
Purpose
Let Codex run autonomously in a sandbox to scaffold, refactor, and validate code without approvals.
Trigger Conditions
- Use this skill when: Need overnight builds, unattended refactors, or autonomous scaffolding in a sandbox.
- Reroute when: If work needs human-in-loop or policy-heavy guardrails, use codex-reasoning instead.
Guardrails (Inherited from Skill-Forge + Prompt-Architect)
- Structure-first: every platform skill keeps
SKILL.md,examples/, andtests/populated; createresources/andreferences/as needed. Log any missing artifact and fill a placeholder before proceeding. - Confidence ceilings are mandatory in outputs: inference/report 0.70, research 0.85, observation/definition 0.95. State as
Confidence: X.XX (ceiling: TYPE Y.YY). - English-only user-facing text; keep VCL markers internal. Do not leak internal notation.
- Adversarial validation is required before sign-off: boundary, failure, and COV checks with notes.
- MCP tagging for runs:
WHO=codex-auto-{session},WHY=skill-execution, namespaceskills/platforms/codex-auto/{project}.
Execution Framework
- Intent & Constraints — clarify task goal, inputs, success criteria, and risk limits; extract hard/soft/inferred constraints explicitly.
- Plan & Docs — outline steps, needed examples/tests, and data contracts; confirm platform-specific policies.
- Build & Optimize — apply platform playbook below; keep iterative checkpoints and diffs.
- Validate — run adversarial tests, measure KPIs, and record evidence with ceilings.
- Deliver & Hand off — summarize decisions, artifacts, and next actions; capture learnings for reuse.
Platform Playbook
- Workflow patterns:
- Kick off full-auto scaffolds with explicit acceptance tests
- Queue unattended refactors guarded by linters and smoke tests
- Collect execution logs and diffs for review on return
- Anti-patterns to avoid: Running against production directories, Allowing network access during full-auto runs, Skipping diff review before merge
- Example executions:
codex --full-auto Scaffold CRUD API with testscodex --full-auto Refactor to TS strict mode and fix type errors
Documentation & Artifacts
SKILL.md(this file) is canonical; keep quick-reference notes inREADME.mdif present.examples/should hold runnable or narrative examples;tests/should include validation steps or checklists.resources/stores helper scripts/templates;references/stores background links or research.- Update
metadata.jsonversion if behavior meaningfully changes.
Verification Checklist
- Trigger matched and reroute considered
- Examples/tests present or stubbed with TODOs
- Constraints captured and confidence ceiling stated
- Validation evidence captured (boundary, failure, COV)
- MCP tags applied for this run
Confidence: 0.70 (ceiling: inference 0.70) - Standardized platform skill rewrite aligned with skill-forge + prompt-architect guardrails.