Tutorial#ai-agents#claude-code#skills#codex#developer-tools

Codex Skills: Sam Altman's Framework (2026)

Codex Skills: Sam Altman says most Codex failures are context problems, not model problems. What Skills are and how to write your first one.

Jason ZhouJune 9, 20265 min read

Course outline · Claude Code Fundamentals (2.15)

Sam Altman posted something on June 6, 2026 that hit 194K views and 3,200+ likes:

"Whenever I don't use Codex for a task, I ask myself why and usually realize that there's some missing context, I needed to write a skill, or I just didn't think to use it. Rarely is it because the task is outside of the capabilities of the model. Overhang right now feels large."

The word "skill" is the key. Altman isn't talking about the AI getting smarter. He's talking about a specific workflow: when Codex fails, the fix is usually writing a skill - a structured knowledge package that gives the agent the context it needs to succeed.

This is the Codex Skills framework. Here's what it is and how to use it.

What Are Codex Skills?

A Codex Skill is a structured markdown file that tells an AI coding agent how to work with a specific tool, API, codebase, or workflow. It's the knowledge layer that sits between the model's general training and the specific context of your project.

Altman's point is that most of Codex's apparent limitations aren't model limitations - they're context limitations. The model knows how to code. It doesn't know your specific deployment process, your team's conventions, which API endpoints are deprecated, or how to authenticate with your internal services.

A skill encodes all of that.

The Agent Skills Standard

Codex Skills are part of the broader Agent Skills open standard. Originally developed by Anthropic and released as an open standard under Apache 2.0 in early 2026, it has been adopted across all major AI coding agents: Claude Code, Cursor, GitHub Copilot, Gemini CLI, OpenAI Codex, Windsurf, Aider, and 30+ others.

The standard is gaining rapid adoption. The awesome-agent-skills community library has cataloged over 1,200 skills compatible with 16+ different agents. GitHub trending regularly shows multiple Skills repositories:

mvanhorn/last30days-skill - 34,700+ stars, real-time research across Reddit, X, YouTube, and HackerNews
google/skills - 12,200+ stars, official Google Cloud infrastructure skills
VoltAgent/awesome-agent-skills - 24,600+ stars, curated directory with official skills from Anthropic, Google, Vercel, Stripe, Cloudflare, and more

This isn't a coincidence. Skills have become the primary mechanism for extending what AI agents can do without waiting for a better model.

The Three Reasons Codex Fails (Altman's Framework)

Altman's tweet identified three reasons an agent task fails:

1. Missing context

The agent doesn't know something it needs to know. Your API structure, your environment variables, your deployment target. This is fixed by documentation in the project or a context skill.

2. No skill for this

The task requires knowledge that isn't encoded anywhere the agent can access. How does your CI/CD pipeline work? What's the internal process for getting code reviewed? What are the gotchas with your database migration tool? This is fixed by writing a skill.

3. Didn't think to use it

You defaulted to manual work out of habit. This is the "overhang" Altman mentions - the gap between what Codex can actually do and what people are actually using it for. OpenAI estimates that even if models were frozen at current capability, the untapped economic value is enormous.

Of these three, #2 is the most actionable. Writing skills is the primary lever for expanding what your AI agent can reliably accomplish.

How Agent Skills Work: Progressive Disclosure

The Agent Skills standard uses a three-stage loading model that keeps context lean:

Discovery: At startup, agents load only the name and description of each available skill - just enough to know when it might be relevant.
Activation: When a task matches a skill's description, the agent reads the full SKILL.md instructions into context.
Execution: The agent follows the instructions, optionally executing bundled scripts or loading referenced files as needed.

This means you can have dozens of skills installed without bloating every session's context window. The agent only loads what it needs.

Claude Code 101 · Rebuilt June 2026

You've read the theory. The course is where you ship.

3 guided Labs - a live website, a full-stack app with Stripe payments, and a business automation with measured 10x token savings - plus the Template Vault: CLAUDE.md templates, 9 skills, 5 subagent definitions, a hooks pack, and the security audit prompt.

Start shipping with Claude Code

How to Write a Codex Skill

A Skill is a folder containing at minimum a SKILL.md file. The YAML frontmatter tells the agent when to use it; the markdown body tells it what to do:

code

---
name: deploy-to-production
description: "How to deploy this project to production. Use when the user asks to deploy, push to prod, or release a new version."
---

# Deploy to Production

## Prerequisites
- AWS credentials configured via `aws configure`
- Docker running locally
- Environment: `export ENV=production`

## Steps
1. Run tests: `npm test`
2. Build Docker image: `docker build -t app:latest .`
3. Push to ECR: `docker push [ECR_URL]/app:latest`
4. Deploy: `aws ecs update-service --service app-prod`

## Common errors
- "No space left": clear Docker cache with `docker system prune`
- "Credentials expired": run `aws sso login --profile prod`

The description field is critical. It's how the agent decides whether to load this skill. Include trigger phrases - the exact words someone would use when they need this workflow.

Once installed, your agent can deploy your application without you spelling out the steps every session.

Installing and Managing Skills

Skills work across every major agent host:

code

# Claude Code (marketplace - auto-updates)
/plugin marketplace add [skill-name]

# Codex, Cursor, Copilot, Gemini CLI, and 30+ others
npx skills add [skill-name]

# Install globally (available across all projects)
npx skills add [skill-name] -g

# Update
npx skills update [skill-name]

For project-specific skills you write yourself, place the folder in .claude/skills/ (Claude Code), .cursor/skills/ (Cursor), or .agents/skills/ (cross-platform standard).

The Skill Categories That Matter Most

Based on what's working in production teams right now, the highest-value skills to write or install:

Category	Why It Matters	Examples
Infrastructure	Most common reason agents fail on real tasks	Deployment processes, CI/CD pipelines, environment configs
Codebase context	Institutional knowledge usually in people's heads	Architecture decisions, naming conventions, deprecated patterns
API integration	Saves enormous time on integration tasks	Auth flows, rate limits, endpoints that actually work vs docs
Workflow	Soft conventions agents otherwise guess at	Code review process, commit message format, ticket structure

Why "Overhang Feels Large"

Altman's observation that "overhang feels large" aligns with what he told interviewers more broadly: even if model capabilities were frozen at GPT-5.2 and Codex today - no more improvements, no GPT-6 - there is still enormous untapped economic value. The models can already do far more than people are using them for.

The model is not the bottleneck. The knowledge encoding is.

This creates a temporary arbitrage: if you figure out how to actually use these models before your competitors and colleagues do, you have an edge. But if you're waiting for AI to "get smart enough" before you change your workflow, you might be waiting for something that's already here.

What This Means for Your Workflow

Altman's framework suggests a simple diagnostic: the next time your AI agent fails at something, ask why. Was it a model limitation - the task was genuinely beyond its capabilities? Or was it a context limitation - it didn't know something it needed to know?

For most teams, the honest answer is mostly context. The fix is a skill.

Start here: Think about the last 5 times you had to explain something to your agent that you've explained before. Each of those is a skill waiting to be written. Write those five skills, install them, and see how much of your "the AI can't do this" friction disappears.

That's the overhang Altman is talking about. It's yours to capture.

Frequently Asked Questions

What are Codex Skills?

Codex Skills are structured markdown files that give AI coding agents specific context about your tools, APIs, codebase, and workflows. They follow the open Agent Skills standard (agentskills.io) and work across Claude Code, Codex, Cursor, GitHub Copilot, Gemini CLI, and 30+ other agent hosts.

What did Sam Altman say about Codex Skills?

On June 6, 2026, Altman posted that when Codex fails, it is almost never a model limitation. The failures come from missing context, needing to write a skill, or simply not thinking to use the agent. He called the current capability overhang "large."

How do I install Agent Skills?

In Claude Code, use /plugin marketplace add [skill-name]. For Codex, Cursor, Copilot, and other hosts, use npx skills add [skill-name]. For project-specific skills, create a SKILL.md file in your .claude/skills/ or .agents/skills/ directory.

What is the capability overhang Altman mentioned?

The capability overhang is the gap between what AI models can actually do and what people are using them for. Altman argues that even if model development stopped today, there is enormous untapped value because most users have not encoded enough context for the models to work effectively on their specific tasks.

What types of Skills should I write first?

Start with infrastructure skills (deployment processes, CI/CD), then codebase context skills (architecture decisions, naming conventions), then API integration skills (authentication flows, endpoint details), and finally workflow skills (code review processes, commit message formats).

Sources & Verification

This guide is written from hands-on testing, then cross-checked against primary sources - official documentation and first-party announcements. Field results and opinions are labeled as such. See our editorial standards.

Join AI Builder Club

✓65+ lessons, 22+ workshops

✓350+ plug-and-play prompts & skills

✓Weekly live builder workshop

✓Premium tools (e.g. 10xCoder, AI tutor)

✓AI Builder Pack ($5,000+ in exclusive AI credits & perks)

1k+

Join 1,000+ builders already inside

Start shipping →30-day money-back · Cancel anytime

$37/mo

Live workshop

Get the free newsletter

Weekly deep-dives on AI tools, automation workflows, and builder strategies. Join 5,000+ readers.

No spam. Unsubscribe anytime.

Continue Learning

Mastering AI Agents

The builder's deep dive into agent loops, tools, context engineering & memory — from using AI to building it.

AI Agent 101

Build autonomous research agents with tool use, API access, web scraping, and deep search.

Claude Code 101

You've read the theory. The course is where you ship: 3 guided Labs (live website, full-stack app with payments, business automation) plus the Template Vault starter kit. Rebuilt June 2026.

← Back to Blog