grill-me — When AI Interviews You Before Writing Code

grill-me is a tiny Claude Code / Codex / agent-agnostic skill by Matt Pocock that flips the default agent dynamic: instead of you describing a feature and the agent immediately writing code, the agent interviews you — one question at a time, walking down each branch of the decision tree — until you’ve thought through every dependency. Matt’s repo mattpocock/skills is at about 82k stars (May 15, 2026), and grill-me is the most-cited piece. Matt himself calls it “the most flexible skill I’ve ever created” — he uses it for non-coding work too, like figuring out which course to build next.

The entire skill, verbatim

The skill is famously small — about seven lines of prose plus YAML frontmatter. Here it is verbatim from the public SKILL.md:

---
name: grill-me
description: Interview the user relentlessly about a plan or design
  until reaching shared understanding, resolving each branch of the
  decision tree. Use when user wants to stress-test a plan, get
  grilled on their design, or mentions "grill me".
---

Interview me relentlessly about every aspect of this plan until we
reach a shared understanding. Walk down each branch of the design
tree, resolving dependencies between decisions one-by-one. For each
question, provide your recommended answer.

Ask the questions one at a time.

If a question can be answered by exploring the codebase, explore the
codebase instead.

Three paragraphs. That’s it. The skill produces a substantively different agent behavior than any “let me think through this with you” prompt because each directive does specific work:

Directive	What it changes
“Interview me relentlessly”	Sets the agent’s role from helper to interrogator. No early agreement.
“Walk down each branch of the design tree”	Structures the conversation around dependencies, not the agent’s free-form curiosity
“Provide your recommended answer”	Stops the conversation devolving into pure Socratic loop; the agent supplies its best guess so you can react, not invent from scratch
“Ask the questions one at a time”	Prevents the seven-questions-in-one-message wall that paralyzes most planning sessions
“If a question can be answered by exploring the codebase, explore”	Cuts trivial lookups so the human time goes to the real decisions

Why this one went viral

Out of 18 promoted skills in Matt Pocock’s repo (more if you count personal/in-progress entries — third-party indices show ~34 entries in the full tree), grill-me is the breakout. Three reasons in my reading:

It inverts the painful default. The default behavior of every coding agent is eager helpfulness — produce something fast, ask few questions. That looks productive and is responsible for a lot of agent-generated code that gets thrown away because it implemented the wrong thing. grill-me sells “slowness now, quality later” as a feature.
It modernizes a familiar practice. Matt’s framing: “Before AI came along, devs called this rubber ducking - talking through your idea until you figured out all the permutations.” Rubber ducking is older than most engineers. grill-me is rubber ducking with an opinionated duck.
The “recommended answer” trick is doing a lot of work. Most planning conversations stall when the human can’t answer a question without more context. grill-me requires the agent to guess first — which immediately gives the human something to react to. “No, not that, because…” is faster than “Hmm, I haven’t decided yet.”

What a session looks like

Matt reports that grilling sessions often last about 45 minutes. He doesn’t publish a phase breakdown, so the sketch below is illustrative (my synthesis based on running it myself), not a Matt-Pocock-prescribed structure:

Phase 1   Agent reads the request + skims any relevant code
          → composes its first question

Phase 2   Foundational branches: data model, ownership, lifecycle,
          error handling, who-calls-who. Agent proposes a recommended
          answer, you correct.

Phase 3   Edge cases: race conditions, retries, partial failures,
          auth boundaries, observability. Most surprises live here.

Phase 4   Synthesis: agent provides a summary of the plan with your
          decisions baked in. You either approve or send it back for
          one more pass.

If the session is shorter, the problem was probably already well-defined. If it stretches well past an hour, the agent is doing work that should happen later — pause, run the code on a piece, then resume.

Beyond coding — the part most reviews miss

The skill’s most interesting claim — Matt’s own — is that grill-me isn’t really about code. From his blog: “It’s also great for non-coding use cases. I’ve used it for figuring out what course to build next…” He calls it “the most flexible skill I’ve ever created.”

Concrete non-coding applications:

Use case	What you’re grilling
Course / syllabus design (Matt’s own example)	What students already know, what they should know by week 8, what the assessments measure, which weeks can be cut if behind schedule
Conference talk outline	Who’s the audience, what’s the one thing they should remember, what’s the demo, what gets cut if you run long
Research-paper framing	What’s the actual contribution vs. claimed contribution, who’s the strongest critic, where’s the threats-to-validity section weakest
Workshop / lab plan	Pre-reqs, deliverable per hour, the “if anyone is stuck” fork, the assessment rubric
Migration / refactor scoping	What can break, what’s not in scope, what’s the rollback, what’s the proof-of-success check

The non-coding angle is why this skill is worth understanding even for faculty / non-engineers — it’s a structured-planning tool that happens to ride inside an agent shell.

Important framing note: Matt’s repo README explicitly describes the skills as small, composable, and agent-agnostic — they work with Claude Code, Codex, and other AI agents, not just Anthropic’s stack. Don’t let “Claude Code skill” become a barrier if your colleague is on Codex or another harness.

Install

# From within your project (or globally in your home directory):
npx skills@latest add mattpocock/skills

# Or copy just grill-me into ~/.claude/skills/:
git clone https://github.com/mattpocock/skills /tmp/mp-skills
cp -r /tmp/mp-skills/skills/productivity/grill-me ~/.claude/skills/

Then in any agent session: type “grill me” before describing what you want to build (or design, or plan). The agent will switch into interview mode.

The community ecosystem around this skill

Because the skill is tiny + MIT-licensed, forks and variants have proliferated. A non-exhaustive map:

Variant	Distinctive twist
`mattpocock/skills` (original) — the canonical upstream version	Includes sister skill `/grill-with-docs`
`RobMitt/grill-me-skill`	Marketed for Claude Code / Cowork
`Jekudy/grillme-skill`	“Socratic deep interview … structured questioning waves” — more explicit phase structure
`vclimenco/claude-code-skills`	Mirror with light edits

Pick the original unless a variant pitches a specific twist you need.

Limitations and honest caveats

45 minutes is real. This is not a 2-minute time-saver. Budget the time, or don’t use it for trivial requests.
The agent’s “recommended answer” can anchor you wrong. When the agent proposes an answer you’re not sure about, treat it as a strawman to react to, not a default to accept. Anchoring is a real risk.
My own caveat (not Matt’s): I find grill-me works best when you already have a rough shape of the idea. Matt’s blog suggests it works whether your idea is fully formed or very vague — try both ends and see what fits your style.
Doesn’t replace documentation. grill-me’s sister skill /grill-with-docs is the one that updates CONTEXT.md (which the SKILL.md describes explicitly as “a glossary and nothing else”) and creates ADRs only when decisions are hard to reverse, surprising without context, and involve genuine trade-offs. Plain /grill-me is conversational; the decisions live in your chat history unless you save them out.

How LearnAI Team Could Use This

Course-redesign workshops — assign faculty to run grill-me on one of their existing syllabi. Output: a list of design decisions they hadn’t explicitly thought through. Time: roughly an hour per faculty. Use as the prep artifact for a sit-down planning meeting, not a replacement for it.
Student research-proposal grilling — before a student submits a CS-336 (Program Analysis for Security) project proposal, require they run grill-me on it. Submit the transcript with the proposal. Tells you in 30 seconds whether they actually thought through the problem or just picked a topic.
LearnAI’s own internal projects — when scoping any new tool / curriculum module, run grill-me on the proposal. Cheap, fast, and the recommended-answer mechanic surfaces decisions the team is implicitly making.
Faculty onboarding — pair grill-me with the non-coding-skills-claude-code entry as the “AI doesn’t just write code” demonstration. The non-coding angle is what gets non-engineer colleagues to actually try an agent shell.

Real-World Use Cases

Scenario	Description
Pre-coding planning for a new feature	Run grill-me before describing the feature. Surfaces decisions that would otherwise show up as “you didn’t ask about X” surprises later
Stress-testing a research direction	Use it to identify the strongest objection to your hypothesis before you’ve spent months proving it
Course / syllabus design (Matt’s own example)	“What course should I build next?” — exactly the question grill-me was used for by its creator
Conference talk scoping	The “if you run long, what gets cut?” question alone often reshapes the talk
Onboarding documentation — grill yourself about your codebase	The transcript becomes the on-call runbook you never had time to write
Pre-mortem on a migration	Force the “what breaks?” question to be answered branch-by-branch, not waved off

Important things to know

This is widely considered the highest-leverage skill in Matt’s repo. If you only install one of his skills, install this one.
Pair with /grill-with-docs if you want decisions captured automatically. /grill-me alone is conversational; /grill-with-docs updates CONTEXT.md (glossary) and writes ADRs for hard-to-reverse / surprising / trade-off decisions.
The “recommended answer” rule is the secret sauce. Without it, the skill is just structured rubber-ducking. With it, the conversation has momentum.
Agent-agnostic: works in Claude Code, Codex, and other agent harnesses — not Claude Code-only.
The skill produces a summary at the end (per Matt’s blog) — the agent restates the plan with your decisions baked in. Save that summary; it’s the artifact you’ll actually use.
Companion deep-dives in this wiki:
- Matt Pocock’s Skills — Claude Code for Real Engineers — the parent repo entry, full skill list
- Karpathy Skills — Four Rules That Fix LLM Coding’s Worst Habits — adjacent philosophy on small/composable skills
- Anthropic Skills — Thariq’s Framework for Reusable Capabilities — the broader skills movement
- Claude Code · CLAUDE.md Practices — where grilling-output should land (CLAUDE.md / CONTEXT.md / ADRs)
- Domain Experts Shipping Products with Claude Code — why the non-coding use case matters for faculty