Skill Creator

Transform general-purpose agents into domain specialists by synthesizing knowledge from authoritative books, web research, and reusable resources.

Core Philosophy

Context window is a public good — Agent is already smart. Only add information it lacks. Challenge every paragraph's token cost.
Synthesize, never summarize — Extract frameworks, decision trees, actionable patterns. Never copy verbatim.
Progressive disclosure — SKILL.md under 200 lines. Deep knowledge in references/, deterministic code in scripts/, templates in assets/.
Match freedom to fragility — Text instructions for flexible tasks, pseudocode for preferred patterns, exact scripts for fragile operations.
Understand before building — Collect concrete examples before research. Skills from real examples outperform skills from abstract knowledge.
Save research to disk — Skill creation generates massive context. Save to /tmp/{skill-name}-research/ immediately.
Write your own work — Read existing skills freely to learn what works (structure, length, patterns). But always write your own content from your own research. Never copy-paste text from other skills.
Explain the why — Reasoning beats rigid rules. LLMs understand intent; MUST/NEVER in all caps is a yellow flag. Reframe as reasoning.

Skill Anatomy

skill-name/
├── SKILL.md              # Entry point: frontmatter + workflow + file index
├── skills.json           # Permissions: filesystem, network, subprocess
├── references/           # Docs loaded on demand (250-450 lines each)
├── scripts/              # Executable code for deterministic tasks
└── assets/               # Templates, images (not loaded into context)

Quick-Start: Common Problems

"Skill doesn't activate"

Check description field — must include trigger phrases + scenarios
Add 10-15 trigger phrases covering user phrasings
Include specific file types, tools, tasks in description
The body loads AFTER triggering — "When to Use" sections are wasted
Make description slightly "pushy" — Claude tends to under-trigger
Test with references/evaluation-workflow.md trigger eval queries

"Agent gives generic advice"

Reference files need more specific, actionable content
Add decision trees, frameworks, concrete tables
Replace prose with step-by-step procedures
Verify agent is loading the right reference file

"Reference files wrong length"

Target 250-450 lines per file
Split large files, merge small ones
Each file covers distinct subtopic — no overlap

Workflow

Phase 1: Scope and Research

Define scope — Ask: domain, tasks, triggers, style reference
Study ecosystem — Browse skills.sh leaderboard. Read top skills to learn what works — structure, triggers, file layout. Use them as inspiration but write everything yourself from your own research.
- anthropics/skills (frontend-design, skill-creator)
- vercel-labs/agent-skills (react, web-design)
- obra/superpowers (debugging, tdd, planning)
Collect examples — Concrete queries skill should handle
Research books — 6-10 authoritative books (2018+)
Acquire books — Purchase or access through library/publisher
Extract frameworks — Use look_at on books. Save to /tmp/
Quality gate — 4+ books extracted before Phase 2

Phase 2: Plan Structure

Plan reference files — 5-9 files, each from 2+ books
Analyze resource types — Script vs reference vs asset

Phase 3: Write Content

Write reference files — Parallel agents, books as primary source
Verify — Format, length, sources, no overlap

Phase 3.5: Evaluate & Iterate

Create test cases — 3-5 realistic prompts + assertions
Run with/without skill — Baseline comparison via delegation
Grade results — Use references/quality-agents.md grader
Human review — Get feedback on output quality
Improve and repeat — Fix issues, rerun until satisfied
Optimize description — Test trigger accuracy with eval queries
Validate — Run scripts/quick_validate.py

See references/evaluation-workflow.md for detailed procedures.

Phase 4: Deploy

Write SKILL.md — Under 200 lines
Create skills.json — Minimal permissions
Git commit and push

See references/research-workflow.md for detailed procedures.

Decision Trees

Resource Type Selection

Content	Type	Loaded?
Domain knowledge, frameworks	`references/`	Yes, on demand
Deterministic code, automation	`scripts/`	No, executed
Templates, images, boilerplate	`assets/`	No, output
Core workflow, triggers	SKILL.md	Yes, on activate

SKILL.md Structure

Skill Type	Pattern	Example
Sequential processes	Workflow-based	Editor: Create → Edit → Export
Multiple operations	Task-based	PDF: Merge, Split, Extract
Standards/rules	Guidelines	Brand: Colors, Typography, Voice
Interrelated features	Capabilities	PM: Context, Updates, Comms

Instruction Freedom Level

Signal	Freedom	Format
Multiple valid approaches	High	Text instructions
Preferred pattern + variation	Medium	Pseudocode
Fragile operation, exact sequence	Low	Specific script

Tank-Skill Patterns

Observed in 20 skills:

Pattern	Lines	Example Skills
Best practices	47-115	react, typescript, python, clean-code
Methodology	143-291	systematic-debugging, planning, tdd
Operational	137-382	playwright, security-review, gmail
Integration	100-250	notion, slack, google-calendar

skills.json Permissions

{
  "network": [],
  "filesystem": {
    "read": ["**/*"],
    "write": []
  },
  "subprocess": false
}

Only add permissions when actually needed.

Reference Files

File	Contents
`references/skill-design-patterns.md`	Context window principle, degrees of freedom, progressive disclosure, resource types, SKILL.md patterns, what NOT to include
`references/research-workflow.md`	Scope definition, ecosystem study, book research, framework extraction, scratch file protocol, parallel writing delegation
`references/writing-conventions.md`	Frontmatter rules, body structure, reference file format, quality standards, instruction philosophy, completion checklist
`references/ecosystem-patterns.md`	skills.sh leaderboard analysis, top publisher patterns, tank-skills conventions, supported agents
`references/evaluation-workflow.md`	Test case creation, grading, iteration loop, description optimization, evaluation schemas
`references/quality-agents.md`	Grader and blind comparator agent prompts, delegation templates, output schemas

@tank/skill-creator

Description

Triggered by

Skill Creator

Core Philosophy

Skill Anatomy

Quick-Start: Common Problems

"Skill doesn't activate"

"Agent gives generic advice"

"Reference files wrong length"

Workflow

Phase 1: Scope and Research

Phase 2: Plan Structure

Phase 3: Write Content

Phase 3.5: Evaluate & Iterate

Phase 4: Deploy

Decision Trees

Resource Type Selection

SKILL.md Structure

Instruction Freedom Level

Tank-Skill Patterns

skills.json Permissions

Reference Files

@tank/skill-creator

Description

Triggered by

Skill Creator

Core Philosophy

Skill Anatomy

Quick-Start: Common Problems

"Skill doesn't activate"

"Agent gives generic advice"

"Reference files wrong length"

Workflow

Phase 1: Scope and Research

Phase 2: Plan Structure

Phase 3: Write Content

Phase 3.5: Evaluate & Iterate

Phase 4: Deploy

Decision Trees

Resource Type Selection

SKILL.md Structure

Instruction Freedom Level

Tank-Skill Patterns

skills.json Permissions

Reference Files

Command Palette