name	skill-reviewer
description	Evaluates skill effectiveness, suggests improvements to instructions, identifies missing edge cases, and recommends structure changes.

Skill Reviewer

Purpose

The Skill Reviewer provides comprehensive quality assurance and improvement recommendations for Claude Code skills. It evaluates skills against official Anthropic guidelines, identifies clarity issues, detects missing edge cases, and suggests structural improvements. Use this skill to validate skill quality, identify gaps, and ensure skills meet professional standards before distribution.

When to Use

Evaluating New Skills: Review skills before adding to project or distributing
Improving Existing Skills: Identify gaps and enhancement opportunities
Quality Assurance: Ensure skills meet Anthropic guidelines
Documentation Audit: Check for clarity, completeness, and accessibility
Structure Validation: Verify proper organization and naming conventions
Edge Case Testing: Identify scenarios the skill doesn't handle
Pre-Release Review: Final validation before skill distribution
Team Standards: Ensure consistency across skill libraries

Capabilities

Metadata Validation: Check YAML frontmatter, name conventions, description completeness
Documentation Audit: Evaluate clarity, completeness, and user accessibility
Structure Assessment: Verify directory organization, file hierarchy, and naming patterns
Functionality Coverage: Identify missing capabilities and edge cases
Instruction Quality: Analyze step-by-step guides, examples, and usage patterns
Guideline Compliance: Verify adherence to Anthropic skill standards
Cross-Reference Validation: Check links to related skills and documentation
Risk Identification: Spot potential issues, confusion points, and improvement areas
Score Generation: Provide overall skill quality score (0-100) with grade (A-F)
Improvement Roadmap: Generate prioritized recommendations for enhancements

Evaluation Framework

1. Metadata Quality (20 points)

YAML Frontmatter

✅ Valid YAML syntax
✅ Required fields present: name, description
✅ Consistent naming (kebab-case, lowercase)
✅ Description is one concise sentence

Naming Conventions

✅ Directory name matches skill name
✅ Descriptive verb-based names (not "helper", "tool", "manager")
✅ Clear intent without jargon
✅ Consistent with project patterns

2. Documentation Quality (25 points)

Structure & Organization

✅ Clear heading hierarchy (H1 → H2 → H3)
✅ Logical content flow
✅ Proper use of formatting (bold, code, lists)
✅ Consistent voice and tone

Content Completeness

✅ Purpose section present and clear
✅ "When to Use" section with scenarios
✅ Capabilities listed with specifics
✅ Usage examples with code snippets
✅ Best practices documented
✅ Troubleshooting section for common issues

Clarity & Accessibility

✅ Jargon-free or well-explained
✅ Active voice preferred
✅ Concrete examples over abstract concepts
✅ Appropriate detail level for audience

3. Structure & Organization (20 points)

File Organization

✅ SKILL.md in root of skill directory
✅ scripts/ subdirectory for automation (lowercase, if needed)
✅ references/ subdirectory for detailed docs (lowercase, if needed)
✅ assets/ subdirectory for templates/boilerplate (lowercase, if needed)
✅ README.md for development notes (excluded from distribution)

Directory Conventions

✅ Lowercase directory names
✅ No nested subdirectories in references/
✅ Single-level organization
✅ Consistent naming patterns

File Size Constraints

✅ SKILL.md under 500 lines (move excessive detail to references/)
✅ Reference files over 100 lines have table of contents
✅ Executable scripts have proper permissions (chmod +x)

4. Functionality & Coverage (20 points)

Scope & Capabilities

✅ Clear boundaries of what skill does/doesn't do
✅ Realistic capability claims
✅ No overpromising functionality
✅ Dependencies clearly documented

Edge Cases & Error Handling

✅ Anticipated edge cases addressed
✅ Error scenarios handled gracefully
✅ Fallback strategies documented
✅ Limitations explicitly stated

Practical Applicability

✅ Real-world usage scenarios covered
✅ Examples are realistic and actionable
✅ Step-by-step guides are complete
✅ Expected outcomes clearly defined

5. Guideline Compliance (15 points)

Anthropic Standards

✅ Follows official skill guidelines
✅ Proper YAML syntax validation
✅ Description includes "when to use" triggers
✅ No auxiliary docs mixed with skill content

Project Standards

✅ Aligns with project conventions
✅ Consistent with similar skills
✅ Follows naming patterns
✅ Uses established templates

Best Practices

✅ Clear value proposition
✅ Appropriate scope (not too broad)
✅ Progressive content disclosure
✅ Links to related resources

Usage

Basic Review

# User: "Review the react-component-scaffolder skill"
# Skill Reviewer will:
# 1. Read the SKILL.md file
# 2. Evaluate against all criteria
# 3. Generate comprehensive review report
# 4. Provide improvement recommendations
# 5. Output skill-reviewer_report.md

Structure Review

# User: "Check the directory structure of the fastapi-endpoint-scaffolder skill"
# Skill Reviewer will:
# 1. Verify SKILL.md location and format
# 2. Check subdirectory organization
# 3. Validate file naming conventions
# 4. Review file permissions and sizes
# 5. Recommend structural improvements

Edge Case Analysis

# User: "Identify missing edge cases in the component-builder skill"
# Skill Reviewer will:
# 1. Analyze described capabilities
# 2. Brainstorm unhandled scenarios
# 3. Test against common failure modes
# 4. List missing functionality
# 5. Recommend coverage improvements

Compliance Check

# User: "Verify that my-new-skill follows Anthropic guidelines"
# Skill Reviewer will:
# 1. Validate YAML frontmatter
# 2. Check naming conventions
# 3. Verify description completeness
# 4. Review against official standards
# 5. Report compliance status

Examples

Example 1: Quality Review Report

Skill Reviewed: jest-test-scaffolder

Review Output:

# Skill Review: jest-test-scaffolder

## Overall Score: 87/100 (Grade: B+)

### Metadata Quality: 18/20
✅ Valid YAML frontmatter
✅ Clear, concise description
❌ Description could include mention of "Jest" specifically
⚠️  Consider adding example output reference

### Documentation Quality: 23/25
✅ Comprehensive "When to Use" section
✅ Clear capabilities list
❌ Missing troubleshooting section
⚠️  One example could show error handling

### Structure & Organization: 19/20
✅ Proper directory structure
✅ SKILL.md under 500 lines
❌ Reference files lack table of contents
⚠️  Consider adding assets/ directory for templates

### Functionality & Coverage: 19/20
✅ Real-world scenarios covered
✅ Clear step-by-step guide
⚠️  Missing edge case: TypeScript strict mode configuration
⚠️  No mention of coverage threshold testing

### Guideline Compliance: 14/15
✅ Follows Anthropic standards
⚠️  Could strengthen "when to use" language

## Recommendations

### Priority 1 (Critical)
1. Add troubleshooting section for common Jest configuration issues
2. Document edge case: TypeScript strict mode with React Testing Library

### Priority 2 (High)
1. Add table of contents to reference files
2. Include example output in capabilities section
3. Add coverage threshold testing to best practices

### Priority 3 (Nice to Have)
1. Create templates/ directory with example test files
2. Add video walkthrough link
3. Document advanced patterns for custom matchers

## Usage Statistics
- User Feedback: 4.2/5 ⭐
- Effectiveness: High (solving real problems)
- Complexity: Medium

Example 2: Edge Case Analysis

Skill Reviewed: deployment-manager

Edge Cases Identified:

## Missing Edge Case Coverage

### Unhandled Scenarios

1. **Deployment Timeout**: No guidance on stuck deployments
2. **Rollback Failures**: What if rollback fails?
3. **Multi-Region**: Only documents single-region deployment
4. **Authentication Expiry**: No mention of credential refresh during long deployments
5. **Partial Failures**: Handling when some services deploy but others fail
6. **Concurrent Deployments**: What happens if two deployments run simultaneously?
7. **Resource Constraints**: No mention of memory/disk requirements
8. **Network Interruptions**: Handling transient network failures

### Recommended Additions

- Add deployment timeout configuration section
- Document rollback procedures in detail
- Include multi-region deployment guide
- Add environment variable validation step
- Create contingency/recovery procedures
- Add resource requirements documentation

Example 3: Structure Recommendations

Skill Reviewed: component-builder

Structure Improvements:

## Current Structure

.claude/skills/component-builder/ ├── SKILL.md (620 lines - exceeds 500 limit) └── README.md


## Recommended Structure

.claude/skills/component-builder/ ├── SKILL.md (380 lines - main overview) ├── README.md (development notes, not distributed) ├── references/ │ ├── MATERIAL_DESIGN_3_TOKENS.md (detailed token reference) │ ├── COMPONENT_PATTERNS.md (advanced patterns, 150+ lines, includes TOC) │ └── TYPESCRIPT_SETUP.md (TypeScript configuration guide) ├── scripts/ │ ├── generate-component.sh (executable) │ └── validate-m3-tokens.py (executable) └── assets/ ├── component.tsx.template ├── component.test.tsx.template └── component.stories.tsx.template


## Benefits
- SKILL.md within 500-line limit (progressive disclosure)
- Reference files organized by topic
- Reusable templates in assets/
- Automation scripts clearly separated
- Better content navigation

Best Practices

For Skill Developers

Self-Review First: Use this skill before distributing
Iterate Based on Feedback: Implement recommendations progressively
Test Your Examples: Verify all code snippets work
Get Peer Review: Have teammates review critical skills
Version Your Skills: Track changes and improvements

For Skill Maintenance

Regular Audits: Review skills quarterly
User Feedback: Incorporate reports and suggestions
Update Examples: Keep code samples current
Expand Coverage: Add edge cases as they're discovered
Archive Old Skills: Retire obsolete skills clearly

For Skill Distribution

Pre-Release Review: Score ≥80 before distribution
Document Dependencies: List required tools/knowledge
Provide Feedback Mechanism: How users report issues
Create Support Resources: Links to help and troubleshooting
Plan Updates: Version numbering and upgrade path

Scoring Guidelines

Grade Scale

A (90-100): Exceptional - Ready for distribution, exemplary quality
B (80-89): Good - Ready for distribution, minor improvements suggested
C (70-79): Satisfactory - Usable but needs improvements before wider sharing
D (60-69): Poor - Needs significant work before distribution
F (Below 60): Inadequate - Not ready for release, major rework required

Minimum Distribution Standards

✅ Score ≥ 80 (Grade B or higher)
✅ All Priority 1 recommendations addressed
✅ Comprehensive documentation
✅ Tested examples and workflows
✅ Clear scope and limitations

Troubleshooting

Issue: Score Below 80

Solution:

Address all Priority 1 recommendations first
Add missing sections (troubleshooting, edge cases)
Improve documentation clarity
Get peer review for feedback
Iterate and re-review

Issue: Unclear Recommendations

Solution:

Request specific examples
Ask for comparison with high-scoring skill
Discuss recommendations with team
Review Anthropic guidelines for clarification
Request follow-up review after changes

Issue: Scope Too Broad

Solution:

Break into smaller, focused skills
Move advanced topics to references/
Focus on core use case
Create related skills for extensions
Use "Related Skills" to connect them

Related Skills

skill-creator - Create new Claude Code skills
task-delegator - Delegate review tasks to specialized agents
code-reviewer - Review code quality and structure
project-health-checker - Comprehensive project validation
audit-agent - Security and quality audits

Summary

The Skill Reviewer provides systematic, objective evaluation of Claude Code skills across five key dimensions: metadata, documentation, structure, functionality, and guideline compliance. Use it to ensure skills meet professional standards, identify improvement opportunities, and maintain consistent quality across skill libraries.

Quick Checklist for High-Quality Skills:

✅ Score ≥ 80/100 (Grade B or higher)
✅ Clear, concise description with "when to use"
✅ Comprehensive documentation with examples
✅ Proper directory structure and naming
✅ Addressed Priority 1 recommendations
✅ All claims tested and verified
✅ Edge cases documented
✅ Related skills and references linked

skill-reviewer

Install Skill

SKILL.md

Skill Reviewer

Purpose

When to Use

Capabilities

Evaluation Framework

1. Metadata Quality (20 points)

2. Documentation Quality (25 points)

3. Structure & Organization (20 points)

4. Functionality & Coverage (20 points)

5. Guideline Compliance (15 points)

Usage

Basic Review

Structure Review

Edge Case Analysis

Compliance Check

Examples

Example 1: Quality Review Report

Example 2: Edge Case Analysis

Example 3: Structure Recommendations

Best Practices

For Skill Developers

For Skill Maintenance

For Skill Distribution

Scoring Guidelines

Grade Scale

Minimum Distribution Standards

Troubleshooting

Issue: Score Below 80

Issue: Unclear Recommendations

Issue: Scope Too Broad

Related Skills

Related Documentation

Summary