name	AI Documentation Standards
description	Write AI-readable documentation following concise-over-comprehensive principle, hierarchical CLAUDE.md/AGENTS.md inheritance (100-200 line rule), structured formats (tables over prose), parallel validation, and session knowledge capture. Use when writing documentation, updating docs, or optimizing existing docs.
allowed-tools	Read, Write, Edit, Grep, Glob

AI Documentation Standards

Purpose: Create documentation optimized for AI code agents in large monorepos and complex systems.

Key Insight: AI agents need concise, structured, scannable information with clear context boundaries - not comprehensive tutorials. Documentation should be a MAP to the codebase, not a REPLACEMENT for reading code.

Core Principles for AI-Readable Docs

1. Concise Over Comprehensive

AI agents can read code. Documentation = MAP to codebase, not replacement. Provide structure and location references (file:line), not exhaustive explanations.

2. Structure Over Prose

Tables, bullets, code snippets >>> Paragraphs. AI agents parse structured content faster and more accurately.

3. Location References Over Explanations

Always include file:line_number for implementation details. AI agents need to know WHERE to look, then they read the code.

4. Hierarchical Context Boundaries

Create clear "entry points" for different detail levels:

OVERVIEW.md (what/why)
architecture/ (how it works)
implementation/ (where specific logic lives)
guides/ (when to use approaches)

5. Searchable Keywords

Use consistent terminology. Include searchable keywords, related components. AI agents use semantic search.

CLAUDE.md Hierarchical Inheritance

The 100-200 Line Rule

Most CLAUDE.md files should be 100-200 lines. Exceptions: Complex production systems (300-400 lines max).

WHY: Claude recursively loads CLAUDE.md from root → current directory. Everything becomes agent context, so verbosity = wasted tokens.

Principle: Child CLAUDE.md files should ONLY contain information unique to their level. Never duplicate parent content.

Line Count Targets by Level

Level	Lines	Focus	What to Include	What to Exclude
Global (`~/.claude/CLAUDE.md`)	100-120	Decision framework, workflow modes	Thinking budget, core principles, parallel execution, git safety	Tool-specific commands, project patterns, testing details
Project (e.g., `project/CLAUDE.md`)	150-180	Project standards, tool configs	Linting commands, testing structure, code style	Core principles (in global), subsystem details
Subsystem (e.g., `subsystem/CLAUDE.md`)	120-150	Architecture overview, design patterns	System architecture, data flow, major components	Testing patterns (in project), code style (in project)
Framework (e.g., `imports/CLAUDE.md`)	100-120	Framework patterns, conventions	Three-phase pattern, base classes, directory structure	Subscription handling (in parent), testing (in project)
Simple Tool	200-250	Purpose, architecture	Tool-specific logic, configuration, gotchas	Code quality (in project), testing (in project)
Complex Tool	300-400	Architecture, business logic	Critical business logic, patterns, gotchas	Code standards, testing strategy, base patterns

What NOT to Duplicate Across Hierarchy

Never duplicate across hierarchy levels:

Core Principles (global only) - NO PARTIAL WORK, FAIL LOUD, parallel execution, decision framework
Testing Patterns (project only) - Coverage requirements, test organization, location conventions
Code Style (project only) - Line length, type hints, naming conventions, documentation rules
Tool Configurations (project only) - Linting command paths, formatter configs, build settings
Error Handling Philosophy (global only) - CRASH/RETRY/SKIP/WARN strategies, try/except usage
Subsystem Patterns (subsystem only) - Multi-tenancy, database operations, caching strategies

Hierarchical Inheritance Example

Bad: Testing standards repeated in global, project, and tool CLAUDE.md (3x duplication)

Good: Define once in global, reference in project ("See global CLAUDE.md for coverage requirements"), reference in tool ("See project CLAUDE.md for structure")

Result: 50% reduction in duplication

For full examples: See reference.md

When to Extract to QUICKREF.md

Triggers (any one means extract)

CLAUDE.md exceeds 400 lines
More than 5 code examples with before/after patterns
Detailed implementation walkthroughs (>50 lines per pattern)
Comprehensive testing strategies with mock examples
Refactoring guides with line-by-line explanations

What Goes Where

QUICKREF.md (deep-dive): Full code examples, detailed patterns, testing strategies with mocks, performance optimization, debugging strategies, architecture deep dives

CLAUDE.md (quick reference): Critical business logic (table format), architecture overview (bullets), common gotchas (condensed with file:line), key constants, "See QUICKREF.md for details"

AI-Specific Documentation Types

For full templates: See reference.md

Quick Reference

Type 1: OVERVIEW.md (100-200 lines)

Purpose + Performance metrics
Key components table
Data flow (bullet points)
Critical decisions table
Common gotchas (numbered)

Type 2: BUSINESS_RULES.md (300-800 lines)

Rules index for quick navigation
Rule definitions table: # | Rule | Condition | Behavior | WHY | Implementation
Detailed section per rule with test coverage and edge cases

Type 3: ARCHITECTURE.md (200-400 lines)

Design principles (bullet points)
Processing pipeline (phases with file:line refs)
Key patterns with examples
Integration points + performance characteristics

Type 4: API_REFERENCE.md (200-600 lines)

Public functions table: Function | Purpose | Input | Output | Location
Detailed signature per function with examples
Internal functions reference (condensed)

Type 5: TROUBLESHOOTING.md (150-300 lines)

Quick diagnosis table: Symptom | Likely Cause | Fix
Problem/Solution pairs with validation steps
Debug workflows (numbered steps)
Log analysis patterns

llms.txt Index Pattern

Purpose: Help AI agents find relevant docs quickly in large monorepos

When to Create:

Monorepo has >10 documentation files
Multiple subsystems with separate docs
Documentation spread across directories

Structure: Organized sections (Quick Start, Business Logic, Architecture, Implementation, Troubleshooting) with file paths + line counts, plus task-specific navigation guides

For full template: See reference.md

AGENTS.md Pattern

Purpose: Specialized documentation for AI coding agents (alternative to CLAUDE.md)

When to Use: Detect parent format - maintain consistency (CLAUDE.md or AGENTS.md)

Location: Project root

Structure:

# AGENTS.md

## Quick Context
- Tech stack, architecture, entry point

## Documentation Structure
- Start: OVERVIEW.md, Rules: BUSINESS_RULES.md, API: API_REFERENCE.md

## For Documentation Updates
- Review strategy: 1 reviewer per N components
- Validation priority, line counts, session knowledge extraction

## For Code Changes
- Testing, linting, standards references

## For Reviews
- Parallel execution, grouping, focus areas

## Critical Gotchas
[Project-specific]

Key Difference from CLAUDE.md: Focused on agent workflows vs general project context

Session Knowledge Capture

Purpose: Extract and preserve insights learned during work sessions

Categories:

Gotchas: Edge cases, non-obvious behavior, timing requirements
Decisions: Design choices with rationale and trade-offs
Performance: Metrics, optimal values, bottlenecks
Business Rules: Logic discovered not previously documented
Patterns: Reusable approaches discovered

Scoping:

project_local: Specific project/tool only → $WORK_DIR/main doc
parent_scope: Parent directory (framework/subsystem) → parent doc
repo_scope: Entire repository → repo docs or skills

Integration:

Gotchas → Main doc "Common Gotchas"
Decisions → ARCHITECTURE.md or QUICKREF.md
Performance → QUICKREF.md or HOW_TO.md
Business rules → BUSINESS_RULES.md
Patterns → ARCHITECTURE.md or skills (if reusable)

Smart Placement:

if scope == "project_local":
    location = "$WORK_DIR/CLAUDE.md or AGENTS.md"
elif scope == "parent_scope":
    location = find_parent_doc($WORK_DIR)
elif scope == "repo_scope":
    location = "$REPO_ROOT/docs/PATTERNS.md or skills/"

Parallel Validation Strategy

Complexity Metrics:

components = modules + standalone_files
doc_lines = sum(count_lines(md_files))
rules = count_rule_rows("BUSINESS_RULES.md")
score = (components * 100) + (doc_lines / 10) + (rules * 50)

Reviewer Count:

<1000: 1 reviewer
1000-3000: 2 reviewers
3000-6000: 4 reviewers
6000: 6 reviewers

Grouping:

Related components together
Main doc hierarchy in single reviewer
Balance lines (~2000 per reviewer)
Dedicated hierarchy reviewer if >3 levels

Example:

# 15 components, 4 reviewers
Group 1: processors/ (7 files) + main doc sections
Group 2: cache/ + api/ + main doc sections
Group 3: docs/ (OVERVIEW, BUSINESS_RULES, ARCHITECTURE)
Group 4: Hierarchy integrity (all main docs)

Smart Documentation Placement

Placement Algorithm:

Analyze scope (project_local, parent_scope, repo_scope)
Walk hierarchy ($WORK_DIR → repo root)
Detect format (CLAUDE.md or AGENTS.md - maintain consistency)
Check organization (docs// structure exists?)

Decision Tree:

if scope == "project_local":
    if not exists("$WORK_DIR/docs/"):
        create_docs_structure($WORK_DIR)

    if topic in ["gotchas", "overview"]:
        location = "$WORK_DIR/main doc"
    elif topic == "business_rules":
        location = "$WORK_DIR/docs/BUSINESS_RULES.md"
    elif topic == "architecture":
        location = "$WORK_DIR/docs/ARCHITECTURE.md"
    elif topic == "api":
        location = "$WORK_DIR/docs/API_REFERENCE.md"

elif scope == "parent_scope":
    parent = find_parent_with_docs($WORK_DIR)
    location = f"{parent}/docs/{topic}.md"

elif scope == "repo_scope":
    root = find_repo_root($WORK_DIR)
    if exists(f"{root}/docs/"):
        location = f"{root}/docs/PATTERNS.md"
    else:
        location = f"{root}/.claude/skills/{category}/"

Proper Organization:

$WORK_DIR/docs/
├── llms.txt
├── OVERVIEW.md
├── API_REFERENCE.md
├── ARCHITECTURE.md
├── BUSINESS_RULES.md
├── HOW_TO.md
├── TROUBLESHOOTING.md
└── <sub-component>/
    ├── COMPONENT_OVERVIEW.md
    └── COMPONENT_DETAILS.md

Content Optimization Strategies

For full examples with before/after: See reference.md

Quick Reference

1. Extract-Consolidate-Reference (ECR)

Problem: Duplicate information across files
Solution: Single source of truth, reference from other docs
Example savings: 1,500 lines across 4 files

2. Split Large Monoliths (SLM)

Problem: Single file >1,000 lines
Solution: Split by responsibility (OVERVIEW, PIPELINE, BUSINESS_LOGIC, etc.)
Benefit: Each file fits on one screen

3. Table-ify Prose (T2T)

Problem: Long paragraphs explaining logic
Solution: Convert to tables with What/When/Why/Where columns
Example savings: 85% reduction (200 lines → 30 lines)

4. Add llms.txt Index

Problem: AI agents can't find relevant docs
Solution: Create index with doc summaries and task-specific navigation

5. Condense File Structure Trees

Show only key files, collapse similar items with [N more files]
Example savings: 67% reduction (60 lines → 20 lines)

6. Bullet Points Over Paragraphs

Convert prose to dense bullet points or single-sentence summaries
Example savings: 75% reduction (4 lines → 1 line)

Anti-Patterns to Avoid

For detailed examples: See reference.md

Quick Reference

1. Don't Write Tutorials

Why: AI agents need REFERENCE, not teaching
Bad: Step-by-step walkthroughs explaining every detail
Good: Checklist with base class, required methods, file:line references

2. Don't Explain Obvious Code

Why: AI can read code directly
Bad: Line-by-line prose restating code logic
Good: One-line summary with file:line reference + WHY

3. Don't Mix Abstraction Levels

Why: Confuses readers, mixes strategic and tactical
Bad: OVERVIEW.md with implementation details (asyncio.gather, table indexes)
Good: OVERVIEW.md stays high-level, links to detailed architecture docs

4. Don't Create Deep Directory Nesting

Why: AI agents struggle with deep hierarchies
Bad: 7+ levels of nested directories
Good: 2-3 levels max (docs/architecture/COMPONENT.md)

5. Don't Use Relative Paths Without Context

Why: AI can't resolve without knowing current location
Bad: ../processors/base.py
Good: processors/base_processor.py or absolute path

6. Don't Duplicate Content Across Hierarchy

Why: Wastes context tokens, creates maintenance burden
Bad: Testing standards repeated in global, project, and tool CLAUDE.md
Good: Define once (global), reference elsewhere (project, tool)

Validation Checklist

Before committing documentation changes:

Duplication Check

No duplicate core principles (defined in global only)
No duplicate testing patterns (defined in project only)
No duplicate code style rules (defined in project only)
No duplicate subsystem patterns (defined in parent subsystem only)

Content Optimization

Business logic in table format (not prose)
Decision matrices for strategies (not paragraphs)
File structure condensed (not full tree)
Bullet points over paragraphs where possible
Location references include file:line when relevant

Line Count Targets

Global CLAUDE.md: 100-120 lines
Project CLAUDE.md: 150-180 lines
Subsystem CLAUDE.md: 120-150 lines
Framework CLAUDE.md: 100-120 lines
Simple Tool CLAUDE.md: 200-250 lines
Complex Tool CLAUDE.md: 300-400 lines max

Deep-Dive Extraction

If CLAUDE.md > 400 lines, extract to QUICKREF.md
Code examples with before/after → QUICKREF.md
Detailed implementations → QUICKREF.md
Keep quick reference + "See QUICKREF for details"

Hierarchy Integrity

Child files reference parent files (not duplicate)
Each level focuses on unique information
Clear separation of concerns across levels

AI-Readability

Can AI find relevant doc in <30 seconds via llms.txt?
Can AI understand component in <5 min reading OVERVIEW?
Can AI find specific function via API_REFERENCE?
Can AI debug issue via TROUBLESHOOTING?

Structure

llms.txt index exists and is accurate (if needed)
No file >600 lines (except BUSINESS_RULES if needed)
Directory depth ≤3 levels
No duplicate files

Maintenance

Single source of truth for business rules
No scattered duplicates
Archive clearly marked as historical (if applicable)
Cross-references use file paths, not relative links

Templates

Template: Simple Tool CLAUDE.md (200-250 lines)

# [Tool Name] Import

**Purpose**: [One sentence describing what this import does]

**Key Patterns**:
- Extends framework base classes (see `framework/CLAUDE.md`)
- [Tool-specific pattern 1]
- [Tool-specific pattern 2]

**Configuration**:
- Config location: `/path/to/config.json` → `[tool_name]` section
- Required keys: [list required config keys]

**Critical Logic**:
[Only if tool has unique business logic - DELETE section if not applicable]

**Common Gotchas**:
[Only if tool has known edge cases - DELETE section if not applicable]

**Testing**:
- Unit tests: `tests/test_[tool_name]/unit_tests/`
- Integration tests: `tests/test_[tool_name]/integration_tests/`
- See project CLAUDE.md for testing standards

**References**:
- Framework: `framework/CLAUDE.md`
- Testing standards: Project CLAUDE.md

Template: Complex Tool CLAUDE.md (300-400 lines)

Structure:

Purpose + Performance Metrics (5-10 lines)
Architecture Overview (40-60 lines) - Core design, storage architecture table, processing pipeline
File Structure (20-40 lines, condensed tree)
Critical Business Logic (60-80 lines, table format)
Key Architecture Patterns (60-80 lines) - Pattern name, summary, benefits, "See QUICKREF.md"
Common Gotchas (40-60 lines, 8-10 condensed subsections)
Key Constants (10-20 lines, bullet list)
When in Doubt (10-15 lines, numbered list with references)

Total: 300-400 lines max

For full example structure: See reference.md

Summary

The Golden Rules

Hierarchical Inheritance - Never duplicate parent content, always reference
100-200 Line Target - Most files should be this range (complex tools 300-400 max)
Tables Over Prose - Business logic, gotchas, decisions all in table format
Extract Deep-Dive - Code examples and detailed implementations → QUICKREF.md
Define Once - Testing, code style, core principles defined once at appropriate level
Location References - Always include file:line for implementation details
Concise Over Comprehensive - Documentation is a MAP, not a TUTORIAL

The Test

Can you scan the documentation in 30 seconds and find critical information? If not, it's too verbose.

The Goal

Minimize agent context, maximize information utility, maintain hierarchical clarity.

Remember: Less is more. Every line consumes agent context. Make every line count.

For comprehensive templates and examples: See reference.md

Install Skill

SKILL.md

AI Documentation Standards

Core Principles for AI-Readable Docs

1. Concise Over Comprehensive

2. Structure Over Prose

3. Location References Over Explanations

4. Hierarchical Context Boundaries

5. Searchable Keywords

CLAUDE.md Hierarchical Inheritance

The 100-200 Line Rule

Line Count Targets by Level

What NOT to Duplicate Across Hierarchy

Hierarchical Inheritance Example

When to Extract to QUICKREF.md

Triggers (any one means extract)

What Goes Where

AI-Specific Documentation Types

Quick Reference

llms.txt Index Pattern

AGENTS.md Pattern

Session Knowledge Capture

Parallel Validation Strategy

Smart Documentation Placement

Content Optimization Strategies

Quick Reference

Anti-Patterns to Avoid

Quick Reference

Validation Checklist

Duplication Check

Content Optimization

Line Count Targets

Deep-Dive Extraction

Hierarchy Integrity

AI-Readability

Structure

Maintenance

Templates

Template: Simple Tool CLAUDE.md (200-250 lines)

Template: Complex Tool CLAUDE.md (300-400 lines)

Summary

The Golden Rules

The Test

The Goal