name	llm-council
description	Facilitate council-style deliberation among specialized models with structured prompts, evidence gates, and explicit confidence ceilings.

LIBRARY-FIRST PROTOCOL (MANDATORY)

Before writing ANY code, you MUST check:

Step 1: Library Catalog

Location: .claude/library/catalog.json
If match >70%: REUSE or ADAPT

Step 2: Patterns Guide

Location: .claude/docs/inventories/LIBRARY-PATTERNS-GUIDE.md
If pattern exists: FOLLOW documented approach

Step 3: Existing Projects

Location: D:\Projects\*
If found: EXTRACT and adapt

Decision Matrix

Match	Action
Library >90%	REUSE directly
Library 70-90%	ADAPT minimally
Pattern exists	FOLLOW pattern
In project	EXTRACT
No match	BUILD (add to library after)

STANDARD OPERATING PROCEDURE

Purpose

Run council deliberations that collect diverse model opinions, enforce evidence-backed synthesis, and prevent overconfident consensus.

Trigger Conditions

Positive: multi-model debate, comparative reasoning, adjudication of conflicting outputs, need for weighted synthesis with evidence.
Negative: single-model answers, pure prompt polishing (route to prompt-architect), or skill creation (route to skill-forge).

Guardrails

Skill-Forge structure-first: maintain SKILL.md, examples/, tests/; add resources/ and references/ or log remediation tasks.
Prompt-Architect hygiene: define intent and constraints per council question; capture HARD/SOFT/INFERRED assumptions; output pure English with explicit ceilings.
Council safety: assign roles (proposer, challenger, judge), enforce registry agents, timebox rounds, and ensure hook latency budgets.
Adversarial validation: require dissenting review, cross-check citations, and COV on synthesis steps; capture evidence.
MCP tagging: store council transcripts with WHO=llm-council-{session} and WHY=skill-execution.

Execution Playbook

Intent & scope: define the question, success metric, and constraints; confirm inferred items.
Panel setup: select specialists, assign roles, and set scoring criteria; configure timeboxes.
Deliberation rounds: gather proposals, run adversarial critiques, and request evidence per claim.
Synthesis: weigh arguments, resolve conflicts, and produce a grounded recommendation with alternatives.
Validation loop: check evidence integrity, run COV on synthesis, and record telemetry.
Delivery: provide recommendation, rationale, dissent, risks, and confidence ceiling.

Output Format

Question, constraints, and panel composition.
Round summaries (proposals, critiques, evidence).
Synthesis with chosen path, alternatives, and risk notes.
Confidence: X.XX (ceiling: TYPE Y.YY) - rationale.

Validation Checklist

Structure-first assets present or planned; examples/tests updated or ticketed.
Roles/timeboxes enforced; evidence cited; registry-only agents used; hooks within budget.
Adversarial and COV results captured with MCP tags; confidence ceiling declared; English-only output.

Completion Definition

Council run is complete when a recommendation (or documented stalemate) is delivered with evidence, dissent captured, risks owned, and MCP log persisted.

Confidence: 0.70 (ceiling: inference 0.70) - Council orchestration rewritten with skill-forge scaffolding and prompt-architect constraint and confidence discipline.

Install Skill

SKILL.md