---
name: agent-certifier
description: Given a human certification or license (e.g. PL-300, SAP B1, Azure AI Engineer), create a production-ready agent skill profile and certification ladder, including skills.yaml entries, agent YAML, and skills documentation, using the anthropics/skills SKILL.md conventions.
---
# Agent Certifier
This skill turns human certifications (e.g. PL-300, CPA, SAP B1, Azure AI) into machine-certifiable agents with clear skills, benchmarks, and a signed competency contract.
Use this skill when the user gives:
- A human cert or license name (e.g. "PL-300: Power BI Data Analyst")
- Optionally, one or more reference repos or products (e.g. `microsoft/powerbi-desktop-samples`)
- A target agent name/slug (e.g. `powerbi-bi-architect`)
Your job is to emit a complete, market-ready bundle:
- A skills spec (YAML) with levels, benchmarks, and tools
- An agent spec (YAML) wired to those skills
- A human-readable `skills.md` for documentation
- An optional certification JWT schema, compatible with an external `CertificationAuthority`
## When to Use This Skill
Invoke this skill whenever the user wants to:
- Translate a human certification (Azure, SAP, CPA, Azure AI, etc.) into an agent certification ladder
- Define skills + tools + benchmarks for an agent in a reusable, model-agnostic way
- Produce ready-to-commit files for a repo:
  - `skills/<domain>.yaml`
  - `agents/<agent_slug>.yaml`
  - `docs/<agent_slug>-skills.md`
The output should be designed so it can be:
- Used by Claude Skills (this `SKILL.md` format)
- Loaded by OpenSkills (`anthropics/skills`-compatible)
- Reused by other agents (Gemini, OpenAI, etc.) via the same YAML contracts
## Input Format
Assume the user will give you (in natural language, not strict JSON):
- Human cert(s) and level(s)
  - e.g. "PL-300 + DP-500 + Power Platform Solution Architect Expert"
- Domain / role
  - e.g. "Power BI / Fabric BI Architect for retail dashboards"
- Reference repos or artifacts (optional but common)
  - e.g. GitHub URLs, product pages, sample `.pbix`, `.twb`, etc.
- Target agent id/slug
  - e.g. `powerbi_bi_architect`
You must infer missing pieces safely and document assumptions.
## What To Produce
Always produce three main artifacts (as copy-paste-ready blocks):
### 1. `skills/<domain>.yaml`
A YAML file that defines:
- `version`, `domain`
- `human_analogs`: list of human certs you are mirroring
- `sources`: reference repos or artifacts (GitHub, sample files)
- `tools`: logical tool contract names (e.g. `pbix_reader`, `dax_analyzer`)
- `skills`: each with:
  - `id`, `level` (fundamentals / role_based / specialty / business / expert, etc.)
  - `human_analog`, `description`
  - `required_tools`, optional `prerequisites`
  - `benchmarks`: each with `id`, `description`, `repo_source`, and a `success_criteria` list
Keep tool names abstract so they can be mapped to OpenAI/Gemini/Claude tool schemas later.
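For orientation, here is a minimal sketch of the shape, assuming the Power BI domain from Example 1 below; every id, tool name, and threshold is illustrative rather than prescribed:

```yaml
# skills/power_bi.yaml - illustrative sketch; ids, tools, and criteria are examples
version: "1.0"
domain: cloud_ai_platforms.power_bi
human_analogs:
  - "PL-300: Power BI Data Analyst"
sources:
  - github: microsoft/powerbi-desktop-samples
tools:
  - pbix_reader      # parses .pbix model metadata
  - dax_analyzer     # evaluates DAX measures against expected results
skills:
  - id: pbi_fundamentals
    level: fundamentals
    human_analog: "PL-300 (core)"
    description: Load, model, and visualize a simple star schema.
    required_tools: [pbix_reader]
    benchmarks:
      - id: store_sales_kpi_parity
        description: Rebuild the Store Sales KPIs from the sample .pbix
        repo_source: microsoft/powerbi-desktop-samples
        success_criteria:
          - "Headline KPIs match the reference report within 1%"
          - "Model refresh completes without errors"
```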
### 2. `agents/<agent_slug>.yaml`
An agent spec that:
- References the domain and skill IDs from the skills YAML
- Lists required tools by id
- Defines `human_cert_analogs` (strings)
- Defines `certification_policy`:
  - Levels (`fundamentals`, `associate`, `expert`, etc.)
  - `required_skills` for each level
  - `min_benchmarks_passed` per level
  - An `issuance` block:
    - `title_template`
    - `validity_days`
    - `conditions` (bullets)
Include a `benchmarks_runtime` block describing:

- `repo_sources` (e.g. `microsoft/powerbi-desktop-samples`)
- `execution.runner` (e.g. `ci.pipeline.powerbi`)
- `schedule` (e.g. `nightly`)
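A matching sketch for the agent spec, reusing the skill and tool ids from the example above (the level names and runner id are assumptions, not a fixed schema):

```yaml
# agents/powerbi_bi_architect.yaml - illustrative sketch, not a fixed schema
id: powerbi_bi_architect
domain: cloud_ai_platforms.power_bi
skills: [pbi_fundamentals]                    # ids from skills/<domain>.yaml
required_tools: [pbix_reader, dax_analyzer]
human_cert_analogs:
  - "PL-300: Power BI Data Analyst"
certification_policy:
  levels:
    - name: fundamentals
      required_skills: [pbi_fundamentals]
      min_benchmarks_passed: 1
  issuance:
    title_template: "{agent_id} - {level} Certified"
    validity_days: 365
    conditions:
      - "All required benchmarks green on the latest scheduled run"
benchmarks_runtime:
  repo_sources: [microsoft/powerbi-desktop-samples]
  execution:
    runner: ci.pipeline.powerbi
  schedule: nightly
```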
### 3. `docs/<agent_slug>-skills.md`
A markdown doc for humans that:
- Explains which human certs this agent emulates
- Lists each skill level with:
  - Human analog
  - Capabilities (bullets)
  - Benchmark(s) and pass criteria
- Explains the certification policy:
  - What "Fundamentals / Associate / Expert Certified" means
  - How the external `CertificationAuthority` JWT is issued & used (a claims sketch follows the outline below)
Structure this as:
- Overview
- Human Certification Analogs
- Tools Required
- Skill Levels & Benchmarks
- Certification Policy
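For the JWT section of the doc, a minimal claims sketch can be included. `iss`, `sub`, `iat`, and `exp` are standard JWT claims; the remaining claim names below are assumptions, not a fixed `CertificationAuthority` contract:

```yaml
# Illustrative JWT payload (shown as YAML; serialized as JSON claims when issued)
iss: certification-authority       # external issuer
sub: powerbi_bi_architect          # certified agent id
level: fundamentals                # level granted under certification_policy
skills: [pbi_fundamentals]         # skill ids covered by this certificate
benchmarks_passed: [store_sales_kpi_parity]
iat: 1735689600                    # issued-at (Unix time)
exp: 1767225600                    # issued-at plus validity_days
```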
## Instructions
When this skill is active:
1. **Parse the user brief.**
   - Identify the domain (e.g. Power BI, SAP B1, Azure AI, Odoo).
   - Extract all human certification names and their levels.
   - Note any reference repos / products / sample files.
2. **Define the skills ladder.**
   - Map human certs into 3–5 levels:
     - fundamentals → role_based → specialty → business → expert
   - For each level:
     - Write a concise description of capabilities.
     - Choose the tools needed (abstract names).
     - Design 1–3 concrete benchmarks that can be evaluated automatically (see the sketch after this list).
     - Benchmarks must have clear, measurable pass criteria (e.g. KPI parity within 1%, build succeeds, tests green, etc.).
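For example, a single benchmark entry with machine-checkable criteria might look like this (the id and thresholds are illustrative):

```yaml
# Illustrative benchmark entry - thresholds are example values, not mandates
- id: competitive_marketing_kpi_parity
  description: Reproduce the Competitive Marketing Analysis dashboard KPIs
  repo_source: microsoft/powerbi-desktop-samples
  success_criteria:
    - "Each headline KPI within 1% of the reference .pbix"
    - "Report build succeeds with zero errors"
    - "All automated report tests green"
```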
3. **Design tool contracts.**
   - Keep tools model-agnostic:
     - Do NOT hard-code OpenAI / Gemini / Claude APIs.
     - Use logical names (e.g. `ocr_gateway`, `sap_b1_api_client`, `pbix_reader`).
     - Focus on what the tool does, not how it's implemented (one possible contract shape is sketched below).
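One possible shape for such a contract, assuming nothing beyond the logical names above (the field names are illustrative, not a mandated schema):

```yaml
# Illustrative logical tool contract - field names are assumptions
tools:
  - id: pbix_reader
    description: Extracts tables, relationships, and measures from a .pbix file
    inputs:
      - name: pbix_path
        type: string
    outputs:
      - name: model_metadata
        type: object       # tables, relationships, measures
    # The runtime maps this to a Claude / OpenAI / Gemini tool schema later
```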
4. **Generate the three artifacts.**
   - Emit them in this order, each in its own fenced code block:
     1. `skills/<domain>.yaml`
     2. `agents/<agent_slug>.yaml`
     3. `docs/<agent_slug>-skills.md`
   - Make them ready to commit (no placeholders like `TODO` or `...`).
5. **State assumptions & risks briefly.**
   - At the end, add a short "Assumptions & Risks" section (3–6 bullets) outside the code blocks:
     - List any big assumptions (e.g. data source, repos, tools).
     - Flag anything that absolutely needs human review (compliance, PII, prod access).
## Examples
### Example 1 – Power BI Architect

> "Create an agent that is equivalent to PL-300 + DP-500 + Power Platform Solution Architect, using `microsoft/powerbi-desktop-samples` as the benchmark repo. Agent slug: `powerbi_bi_architect`."
You should:
- Define `domain: "cloud_ai_platforms.power_bi"`
- Map the certs into a skills ladder (fundamentals → expert)
- Use the Store Sales / Competitive Marketing Analysis `.pbix` files as benchmarks
- Emit YAML + docs as specified above
### Example 2 – Azure AI Engineer

> "Build an agent certified at the same level as Azure AI Engineer Associate, focused on RAG systems for finance dashboards."
You should:
- Create a `cloud_ai_platforms.azure_ai_rag` skills domain
- Define tools like `embedding_indexer`, `rag_query_runner`, `azure_openai_client`
- Add benchmarks: end-to-end RAG flow, latency, accuracy, hallucination checks
- Emit all three artifacts.
### Example 3 – Retail Analytics (Scout)

> "Certify an agent at the same level as a Retail Analytics Data Engineer for sari-sari / FMCG dashboards, using the Scout dashboard as the benchmark."
You should:
- Create a `retail_analytics.scout` domain
- Define tools: `supabase_query_runner`, `kpi_validator`, `chart_renderer`
- Add benchmarks: schema validation, KPI accuracy, dashboard render time
- Emit YAML + docs
## Guidelines
- Prefer clear, testable benchmarks over vague descriptions.
- Keep everything implementation-agnostic:
  - No hard-wiring to a single model provider.
  - Assume tools can be backed by Claude, OpenAI, Gemini, or local models.
- Favor production-readiness:
  - Think like a vendor shipping a marketplace agent, not a demo.
  - Include governance/security considerations where relevant (RLS, PII, secrets).
- Never include real secrets or API keys in outputs.
## Related Skills
- `engine-spec-writer` – Creates `engine.yaml` specs for IPAI Platform
- `supabase-schema-designer` – Designs Supabase schemas with RLS
- `dashboard-builder` – Builds Next.js dashboards from specs