Claude Code Plugins

Community-maintained marketplace

Feedback

Analyzes PDFs, images, screenshots, diagrams, and documents using Gemini multimodal. Extracts text, tables, forms; interprets visuals, architecture diagrams, flowcharts, ERDs. Use when user mentions PDFs, images, screenshots, document extraction, OCR, visual analysis, diagram interpretation, or form processing. Do not use for web searching or shell commands.

Install Skill

1Download skill
2Enable skills in Claude

Open claude.ai/settings/capabilities and find the "Skills" section

3Upload to Claude

Click "Upload skill" and select the downloaded ZIP file

Note: Please verify skill by going through its instructions before using it.

SKILL.md

name analyzing-media
description Analyzes PDFs, images, screenshots, diagrams, and documents using Gemini multimodal. Extracts text, tables, forms; interprets visuals, architecture diagrams, flowcharts, ERDs. Use when user mentions PDFs, images, screenshots, document extraction, OCR, visual analysis, diagram interpretation, or form processing. Do not use for web searching or shell commands.
allowed-tools Task

Gemini Media Analysis

Spawn the gemini-media-analyst agent for image and PDF analysis.

Foreground (blocking)

Task(subagent_type="gemini-media-analyst", prompt="Analyze @<file-path>: <what to look for>")

Background (for context efficiency)

Task(subagent_type="gemini-media-analyst", prompt="Analyze @<file-path>: <task>", run_in_background=true)

Use TaskOutput(task_id="<id>") to retrieve results.

Use for screenshots, architecture diagrams, PDFs, flowcharts, or any visual content analysis.