| name | discover-data |
| description | Automatically discover data pipeline and ETL skills when working with ETL. Activates for data development tasks. |
Data Skills Discovery
Provides automatic access to comprehensive data skills.
When This Skill Activates
This skill auto-activates when you're working with:
- ETL
- data pipelines
- batch processing
- stream processing
- data validation
- orchestration
- Airflow
- timely dataflow
- differential dataflow
- streaming aggregations
- windowing
- real-time analytics
Available Skills
Quick Reference
The Data category contains 9 skills:
- batch-processing - Orchestrating complex data pipelines with dependencies
- data-validation - Validating data schema before processing
- dataflow-coordination - Coordination patterns for distributed dataflow systems
- differential-dataflow - Differential computation for incremental updates and efficient joins
- etl-patterns - Designing data extraction from multiple sources
- pipeline-orchestration - Coordinating complex multi-step data workflows
- stream-processing - Processing real-time event streams (Kafka, Flink)
- streaming-aggregations - Windowing, sessionization, time-series aggregation
- timely-dataflow - Low-latency streaming computation with progress tracking
Load Full Category Details
For complete descriptions and workflows:
cat skills/data/INDEX.md
This loads the full Data category index with:
- Detailed skill descriptions
- Usage triggers for each skill
- Common workflow combinations
- Cross-references to related skills
Load Specific Skills
Load individual skills as needed:
# Traditional ETL/Batch
cat skills/data/batch-processing.md
cat skills/data/data-validation.md
cat skills/data/etl-patterns.md
cat skills/data/pipeline-orchestration.md
# Stream Processing
cat skills/data/stream-processing.md
cat skills/data/streaming-aggregations.md
# Advanced Dataflow Systems
cat skills/data/timely-dataflow.md
cat skills/data/differential-dataflow.md
cat skills/data/dataflow-coordination.md
Common Workflow Combinations
Real-Time Analytics Pipeline
# Load these skills together:
cat skills/data/stream-processing.md # Kafka setup
cat skills/data/streaming-aggregations.md # Windowing patterns
cat skills/data/dataflow-coordination.md # Coordination
Incremental Computation System
# Load these skills together:
cat skills/data/timely-dataflow.md # Foundation
cat skills/data/differential-dataflow.md # Incremental updates
cat skills/data/dataflow-coordination.md # Distributed coordination
Hybrid Batch + Stream
# Load these skills together:
cat skills/data/batch-processing.md # Batch jobs
cat skills/data/stream-processing.md # Stream processing
cat skills/data/pipeline-orchestration.md # Overall coordination
Progressive Loading
This gateway skill enables progressive loading:
- Level 1: Gateway loads automatically (you're here now)
- Level 2: Load category INDEX.md for full overview
- Level 3: Load specific skills as needed
Usage Instructions
- Auto-activation: This skill loads automatically when Claude Code detects data work
- Browse skills: Run
cat skills/data/INDEX.mdfor full category overview - Load specific skills: Use bash commands above to load individual skills
Next Steps: Run cat skills/data/INDEX.md to see full category details.