| name | moai-essentials-perf |
| version | 4.0.0 |
| updated | 2025-11-20 |
| status | stable |
| description | Performance optimization with profiling, memory analysis, and benchmarking |
| allowed-tools | Read, Bash, WebSearch, WebFetch |
Performance Optimization Expert
Application Profiling & Performance Tuning
Focus: CPU/Memory Profiling, Benchmarking, Optimization Patterns
Tools: Python (Scalene, cProfile), Node.js (Clinic.js), Go (pprof)
Overview
Systematic approach to identifying and fixing performance bottlenecks.
Core Techniques
- Profiling: Measure CPU, memory, I/O usage.
- Benchmarking: Compare performance before/after changes.
- Optimization: Apply targeted improvements (caching, lazy loading, parallelism).
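The before/after comparison above can be sketched with the standard library's `timeit`; the list-vs-set membership test below is a placeholder workload, not part of the original document:

```python
# Minimal before/after benchmark sketch using timeit from the standard library.
import timeit

data_list = list(range(10_000))
data_set = set(data_list)

def member_list(x):
    return x in data_list   # O(n) linear scan

def member_set(x):
    return x in data_set    # O(1) average-case hash lookup

# Time the "before" (baseline) and "after" (optimized) versions on the
# same workload: worst-case lookup of the last element.
baseline = timeit.timeit(lambda: member_list(9_999), number=1_000)
optimized = timeit.timeit(lambda: member_set(9_999), number=1_000)
print(f"list lookup: {baseline:.4f}s  set lookup: {optimized:.4f}s")
```

The key point is that both versions run against the same input under the same timer, so the numbers are directly comparable.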
Implementation Patterns
1. Python Profiling (Scalene)
Installation:
pip install scalene
Usage:
# Profile CPU, memory, and GPU
scalene my_script.py
# HTML report
scalene --html --outfile profile.html my_script.py
Code-level profiling:
from scalene import scalene_profiler

# Launch the script under Scalene with profiling initially off
# (e.g. `scalene --off my_script.py`) so these calls toggle it.
scalene_profiler.start()
result = expensive_computation()  # code to profile
scalene_profiler.stop()
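When Scalene is not installed, the same region can be profiled with the standard library's cProfile. The `expensive_computation` body below is a stand-in:

```python
# Profiling a specific code region with cProfile (standard library).
import cProfile
import io
import pstats

def expensive_computation():
    # Stand-in for real work
    return sum(i * i for i in range(100_000))

profiler = cProfile.Profile()
profiler.enable()
result = expensive_computation()
profiler.disable()

# Print the top entries sorted by cumulative time
stream = io.StringIO()
pstats.Stats(profiler, stream=stream).sort_stats("cumulative").print_stats(5)
print(stream.getvalue())
```

Unlike Scalene, cProfile reports CPU time only (no memory or GPU columns), but it requires no third-party dependency.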
2. Memory Optimization
Common Issues:
- Large data structures held in memory
- Circular references preventing GC
- Inefficient data structures (lists vs generators)
Example (Python):
# Bad: Load entire file into memory
data = [line for line in open('large_file.txt')]
# Good: Use generator
def read_lines(filename):
    with open(filename) as f:
        for line in f:
            yield line.strip()
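The memory difference between a materialized list and a generator can be made visible with `sys.getsizeof` (this measures only the container object, but the contrast is still stark):

```python
import sys

numbers_list = [i for i in range(100_000)]   # materializes every element
numbers_gen = (i for i in range(100_000))    # yields one element at a time

print(sys.getsizeof(numbers_list))  # grows with the number of elements
print(sys.getsizeof(numbers_gen))   # small and constant, regardless of range size
```

The generator's size stays constant no matter how large the range is, because it holds only the iteration state.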
3. Database Query Optimization
N+1 Query Problem:
# Bad: N+1 queries
users = User.query.all()
for user in users:
    posts = Post.query.filter_by(user_id=user.id).all()  # one query per user
# Good: Eager loading (joinedload comes from sqlalchemy.orm)
from sqlalchemy.orm import joinedload
users = User.query.options(joinedload(User.posts)).all()
Indexing:
-- Create index on frequently queried columns
CREATE INDEX idx_user_email ON users(email);
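The effect of that index can be observed with SQLite's query planner. The schema below is a minimal stand-alone version of the `users`/`email` example, not taken from the original document:

```python
# Observe the query plan before and after creating an index (SQLite).
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT)")
conn.executemany(
    "INSERT INTO users (email) VALUES (?)",
    [(f"user{i}@example.com",) for i in range(1000)],
)

def plan(sql):
    # EXPLAIN QUERY PLAN rows are (id, parent, notused, detail)
    return " ".join(row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql))

query = "SELECT * FROM users WHERE email = 'user500@example.com'"
before = plan(query)   # full table scan
conn.execute("CREATE INDEX idx_user_email ON users(email)")
after = plan(query)    # index search
print(before)
print(after)
```

Before the index the planner reports a SCAN of the table; afterwards it reports a SEARCH using `idx_user_email`.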
4. Caching Strategies
In-Memory Cache (Python):
from functools import lru_cache
@lru_cache(maxsize=128)
def expensive_function(x):
    result = x * x  # placeholder for an expensive computation
    return result
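A concrete payoff of `lru_cache` is memoized recursion; `cache_info()` exposes hit/miss counters so the benefit can be verified rather than assumed:

```python
from functools import lru_cache

@lru_cache(maxsize=128)
def fib(n):
    # Naive exponential recursion becomes linear-time once memoized
    return n if n < 2 else fib(n - 1) + fib(n - 2)

value = fib(30)
info = fib.cache_info()
print(value)
print(info)  # hits > 0 confirms the cache is actually being used
```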
Redis Cache:
import redis
cache = redis.Redis(host='localhost', port=6379)
def get_data(key):
    cached = cache.get(key)
    if cached is not None:
        return cached  # note: redis returns bytes; decode/deserialize as needed
    data = fetch_from_database(key)
    cache.setex(key, 3600, data)  # cache for 1 hour
    return data
5. Async I/O (Node.js)
Avoid blocking I/O:
// Bad: Synchronous read blocks the event loop
const data = fs.readFileSync("large_file.txt", "utf8");
// Good: Asynchronous (assumes `const fs = require("fs");` and an async context)
const data = await fs.promises.readFile("large_file.txt", "utf8");
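The same non-blocking principle applies in Python via asyncio. The sketch below uses `asyncio.sleep` as a stand-in for real I/O; three 0.1 s operations complete concurrently in roughly 0.1 s total rather than 0.3 s sequentially:

```python
# Concurrent I/O with asyncio (standard library).
import asyncio
import time

async def fake_io(label, delay):
    # Simulates a non-blocking I/O call (file read, HTTP request, etc.)
    await asyncio.sleep(delay)
    return label

async def main():
    start = time.perf_counter()
    # gather() runs the awaitables concurrently and preserves result order
    results = await asyncio.gather(
        fake_io("a", 0.1), fake_io("b", 0.1), fake_io("c", 0.1)
    )
    elapsed = time.perf_counter() - start
    return results, elapsed

results, elapsed = asyncio.run(main())
print(results, f"{elapsed:.2f}s")
```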
Best Practices
- Measure First: Don't optimize without profiling.
- Target Hotspots: Focus on the ~20% of code responsible for ~80% of the runtime.
- Benchmark: Use realistic workloads, not microbenchmarks.
- Monitor Production: Use APM tools (New Relic, Datadog).
Validation Checklist
- Profiled: Code profiled with appropriate tool?
- Baseline: Performance baseline established?
- Optimized: Hotspots identified and fixed?
- Benchmarked: Improvements measured with benchmarks?
- Monitored: Production metrics tracked?
Related Skills
- moai-domain-backend: API optimization
- moai-domain-monitoring: APM setup
- moai-devops-docker: Container resource limits
Last Updated: 2025-11-20