name	julien-media-subtitle-translation
description	Translates SRT subtitle files using LLM APIs (OpenRouter/Llama). Use when user wants to translate subtitles, SRT files, or needs batch subtitle translation for movies/series. Handles HTML tags, batch processing, retries, and cost estimation.
triggers	translate subtitles, translate srt, subtitle translation, traduire sous-titres, batch translate subtitles, srt file translation, movie subtitles, series subtitles, translate episodes

Subtitle Translation Skill

Translate SRT subtitle files efficiently using LLM APIs with proper cost management.

When to Use

User wants to translate .srt subtitle files
Batch translation of multiple episodes/movies
Translation between any language pair (default: EN→FR)

Step 0: Ask User for API Provider

Always ask the user which provider to use:

Which API provider do you want to use?

1. **OpenRouter** (Recommended) - Best price/quality, many models
2. **OpenAI** - GPT-4o-mini, reliable but needs credit card
3. **Ollama Local** - Free, runs on your machine (needs GPU)
4. **Ollama Hostinger** - Free, runs on your VPS

For large batches (100+ episodes): OpenRouter or Ollama Hostinger
For small batches (<20 episodes): Any option works

Provider Configuration

Provider	API URL	Auth
OpenRouter	`https://openrouter.ai/api/v1/chat/completions`	Bearer token
OpenAI	`https://api.openai.com/v1/chat/completions`	Bearer token
Ollama Local	`http://localhost:11434/api/chat`	None
Ollama Hostinger	`http://YOUR_VPS_IP:11434/api/chat`	None

Step 1: Choose Model Based on Complexity

Ask user about subtitle complexity:

What type of content are you translating?

A) Simple dialogue (sitcoms, slice-of-life) → Fast/cheap model
B) Standard content (action, drama) → Balanced model
C) Complex content (technical, poetry, wordplay) → Quality model

Model Recommendations (December 2024)

Complexity	OpenRouter	OpenAI	Ollama
Simple	`mistral-7b-instruct` ($0.03/M)	-	`mistral:7b`
Standard	`llama-3.3-70b-instruct` ($0.11/M in, $0.34/M out)	`gpt-4o-mini` ($0.15/M in, $0.60/M out)	`llama3.1:70b`
Complex	`claude-3.5-sonnet` ($3/M in, $15/M out)	`gpt-4o` ($2.50/M in, $10/M out)	`llama3.1:70b`

Low-Resource Servers (VPS < 16GB RAM)

For servers with limited RAM, use lightweight multilingual models:

Model	RAM Required	Quality	Speed	Best For
`aya:8b`	~6GB	Good	Fast	Multilingual - 100+ languages native
`mistral:7b`	~5GB	OK	Fast	European languages
`gemma2:9b`	~7GB	Good	Medium	General purpose

Aya 8B is recommended for subtitle translation on VPS because:

Trained specifically for multilingual tasks (100+ languages)
Low memory footprint (~6GB VRAM/RAM)
Good quality for dialogue translation
Free with Ollama

# Install on VPS
ollama pull aya:8b

# Test
ollama run aya:8b "Translate to French: Hello, how are you?"

Live Model Search (OpenRouter)

def search_models(min_context=8000):
    """Search OpenRouter for available models with pricing"""
    r = requests.get("https://openrouter.ai/api/v1/models")
    models = r.json().get("data", [])

    # Filter and sort by price
    suitable = []
    for m in models:
        ctx = m.get("context_length", 0)
        if ctx >= min_context:
            price_in = m.get("pricing", {}).get("prompt", 0)
            price_out = m.get("pricing", {}).get("completion", 0)
            suitable.append({
                "id": m["id"],
                "name": m.get("name", m["id"]),
                "context": ctx,
                "price_in": float(price_in) * 1_000_000,  # Per M tokens
                "price_out": float(price_out) * 1_000_000
            })

    return sorted(suitable, key=lambda x: x["price_in"])[:10]

Key Learnings (Production-Tested)

Cost Reality Check

Always multiply naive estimates by 1.5x due to:

System prompts repeated per batch
JSON formatting overhead
Instruction tokens

Model	Input	Output	Real Cost/Episode
Llama 3.3 70B	$0.11/M	$0.34/M	~$0.007
GPT-4o-mini	$0.15/M	$0.60/M	~$0.012

Optimal Configuration

BATCH_SIZE = 25          # Subtitles per API request (sweet spot)
DELAY_BETWEEN_REQUESTS = 0.5  # Avoid rate limits
MAX_RETRIES = 3          # Per batch
REQUEST_TIMEOUT = 90     # Seconds

Implementation Steps

1. Parse SRT Format

def parse_srt(content):
    """Parse SRT into list of (index, timing, text) tuples"""
    blocks = []
    current = []
    for line in content.strip().split('\n'):
        if line.strip() == '':
            if current:
                idx = current[0]
                timing = current[1]
                text = '\n'.join(current[2:])
                blocks.append((idx, timing, text))
                current = []
        else:
            current.append(line)
    if current:
        idx = current[0]
        timing = current[1]
        text = '\n'.join(current[2:])
        blocks.append((idx, timing, text))
    return blocks

2. HTML Tag Preservation (Critical)

Never send HTML tags to the LLM - they get corrupted. Strip before, reapply after.

import re

def extract_formatting(text):
    """Extract HTML tags with positions for later restoration"""
    tags = []
    for match in re.finditer(r'<[^>]+>', text):
        tags.append((match.start(), match.end(), match.group()))
    return tags

def strip_html_tags(text):
    """Remove HTML tags for clean translation"""
    return re.sub(r'<[^>]+>', '', text)

def apply_formatting(translated, original_tags):
    """Reapply original HTML structure to translation"""
    if not original_tags:
        return translated
    # Preserve italic tags at start/end
    result = translated
    for start, end, tag in original_tags:
        if tag == '<i>' and not result.startswith('<i>'):
            result = '<i>' + result
        elif tag == '</i>' and not result.endswith('</i>'):
            result = result + '</i>'
    return result

3. Batch Translation with Retry Queue

def translate_batch(texts, source='English', target='French'):
    """Translate batch of texts, return list or None on failure"""
    prompt = f"""Translate these {source} subtitles to {target}.
Return ONLY a JSON array of translated strings, same order.
Keep it natural and conversational.

{json.dumps(texts, ensure_ascii=False)}"""

    # API call with retries
    for attempt in range(MAX_RETRIES):
        try:
            response = call_api(prompt)
            return json.loads(response)
        except:
            time.sleep(2 ** attempt)  # Exponential backoff
    return None

4. Failed Batch Queue

Store failed batches for later retry:

def add_failed_batch(file_path, batch_index, blocks_data):
    """Queue failed batch for retry"""
    failed_data = load_failed_batches()
    failed_data["batches"].append({
        "fr_srt": str(file_path),
        "batch_index": batch_index,
        "blocks": blocks_data,
        "retry_count": 0,
        "timestamp": datetime.now().isoformat()
    })
    save_failed_batches(failed_data)

5. Resilient Execution

Always wrap main() in auto-restart:

def run_resilient():
    """Auto-restart on errors (up to 50 times)"""
    max_restarts = 50
    restart_count = 0

    while restart_count < max_restarts:
        try:
            main()
            break
        except KeyboardInterrupt:
            log("Manual stop (Ctrl+C)")
            break
        except Exception as e:
            restart_count += 1
            log(f"ERROR: {e}")
            log(f"Auto-restart {restart_count}/{max_restarts} in 30s...")
            time.sleep(30)

6. Progress Tracking

PROGRESS_FILE = "translation_progress.json"

def load_progress():
    try:
        with open(PROGRESS_FILE) as f:
            return json.load(f)
    except:
        return {"completed": [], "failed": [], "total_cost": 0.0}

def save_progress(state):
    state["last_update"] = datetime.now().isoformat()
    with open(PROGRESS_FILE, 'w') as f:
        json.dump(state, f, indent=2)

API Configuration

OpenRouter (Recommended)

OPENROUTER_API_KEY = "sk-or-v1-..."
MODEL = "meta-llama/llama-3.3-70b-instruct"
API_URL = "https://openrouter.ai/api/v1/chat/completions"

headers = {
    "Authorization": f"Bearer {OPENROUTER_API_KEY}",
    "Content-Type": "application/json"
}

Check Credits

def get_credits():
    r = requests.get("https://openrouter.ai/api/v1/credits",
                     headers={"Authorization": f"Bearer {API_KEY}"})
    data = r.json().get("data", {})
    return data.get("total_credits", 0) - data.get("total_usage", 0)

File Naming Convention

Input	Output
`movie.eng.srt`	`movie.fr.srt`
`episode.en.srt`	`episode.fr.srt`

Cost Estimation Formula

episodes × 250 subtitles × 50 tokens × 2 (in+out) × price/token × 1.5 (overhead)

Example: 600 episodes EN→FR with Llama 3.3 70B:

Naive: 600 × 250 × 50 × 2 × $0.20/M = $3.00
Real: $3.00 × 1.5 = ~$4.50

Common Issues & Solutions

Issue	Solution
HTML tags corrupted	Strip before translation, reapply after
Rate limit errors	Add 0.5s delay between requests
Parsing failures	Retry with exponential backoff
Script stops randomly	Use `run_resilient()` wrapper
Cost higher than expected	Multiply estimates by 1.5x

Skill Chaining

Skills Required Before

None (can work standalone)

Input Expected

Directory path containing .srt files
Source/target languages (default: EN→FR)
API provider choice (OpenRouter/OpenAI/Ollama)
API key (if not Ollama)

Output Produced

Translated .srt files with language suffix
translation_progress.json - tracks completed files
failed_batches.json - retry queue
translation.log - detailed logs

Compatible Skills After

video-transcoding: Burn subtitles into video
media-organization: Organize translated files

Tools Used

Write (create translation script)
Bash (run translation)
Read (check progress/logs)
AskUserQuestion (provider/model selection)
WebFetch (live model pricing)

Visual Workflow

User: "Translate my subtitles to French"
    ↓
[Ask] Which API provider?
    ├─► OpenRouter (recommended)
    ├─► OpenAI
    ├─► Ollama Local
    └─► Ollama Hostinger
    ↓
[Ask] Content complexity?
    ├─► Simple → mistral-7b ($0.03/M)
    ├─► Standard → llama-3.3-70b ($0.20/M)
    └─► Complex → claude-3.5-sonnet ($9/M)
    ↓
[Calculate] Cost estimate × 1.5
    ↓
[Confirm] "~$X for Y episodes. Proceed?"
    ↓
[Execute] translate_srt.py
    ├─► Parse SRT → Strip HTML → Batch (25/req)
    ├─► Translate → Reapply formatting
    └─► Track progress → Queue failures
    ↓
[Done] X.fr.srt files created

Usage Example

Scenario: Translate 600 One Piece episodes EN→FR

Interaction:

Claude asks provider → User: "OpenRouter"
Claude asks complexity → User: "Standard (anime)"
Claude calculates: 600 × $0.007 × 1.5 = ~$6.30
User confirms → Script runs
Result: 600 .fr.srt files, actual cost ~$4-7

Output structure:

/anime/onepiece/
├── Episode.001.eng.srt  (original)
├── Episode.001.fr.srt   (NEW)
├── translation_progress.json
├── failed_batches.json
└── translation.log

julien-media-subtitle-translation

Install Skill

SKILL.md