name	moai-connector-nano-banana
description	Professional image generation with Google Nano Banana Pro (Gemini 3 Pro)
version	1.0.1
modularized	true
tags	enterprise, patterns, nano, banana, architecture
updated	Mon Nov 24 2025 00:00:00 GMT+0000 (Coordinated Universal Time)
status	active

🎯 Quick Reference (30 seconds)

Purpose: Professional image generation using Google's Nano Banana Pro (Gemini 3 Pro Image Preview).

Key Features:

Text-to-Image: Detailed prompts → 1K/2K/4K resolution images
Image-to-Image: Style transfer, object manipulation, editing
Real-time Grounding: Google Search integration for factual content
Multi-Reference: Up to 14 reference images (6 objects + 5 humans)
Advanced Text: Sophisticated text rendering directly in images

Two Models:

Nano Banana Pro (gemini-3-pro-image-preview) - Professional quality, 10-60s
Gemini 2.5 Flash (gemini-2.5-flash-image) - Fast, ~5-15s

Implementation Guide (5 minutes)

Features

Text-to-Image generation with 1K/2K/4K resolutions
Image-to-Image editing and style transfer
Multi-turn refinement for iterative improvements
Reference image guidance (up to 14 references)
Real-time Google Search grounding for factual content
Advanced text rendering directly in images

When to Use

Generating professional visual assets for documentation or marketing
Creating UI mockups and design concepts quickly
Producing social media graphics and promotional images
Illustrating technical documentation with custom diagrams
Rapid prototyping of visual ideas before final design work

Core Patterns

Pattern 1: Structured Prompt for Quality

prompt = """
A serene Japanese garden at golden hour.
Lighting: warm sunset light filtering through maple trees.
Camera: wide-angle 35mm lens, low angle shot.
Composition: Rule of thirds, stone path leading to pagoda.
Color palette: warm gold, jade green, soft cream.
Style: photorealistic with slight cinematic color grading.
Quality: 4K resolution. Final output: PNG.
"""

Pattern 2: Multi-Turn Refinement

Generate initial image with base prompt
Review output and identify areas for improvement
Provide targeted refinement: "Make sky more dramatic"
Iterate up to 5 turns for perfect result

Pattern 3: Reference-Guided Generation

# Use reference images to guide style
generate_image(
    prompt="Mountain landscape in the style of reference",
    reference_images=["style_ref.png", "composition_ref.png"],
    resolution="2K",
    aspect_ratio="16:9"
)

📚 Core Patterns (5-10 minutes)

Pattern 1: Prompt Structure for Quality Images

Key Concept: Well-structured prompts generate better images

Template:

[Scene Description]
A [adjective] [subject] doing [action].
Setting: [location] with [environmental details].

[Photographic Elements]
Lighting: [type], creating [mood].
Camera: [angle] shot with [lens] lens.
Composition: [framing_details].

[Color & Style]
Color palette: [colors]. Style: [art_style].
Quality: [resolution]. Final output: [format].

Example:

A serene Japanese garden at golden hour.
Lighting: warm sunset light, creating peaceful mood.
Camera: wide-angle 35mm lens shot.
Color palette: gold, jade green, cream.
Quality: 4K photorealistic. Final: PNG.

Pattern 2: Text-to-Image Generation

Key Concept: Generate professional images from text prompts

Basic Flow:

Write detailed, structured prompt
Choose resolution (1K, 2K, 4K)
Select aspect ratio (1:1, 16:9, 3:2, etc.)
Enable Google Search for current information (optional)
Generate and retrieve Base64 PNG

Execution:

image_data = generate_image(
    prompt="Your detailed prompt here",
    resolution="2K",
    aspect_ratio="16:9",
    enable_google_search=True,  # For current info
    thinking_process=True        # Auto-optimize
)

Pattern 3: Image-to-Image Editing

Key Concept: Transform existing images with detailed instructions

Common Tasks:

Style Transfer: Convert to art style (Van Gogh, anime, etc.)
Object Manipulation: Add, remove, or modify elements
Composition Change: Reframe, zoom, or reposition subjects
Quality Enhancement: Upscale, improve detail, adjust colors

Flow:

Load original image
Write transformation instruction
Reference images (optional)
Apply edit maintaining coherence
Retrieve edited image

Pattern 4: Multi-Turn Refinement

Key Concept: Iteratively improve images through conversation

Workflow:

Generate initial image
Review output
Provide refinement instruction
Regenerate with improvements
Repeat (max 5 turns)

Example:

Turn 1: "A mountain landscape at sunset"
Turn 2: "Make the sky more dramatic with purple clouds"
Turn 3: "Add a lone tree in foreground"

Pattern 5: Reference Image Guidance

Key Concept: Use reference images to guide generation style

Supported References:

Up to 6 object references
Up to 5 human references
Style influences
Composition guides

Usage:

generate_image(
    prompt="Similar style to reference",
    reference_images=[
        "path/to/style_reference.png",
        "path/to/composition_ref.png"
    ]
)

📖 Advanced Documentation

This Skill uses Progressive Disclosure. For detailed implementation:

modules/prompt-engineering.md - Professional prompt templates
modules/api-reference.md - Complete API documentation
modules/examples.md - Real-world usage examples
modules/troubleshooting.md - Common issues and solutions

🎨 Model Selection Guide

Choose Nano Banana Pro when:

Professional quality required
2K/4K resolution needed
Complex compositions
Sophisticated text in images
Real-time information important
Budget allows (higher cost)

Choose Gemini 2.5 Flash when:

Quick iterations needed
Prototyping and testing
High volume generation
1K resolution sufficient
Speed critical
Cost sensitive

Quick Reference (30 seconds)

Core Purpose: Professional AI image generation using Nano Banana Pro (Gemini 3 Pro) and Gemini 2.5 Flash.

Key Features: Text-to-image, image-to-image editing, multi-turn refinement, reference guidance, 4K resolution.

When to Use: Visual asset creation, prototyping, documentation, UI mockups, marketing materials.

Works Well With

Agents:

design-uiux - UI/UX design integration
code-frontend - Frontend asset implementation
workflow-docs - Visual documentation generation

Skills:

moai-lang-unified - UI/UX implementation with generated assets
moai-docs-generation - Create visual documentation
moai-cc-claude-md - Embed generated images in markdown
moai-domain-frontend - Frontend integration

Commands:

/moai:3-sync - Documentation with visual assets
/moai:9-feedback - Image generation improvements

🔗 Integration with Other Skills

Typical Workflow:

Use this Skill to generate visual assets
Use moai-domain-frontend to implement in UI
Use moai-docs-generation to document with images

📈 Version History

1.0.1 (2025-11-23)

🔄 Refactored with Progressive Disclosure pattern
📚 Detailed prompts moved to modules/
✨ Core patterns highlighted in SKILL.md
✨ Added model selection guide

1.0.0 (2025-11-12)

✨ Nano Banana Pro (Gemini 3 Pro) support
✨ Text-to-Image and Image-to-Image
✨ Multi-turn refinement capability
✨ Reference image guidance

Maintained by: alfred Domain: Image Generation & Visual Creation Generated with: MoAI-ADK Skill Factory

Install Skill

SKILL.md