feat: Add intelligent auto-router and enhanced integrations

- Add intelligent-router.sh hook for automatic agent routing - Add AUTO-TRIGGER-SUMMARY.md documentation - Add FINAL-INTEGRATION-SUMMARY.md documentation - Complete Prometheus integration (6 commands + 4 tools) - Complete Dexto integration (12 commands + 5 tools) - Enhanced Ralph with access to all agents - Fix /clawd command (removed disable-model-invocation) - Update hooks.json to v5 with intelligent routing - 291 total skills now available - All 21 commands with automatic routing 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
2026-01-28 00:27:56 +04:00
parent 3b128ba3bd
commit b52318eeae
1724 changed files with 351216 additions and 0 deletions
--- a/dexto/agents/nano-banana-agent/README.md
+++ b/dexto/agents/nano-banana-agent/README.md
@@ -0,0 +1,200 @@
+# Nano Banana Agent
+
+A Dexto agent that provides access to Google's **Gemini 2.5 Flash Image** model for image generation and editing through a lean, powerful MCP server.
+
+## 🎯 What is Gemini 2.5 Flash Image?
+
+Gemini 2.5 Flash Image is Google's cutting-edge AI model that enables:
+- **Near-instantaneous** image generation and editing
+- **Object removal** with perfect background preservation
+- **Background alteration** while maintaining subject integrity
+- **Image fusion** for creative compositions
+- **Style modification** with character consistency
+- **Visible and invisible watermarks** (SynthID) for digital safety
+
+## 🚀 Key Features
+
+### Core Capabilities
+- **Image Generation**: Create images from text prompts with various styles and aspect ratios
+- **Image Editing**: Modify existing images based on natural language descriptions
+- **Object Removal**: Remove unwanted objects while preserving the background
+- **Background Changes**: Replace backgrounds while keeping subjects intact
+- **Image Fusion**: Combine multiple images into creative compositions
+- **Style Transfer**: Apply artistic styles to images
+
+### Advanced Features
+- **Character Consistency**: Maintain facial features and identities across edits
+- **Scene Preservation**: Seamless blending with original lighting and composition
+- **Multi-Image Processing**: Handle batch operations and complex compositions
+- **Safety Features**: Built-in safety filters and provenance signals
+
+## 🛠️ Setup
+
+### Prerequisites
+- Dexto framework installed
+- Google AI API key (Gemini API access)
+- Node.js 20.0.0 or higher
+
+### Installation
+1. **Set up environment variables**:
+   ```bash
+   export GOOGLE_GENERATIVE_AI_API_KEY="your-google-ai-api-key"
+   # or
+   export GEMINI_API_KEY="your-google-ai-api-key"
+   ```
+
+2. **Run the agent** (the MCP server will be automatically downloaded via npx):
+   ```bash
+   # From the dexto repository root
+   npx dexto -a agents/nano-banana-agent/nano-banana-agent.yml
+   ```
+
+The agent configuration uses `npx @truffle-ai/nano-banana-server` to automatically download and run the latest version of the MCP server.
+
+## 📋 Available Tools
+
+The agent provides access to 3 essential tools:
+
+### 1. `generate_image`
+Generate new images from text prompts.
+
+**Example:**
+```
+Generate a majestic mountain landscape at sunset in realistic style with 16:9 aspect ratio
+```
+
+### 2. `process_image`
+Process existing images based on detailed instructions. This tool can handle any image editing task including object removal, background changes, style transfer, adding elements, and more.
+
+**Example:**
+```
+Remove the red car in the background from /path/to/photo.jpg
+```
+
+**Example:**
+```
+Change the background of /path/to/portrait.jpg to a beach sunset with palm trees
+```
+
+**Example:**
+```
+Apply Van Gogh painting style with thick brushstrokes to /path/to/photo.jpg
+```
+
+### 3. `process_multiple_images`
+Process multiple images together based on detailed instructions. This tool can combine images, create collages, blend compositions, or perform any multi-image operation.
+
+**Example:**
+```
+Place the person from /path/to/person.jpg into the landscape from /path/to/landscape.jpg as if they were standing there
+```
+
+## 📤 Response Format
+
+Successful operations return both image data and metadata:
+```json
+{
+  "content": [
+    {
+      "type": "image",
+      "data": "base64-encoded-image-data",
+      "mimeType": "image/png"
+    },
+    {
+      "type": "text",
+      "text": "{\n  \"output_path\": \"/absolute/path/to/saved/image.png\",\n  \"size_bytes\": 12345,\n  \"format\": \"image/png\"\n}"
+    }
+  ]
+}
+```
+
+## 🎨 Popular Use Cases
+
+### 1. **Selfie Enhancement**
+- Remove blemishes and unwanted objects
+- Change backgrounds for professional photos
+- Apply artistic filters and styles
+- Create figurine effects (Nano Banana's signature feature)
+
+### 2. **Product Photography**
+- Remove backgrounds for clean product shots
+- Add or remove objects from scenes
+- Apply consistent styling across product images
+
+### 3. **Creative Compositions**
+- Fuse multiple images into unique scenes
+- Apply artistic styles to photos
+- Create imaginative scenarios from real photos
+
+### 4. **Content Creation**
+- Generate images for social media
+- Create variations of existing content
+- Apply brand-consistent styling
+
+## 🔧 Configuration
+
+### Environment Variables
+- `GOOGLE_GENERATIVE_AI_API_KEY` or `GEMINI_API_KEY`: Your Google AI API key (required)
+
+### Agent Settings
+- **LLM Provider**: Google Gemini 2.5 Flash
+- **Storage**: In-memory cache with SQLite database
+- **Tool Confirmation**: Auto-approve mode for better development experience
+
+## 📁 Supported Formats
+
+**Input/Output Formats:**
+- JPEG (.jpg, .jpeg)
+- PNG (.png)
+- WebP (.webp)
+- GIF (.gif)
+
+**File Size Limits:**
+- Maximum: 20MB per image
+- Recommended: Under 10MB for optimal performance
+
+## 🎯 Example Interactions
+
+### Generate a Creative Image
+```
+User: "Generate a futuristic cityscape at night with flying cars and neon lights"
+Agent: I'll create a futuristic cityscape image for you using Nano Banana's image generation capabilities.
+```
+
+### Remove Unwanted Objects
+```
+User: "Remove the power lines from this photo: /path/to/landscape.jpg"
+Agent: I'll remove the power lines from your landscape photo while preserving the natural background.
+```
+
+### Create Figurine Effect
+```
+User: "Transform this selfie into a mini figurine on a desk: /path/to/selfie.jpg"
+Agent: I'll create Nano Banana's signature figurine effect, transforming your selfie into a mini figurine displayed on a desk.
+```
+
+### Change Background
+```
+User: "Change the background of this portrait to a professional office setting: /path/to/portrait.jpg"
+Agent: I'll replace the background with a professional office setting while keeping you as the main subject.
+```
+
+## 🔒 Safety & Ethics
+
+Nano Banana includes built-in safety features:
+- **SynthID Watermarks**: Invisible provenance signals
+- **Safety Filters**: Content moderation and filtering
+- **Character Consistency**: Maintains identity integrity
+- **Responsible AI**: Designed to prevent misuse
+
+## 🤝 Contributing
+
+We welcome contributions! Please see our [Contributing Guidelines](../../CONTRIBUTING.md) for details.
+
+## 📄 License
+
+This project is licensed under the MIT License - see the [LICENSE](../../LICENSE) file for details.
+
+---
+
+**Note**: This agent provides access to Google's Gemini 2.5 Flash Image model through the MCP protocol. The implementation returns both image content (base64-encoded) and text metadata according to MCP specifications, allowing for direct image display in compatible clients. A valid Google AI API key is required and usage is subject to Google's terms of service and usage limits.
--- a/dexto/agents/nano-banana-agent/nano-banana-agent.yml
+++ b/dexto/agents/nano-banana-agent/nano-banana-agent.yml
@@ -0,0 +1,110 @@
+# Dexto Agent Configuration for Nano Banana (Gemini 2.5 Flash Image) MCP Server
+# Generated on 2025-01-27T00:00:00.000Z
+
+# Optional greeting shown at chat start (UI can consume this)
+greeting: "🍌 Hi! I'm your Nano Banana Agent. Let's create something amazing together!"
+
+systemPrompt: |
+  You are an AI assistant specialized in advanced image generation and editing using Google's Nano Banana (Gemini 2.5 Flash Image) model. You have access to cutting-edge AI tools for:
+
+  - **Image Generation**: Create stunning images from text prompts with various styles and aspect ratios
+  - **Image Editing**: Modify existing images using natural language descriptions
+  - **Object Removal**: Remove unwanted objects while perfectly preserving the background
+  - **Background Changes**: Replace backgrounds seamlessly while keeping subjects intact
+  - **Image Fusion**: Combine multiple images into creative compositions
+  - **Style Transfer**: Apply artistic styles to images with character consistency
+  - **Advanced Features**: Character consistency, scene preservation, and multi-image processing
+
+  When working with images:
+  1. Always validate that input images exist and are in supported formats (JPG, PNG, WebP, GIF)
+  2. Provide clear feedback about what operations you're performing
+  3. Save processed images with descriptive names
+  4. Include image information (dimensions, file size, format) in your responses
+  5. Suggest additional enhancements and creative possibilities when appropriate
+  6. Leverage Nano Banana's signature features like the figurine effect and character consistency
+
+  Key Nano Banana Capabilities:
+  - **Near-instantaneous** processing with high visual coherence
+  - **Character consistency** across multiple edits
+  - **Scene preservation** with seamless background blending
+  - **Safety features** including SynthID watermarks
+  - **Multi-image processing** for complex compositions
+
+  Popular use cases:
+  - Selfie enhancement and creative variations
+  - Product photography with clean backgrounds
+  - Artistic style applications
+  - Object removal from photos
+  - Background replacement for portraits
+  - Creating figurine effects (Nano Banana's signature feature)
+  - Image fusion for creative compositions
+
+  Supported image formats: JPG, JPEG, PNG, WebP, GIF
+  Maximum file size: 20MB per image
+
+mcpServers:
+  nano_banana:
+    type: stdio
+    command: npx
+    args:
+      - -y
+      - "@truffle-ai/nano-banana-server"
+    connectionMode: strict
+    env:
+      GEMINI_API_KEY: $GOOGLE_GENERATIVE_AI_API_KEY
+    timeout: 60000
+
+toolConfirmation:
+  mode: "auto-approve"
+  allowedToolsStorage: "memory"
+
+llm:
+  provider: google
+  model: gemini-2.5-flash
+  apiKey: $GOOGLE_GENERATIVE_AI_API_KEY
+
+storage:
+  cache:
+    type: in-memory
+  database:
+    type: sqlite
+  blob:
+    type: local  # CLI provides storePath automatically
+    maxBlobSize: 52428800     # 50MB per blob
+    maxTotalSize: 1073741824  # 1GB total storage
+    cleanupAfterDays: 30
+
+# Prompts - image generation and editing examples shown as clickable buttons in WebUI
+prompts:
+  - type: inline
+    id: generate-landscape
+    title: "🎨 Generate Landscape"
+    description: "Create a scenic image from text"
+    prompt: "Generate a stunning image of a majestic mountain landscape at sunset with vibrant colors and dramatic clouds."
+    category: generation
+    priority: 10
+    showInStarters: true
+  - type: inline
+    id: generate-portrait
+    title: "👤 Generate Portrait"
+    description: "Create portrait images"
+    prompt: "Generate a professional portrait of a person in business attire with a clean studio background."
+    category: generation
+    priority: 9
+    showInStarters: true
+  - type: inline
+    id: generate-abstract
+    title: "🌀 Generate Abstract Art"
+    description: "Create abstract artistic images"
+    prompt: "Generate an abstract art piece with swirling colors and geometric patterns inspired by Kandinsky."
+    category: generation
+    priority: 8
+    showInStarters: true
+  - type: inline
+    id: generate-product
+    title: "📦 Generate Product Image"
+    description: "Create product photography"
+    prompt: "Generate a professional product photo of a sleek modern smartphone on a minimalist white background."
+    category: generation
+    priority: 7
+    showInStarters: true