feat: Add intelligent auto-router and enhanced integrations

- Add intelligent-router.sh hook for automatic agent routing
- Add AUTO-TRIGGER-SUMMARY.md documentation
- Add FINAL-INTEGRATION-SUMMARY.md documentation
- Complete Prometheus integration (6 commands + 4 tools)
- Complete Dexto integration (12 commands + 5 tools)
- Enhanced Ralph with access to all agents
- Fix /clawd command (removed disable-model-invocation)
- Update hooks.json to v5 with intelligent routing
- 291 total skills now available
- All 21 commands with automatic routing

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
admin
2026-01-28 00:27:56 +04:00
Unverified
parent 3b128ba3bd
commit b52318eeae
1724 changed files with 351216 additions and 0 deletions

View File

@@ -0,0 +1,200 @@
# Nano Banana Agent
A Dexto agent that provides access to Google's **Gemini 2.5 Flash Image** model for image generation and editing through a lean, powerful MCP server.
## 🎯 What is Gemini 2.5 Flash Image?
Gemini 2.5 Flash Image is Google's cutting-edge AI model that enables:
- **Near-instantaneous** image generation and editing
- **Object removal** with perfect background preservation
- **Background alteration** while maintaining subject integrity
- **Image fusion** for creative compositions
- **Style modification** with character consistency
- **Visible and invisible watermarks** (SynthID) for digital safety
## 🚀 Key Features
### Core Capabilities
- **Image Generation**: Create images from text prompts with various styles and aspect ratios
- **Image Editing**: Modify existing images based on natural language descriptions
- **Object Removal**: Remove unwanted objects while preserving the background
- **Background Changes**: Replace backgrounds while keeping subjects intact
- **Image Fusion**: Combine multiple images into creative compositions
- **Style Transfer**: Apply artistic styles to images
### Advanced Features
- **Character Consistency**: Maintain facial features and identities across edits
- **Scene Preservation**: Seamless blending with original lighting and composition
- **Multi-Image Processing**: Handle batch operations and complex compositions
- **Safety Features**: Built-in safety filters and provenance signals
## 🛠️ Setup
### Prerequisites
- Dexto framework installed
- Google AI API key (Gemini API access)
- Node.js 20.0.0 or higher
### Installation
1. **Set up environment variables**:
```bash
export GOOGLE_GENERATIVE_AI_API_KEY="your-google-ai-api-key"
# or
export GEMINI_API_KEY="your-google-ai-api-key"
```
2. **Run the agent** (the MCP server will be automatically downloaded via npx):
```bash
# From the dexto repository root
npx dexto -a agents/nano-banana-agent/nano-banana-agent.yml
```
The agent configuration uses `npx @truffle-ai/nano-banana-server` to automatically download and run the latest version of the MCP server.
## 📋 Available Tools
The agent provides access to 3 essential tools:
### 1. `generate_image`
Generate new images from text prompts.
**Example:**
```
Generate a majestic mountain landscape at sunset in realistic style with 16:9 aspect ratio
```
### 2. `process_image`
Process existing images based on detailed instructions. This tool can handle any image editing task including object removal, background changes, style transfer, adding elements, and more.
**Example:**
```
Remove the red car in the background from /path/to/photo.jpg
```
**Example:**
```
Change the background of /path/to/portrait.jpg to a beach sunset with palm trees
```
**Example:**
```
Apply Van Gogh painting style with thick brushstrokes to /path/to/photo.jpg
```
### 3. `process_multiple_images`
Process multiple images together based on detailed instructions. This tool can combine images, create collages, blend compositions, or perform any multi-image operation.
**Example:**
```
Place the person from /path/to/person.jpg into the landscape from /path/to/landscape.jpg as if they were standing there
```
## 📤 Response Format
Successful operations return both image data and metadata:
```json
{
"content": [
{
"type": "image",
"data": "base64-encoded-image-data",
"mimeType": "image/png"
},
{
"type": "text",
"text": "{\n \"output_path\": \"/absolute/path/to/saved/image.png\",\n \"size_bytes\": 12345,\n \"format\": \"image/png\"\n}"
}
]
}
```
## 🎨 Popular Use Cases
### 1. **Selfie Enhancement**
- Remove blemishes and unwanted objects
- Change backgrounds for professional photos
- Apply artistic filters and styles
- Create figurine effects (Nano Banana's signature feature)
### 2. **Product Photography**
- Remove backgrounds for clean product shots
- Add or remove objects from scenes
- Apply consistent styling across product images
### 3. **Creative Compositions**
- Fuse multiple images into unique scenes
- Apply artistic styles to photos
- Create imaginative scenarios from real photos
### 4. **Content Creation**
- Generate images for social media
- Create variations of existing content
- Apply brand-consistent styling
## 🔧 Configuration
### Environment Variables
- `GOOGLE_GENERATIVE_AI_API_KEY` or `GEMINI_API_KEY`: Your Google AI API key (required)
### Agent Settings
- **LLM Provider**: Google Gemini 2.5 Flash
- **Storage**: In-memory cache with SQLite database
- **Tool Confirmation**: Auto-approve mode for better development experience
## 📁 Supported Formats
**Input/Output Formats:**
- JPEG (.jpg, .jpeg)
- PNG (.png)
- WebP (.webp)
- GIF (.gif)
**File Size Limits:**
- Maximum: 20MB per image
- Recommended: Under 10MB for optimal performance
## 🎯 Example Interactions
### Generate a Creative Image
```
User: "Generate a futuristic cityscape at night with flying cars and neon lights"
Agent: I'll create a futuristic cityscape image for you using Nano Banana's image generation capabilities.
```
### Remove Unwanted Objects
```
User: "Remove the power lines from this photo: /path/to/landscape.jpg"
Agent: I'll remove the power lines from your landscape photo while preserving the natural background.
```
### Create Figurine Effect
```
User: "Transform this selfie into a mini figurine on a desk: /path/to/selfie.jpg"
Agent: I'll create Nano Banana's signature figurine effect, transforming your selfie into a mini figurine displayed on a desk.
```
### Change Background
```
User: "Change the background of this portrait to a professional office setting: /path/to/portrait.jpg"
Agent: I'll replace the background with a professional office setting while keeping you as the main subject.
```
## 🔒 Safety & Ethics
Nano Banana includes built-in safety features:
- **SynthID Watermarks**: Invisible provenance signals
- **Safety Filters**: Content moderation and filtering
- **Character Consistency**: Maintains identity integrity
- **Responsible AI**: Designed to prevent misuse
## 🤝 Contributing
We welcome contributions! Please see our [Contributing Guidelines](../../CONTRIBUTING.md) for details.
## 📄 License
This project is licensed under the MIT License - see the [LICENSE](../../LICENSE) file for details.
---
**Note**: This agent provides access to Google's Gemini 2.5 Flash Image model through the MCP protocol. The implementation returns both image content (base64-encoded) and text metadata according to MCP specifications, allowing for direct image display in compatible clients. A valid Google AI API key is required and usage is subject to Google's terms of service and usage limits.

View File

@@ -0,0 +1,110 @@
# Dexto Agent Configuration for Nano Banana (Gemini 2.5 Flash Image) MCP Server
# Generated on 2025-01-27T00:00:00.000Z
# Optional greeting shown at chat start (UI can consume this)
greeting: "🍌 Hi! I'm your Nano Banana Agent. Let's create something amazing together!"
systemPrompt: |
You are an AI assistant specialized in advanced image generation and editing using Google's Nano Banana (Gemini 2.5 Flash Image) model. You have access to cutting-edge AI tools for:
- **Image Generation**: Create stunning images from text prompts with various styles and aspect ratios
- **Image Editing**: Modify existing images using natural language descriptions
- **Object Removal**: Remove unwanted objects while perfectly preserving the background
- **Background Changes**: Replace backgrounds seamlessly while keeping subjects intact
- **Image Fusion**: Combine multiple images into creative compositions
- **Style Transfer**: Apply artistic styles to images with character consistency
- **Advanced Features**: Character consistency, scene preservation, and multi-image processing
When working with images:
1. Always validate that input images exist and are in supported formats (JPG, PNG, WebP, GIF)
2. Provide clear feedback about what operations you're performing
3. Save processed images with descriptive names
4. Include image information (dimensions, file size, format) in your responses
5. Suggest additional enhancements and creative possibilities when appropriate
6. Leverage Nano Banana's signature features like the figurine effect and character consistency
Key Nano Banana Capabilities:
- **Near-instantaneous** processing with high visual coherence
- **Character consistency** across multiple edits
- **Scene preservation** with seamless background blending
- **Safety features** including SynthID watermarks
- **Multi-image processing** for complex compositions
Popular use cases:
- Selfie enhancement and creative variations
- Product photography with clean backgrounds
- Artistic style applications
- Object removal from photos
- Background replacement for portraits
- Creating figurine effects (Nano Banana's signature feature)
- Image fusion for creative compositions
Supported image formats: JPG, JPEG, PNG, WebP, GIF
Maximum file size: 20MB per image
mcpServers:
nano_banana:
type: stdio
command: npx
args:
- -y
- "@truffle-ai/nano-banana-server"
connectionMode: strict
env:
GEMINI_API_KEY: $GOOGLE_GENERATIVE_AI_API_KEY
timeout: 60000
toolConfirmation:
mode: "auto-approve"
allowedToolsStorage: "memory"
llm:
provider: google
model: gemini-2.5-flash
apiKey: $GOOGLE_GENERATIVE_AI_API_KEY
storage:
cache:
type: in-memory
database:
type: sqlite
blob:
type: local # CLI provides storePath automatically
maxBlobSize: 52428800 # 50MB per blob
maxTotalSize: 1073741824 # 1GB total storage
cleanupAfterDays: 30
# Prompts - image generation and editing examples shown as clickable buttons in WebUI
prompts:
- type: inline
id: generate-landscape
title: "🎨 Generate Landscape"
description: "Create a scenic image from text"
prompt: "Generate a stunning image of a majestic mountain landscape at sunset with vibrant colors and dramatic clouds."
category: generation
priority: 10
showInStarters: true
- type: inline
id: generate-portrait
title: "👤 Generate Portrait"
description: "Create portrait images"
prompt: "Generate a professional portrait of a person in business attire with a clean studio background."
category: generation
priority: 9
showInStarters: true
- type: inline
id: generate-abstract
title: "🌀 Generate Abstract Art"
description: "Create abstract artistic images"
prompt: "Generate an abstract art piece with swirling colors and geometric patterns inspired by Kandinsky."
category: generation
priority: 8
showInStarters: true
- type: inline
id: generate-product
title: "📦 Generate Product Image"
description: "Create product photography"
prompt: "Generate a professional product photo of a sleek modern smartphone on a minimalist white background."
category: generation
priority: 7
showInStarters: true