feat: Add intelligent auto-router and enhanced integrations
- Add intelligent-router.sh hook for automatic agent routing - Add AUTO-TRIGGER-SUMMARY.md documentation - Add FINAL-INTEGRATION-SUMMARY.md documentation - Complete Prometheus integration (6 commands + 4 tools) - Complete Dexto integration (12 commands + 5 tools) - Enhanced Ralph with access to all agents - Fix /clawd command (removed disable-model-invocation) - Update hooks.json to v5 with intelligent routing - 291 total skills now available - All 21 commands with automatic routing 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
200
dexto/agents/nano-banana-agent/README.md
Normal file
200
dexto/agents/nano-banana-agent/README.md
Normal file
@@ -0,0 +1,200 @@
|
||||
# Nano Banana Agent
|
||||
|
||||
A Dexto agent that provides access to Google's **Gemini 2.5 Flash Image** model for image generation and editing through a lean, powerful MCP server.
|
||||
|
||||
## 🎯 What is Gemini 2.5 Flash Image?
|
||||
|
||||
Gemini 2.5 Flash Image is Google's cutting-edge AI model that enables:
|
||||
- **Near-instantaneous** image generation and editing
|
||||
- **Object removal** with perfect background preservation
|
||||
- **Background alteration** while maintaining subject integrity
|
||||
- **Image fusion** for creative compositions
|
||||
- **Style modification** with character consistency
|
||||
- **Visible and invisible watermarks** (SynthID) for digital safety
|
||||
|
||||
## 🚀 Key Features
|
||||
|
||||
### Core Capabilities
|
||||
- **Image Generation**: Create images from text prompts with various styles and aspect ratios
|
||||
- **Image Editing**: Modify existing images based on natural language descriptions
|
||||
- **Object Removal**: Remove unwanted objects while preserving the background
|
||||
- **Background Changes**: Replace backgrounds while keeping subjects intact
|
||||
- **Image Fusion**: Combine multiple images into creative compositions
|
||||
- **Style Transfer**: Apply artistic styles to images
|
||||
|
||||
### Advanced Features
|
||||
- **Character Consistency**: Maintain facial features and identities across edits
|
||||
- **Scene Preservation**: Seamless blending with original lighting and composition
|
||||
- **Multi-Image Processing**: Handle batch operations and complex compositions
|
||||
- **Safety Features**: Built-in safety filters and provenance signals
|
||||
|
||||
## 🛠️ Setup
|
||||
|
||||
### Prerequisites
|
||||
- Dexto framework installed
|
||||
- Google AI API key (Gemini API access)
|
||||
- Node.js 20.0.0 or higher
|
||||
|
||||
### Installation
|
||||
1. **Set up environment variables**:
|
||||
```bash
|
||||
export GOOGLE_GENERATIVE_AI_API_KEY="your-google-ai-api-key"
|
||||
# or
|
||||
export GEMINI_API_KEY="your-google-ai-api-key"
|
||||
```
|
||||
|
||||
2. **Run the agent** (the MCP server will be automatically downloaded via npx):
|
||||
```bash
|
||||
# From the dexto repository root
|
||||
npx dexto -a agents/nano-banana-agent/nano-banana-agent.yml
|
||||
```
|
||||
|
||||
The agent configuration uses `npx @truffle-ai/nano-banana-server` to automatically download and run the latest version of the MCP server.
|
||||
|
||||
## 📋 Available Tools
|
||||
|
||||
The agent provides access to 3 essential tools:
|
||||
|
||||
### 1. `generate_image`
|
||||
Generate new images from text prompts.
|
||||
|
||||
**Example:**
|
||||
```
|
||||
Generate a majestic mountain landscape at sunset in realistic style with 16:9 aspect ratio
|
||||
```
|
||||
|
||||
### 2. `process_image`
|
||||
Process existing images based on detailed instructions. This tool can handle any image editing task including object removal, background changes, style transfer, adding elements, and more.
|
||||
|
||||
**Example:**
|
||||
```
|
||||
Remove the red car in the background from /path/to/photo.jpg
|
||||
```
|
||||
|
||||
**Example:**
|
||||
```
|
||||
Change the background of /path/to/portrait.jpg to a beach sunset with palm trees
|
||||
```
|
||||
|
||||
**Example:**
|
||||
```
|
||||
Apply Van Gogh painting style with thick brushstrokes to /path/to/photo.jpg
|
||||
```
|
||||
|
||||
### 3. `process_multiple_images`
|
||||
Process multiple images together based on detailed instructions. This tool can combine images, create collages, blend compositions, or perform any multi-image operation.
|
||||
|
||||
**Example:**
|
||||
```
|
||||
Place the person from /path/to/person.jpg into the landscape from /path/to/landscape.jpg as if they were standing there
|
||||
```
|
||||
|
||||
## 📤 Response Format
|
||||
|
||||
Successful operations return both image data and metadata:
|
||||
```json
|
||||
{
|
||||
"content": [
|
||||
{
|
||||
"type": "image",
|
||||
"data": "base64-encoded-image-data",
|
||||
"mimeType": "image/png"
|
||||
},
|
||||
{
|
||||
"type": "text",
|
||||
"text": "{\n \"output_path\": \"/absolute/path/to/saved/image.png\",\n \"size_bytes\": 12345,\n \"format\": \"image/png\"\n}"
|
||||
}
|
||||
]
|
||||
}
|
||||
```
|
||||
|
||||
## 🎨 Popular Use Cases
|
||||
|
||||
### 1. **Selfie Enhancement**
|
||||
- Remove blemishes and unwanted objects
|
||||
- Change backgrounds for professional photos
|
||||
- Apply artistic filters and styles
|
||||
- Create figurine effects (Nano Banana's signature feature)
|
||||
|
||||
### 2. **Product Photography**
|
||||
- Remove backgrounds for clean product shots
|
||||
- Add or remove objects from scenes
|
||||
- Apply consistent styling across product images
|
||||
|
||||
### 3. **Creative Compositions**
|
||||
- Fuse multiple images into unique scenes
|
||||
- Apply artistic styles to photos
|
||||
- Create imaginative scenarios from real photos
|
||||
|
||||
### 4. **Content Creation**
|
||||
- Generate images for social media
|
||||
- Create variations of existing content
|
||||
- Apply brand-consistent styling
|
||||
|
||||
## 🔧 Configuration
|
||||
|
||||
### Environment Variables
|
||||
- `GOOGLE_GENERATIVE_AI_API_KEY` or `GEMINI_API_KEY`: Your Google AI API key (required)
|
||||
|
||||
### Agent Settings
|
||||
- **LLM Provider**: Google Gemini 2.5 Flash
|
||||
- **Storage**: In-memory cache with SQLite database
|
||||
- **Tool Confirmation**: Auto-approve mode for better development experience
|
||||
|
||||
## 📁 Supported Formats
|
||||
|
||||
**Input/Output Formats:**
|
||||
- JPEG (.jpg, .jpeg)
|
||||
- PNG (.png)
|
||||
- WebP (.webp)
|
||||
- GIF (.gif)
|
||||
|
||||
**File Size Limits:**
|
||||
- Maximum: 20MB per image
|
||||
- Recommended: Under 10MB for optimal performance
|
||||
|
||||
## 🎯 Example Interactions
|
||||
|
||||
### Generate a Creative Image
|
||||
```
|
||||
User: "Generate a futuristic cityscape at night with flying cars and neon lights"
|
||||
Agent: I'll create a futuristic cityscape image for you using Nano Banana's image generation capabilities.
|
||||
```
|
||||
|
||||
### Remove Unwanted Objects
|
||||
```
|
||||
User: "Remove the power lines from this photo: /path/to/landscape.jpg"
|
||||
Agent: I'll remove the power lines from your landscape photo while preserving the natural background.
|
||||
```
|
||||
|
||||
### Create Figurine Effect
|
||||
```
|
||||
User: "Transform this selfie into a mini figurine on a desk: /path/to/selfie.jpg"
|
||||
Agent: I'll create Nano Banana's signature figurine effect, transforming your selfie into a mini figurine displayed on a desk.
|
||||
```
|
||||
|
||||
### Change Background
|
||||
```
|
||||
User: "Change the background of this portrait to a professional office setting: /path/to/portrait.jpg"
|
||||
Agent: I'll replace the background with a professional office setting while keeping you as the main subject.
|
||||
```
|
||||
|
||||
## 🔒 Safety & Ethics
|
||||
|
||||
Nano Banana includes built-in safety features:
|
||||
- **SynthID Watermarks**: Invisible provenance signals
|
||||
- **Safety Filters**: Content moderation and filtering
|
||||
- **Character Consistency**: Maintains identity integrity
|
||||
- **Responsible AI**: Designed to prevent misuse
|
||||
|
||||
## 🤝 Contributing
|
||||
|
||||
We welcome contributions! Please see our [Contributing Guidelines](../../CONTRIBUTING.md) for details.
|
||||
|
||||
## 📄 License
|
||||
|
||||
This project is licensed under the MIT License - see the [LICENSE](../../LICENSE) file for details.
|
||||
|
||||
---
|
||||
|
||||
**Note**: This agent provides access to Google's Gemini 2.5 Flash Image model through the MCP protocol. The implementation returns both image content (base64-encoded) and text metadata according to MCP specifications, allowing for direct image display in compatible clients. A valid Google AI API key is required and usage is subject to Google's terms of service and usage limits.
|
||||
110
dexto/agents/nano-banana-agent/nano-banana-agent.yml
Normal file
110
dexto/agents/nano-banana-agent/nano-banana-agent.yml
Normal file
@@ -0,0 +1,110 @@
|
||||
# Dexto Agent Configuration for Nano Banana (Gemini 2.5 Flash Image) MCP Server
|
||||
# Generated on 2025-01-27T00:00:00.000Z
|
||||
|
||||
# Optional greeting shown at chat start (UI can consume this)
|
||||
greeting: "🍌 Hi! I'm your Nano Banana Agent. Let's create something amazing together!"
|
||||
|
||||
systemPrompt: |
|
||||
You are an AI assistant specialized in advanced image generation and editing using Google's Nano Banana (Gemini 2.5 Flash Image) model. You have access to cutting-edge AI tools for:
|
||||
|
||||
- **Image Generation**: Create stunning images from text prompts with various styles and aspect ratios
|
||||
- **Image Editing**: Modify existing images using natural language descriptions
|
||||
- **Object Removal**: Remove unwanted objects while perfectly preserving the background
|
||||
- **Background Changes**: Replace backgrounds seamlessly while keeping subjects intact
|
||||
- **Image Fusion**: Combine multiple images into creative compositions
|
||||
- **Style Transfer**: Apply artistic styles to images with character consistency
|
||||
- **Advanced Features**: Character consistency, scene preservation, and multi-image processing
|
||||
|
||||
When working with images:
|
||||
1. Always validate that input images exist and are in supported formats (JPG, PNG, WebP, GIF)
|
||||
2. Provide clear feedback about what operations you're performing
|
||||
3. Save processed images with descriptive names
|
||||
4. Include image information (dimensions, file size, format) in your responses
|
||||
5. Suggest additional enhancements and creative possibilities when appropriate
|
||||
6. Leverage Nano Banana's signature features like the figurine effect and character consistency
|
||||
|
||||
Key Nano Banana Capabilities:
|
||||
- **Near-instantaneous** processing with high visual coherence
|
||||
- **Character consistency** across multiple edits
|
||||
- **Scene preservation** with seamless background blending
|
||||
- **Safety features** including SynthID watermarks
|
||||
- **Multi-image processing** for complex compositions
|
||||
|
||||
Popular use cases:
|
||||
- Selfie enhancement and creative variations
|
||||
- Product photography with clean backgrounds
|
||||
- Artistic style applications
|
||||
- Object removal from photos
|
||||
- Background replacement for portraits
|
||||
- Creating figurine effects (Nano Banana's signature feature)
|
||||
- Image fusion for creative compositions
|
||||
|
||||
Supported image formats: JPG, JPEG, PNG, WebP, GIF
|
||||
Maximum file size: 20MB per image
|
||||
|
||||
mcpServers:
|
||||
nano_banana:
|
||||
type: stdio
|
||||
command: npx
|
||||
args:
|
||||
- -y
|
||||
- "@truffle-ai/nano-banana-server"
|
||||
connectionMode: strict
|
||||
env:
|
||||
GEMINI_API_KEY: $GOOGLE_GENERATIVE_AI_API_KEY
|
||||
timeout: 60000
|
||||
|
||||
toolConfirmation:
|
||||
mode: "auto-approve"
|
||||
allowedToolsStorage: "memory"
|
||||
|
||||
llm:
|
||||
provider: google
|
||||
model: gemini-2.5-flash
|
||||
apiKey: $GOOGLE_GENERATIVE_AI_API_KEY
|
||||
|
||||
storage:
|
||||
cache:
|
||||
type: in-memory
|
||||
database:
|
||||
type: sqlite
|
||||
blob:
|
||||
type: local # CLI provides storePath automatically
|
||||
maxBlobSize: 52428800 # 50MB per blob
|
||||
maxTotalSize: 1073741824 # 1GB total storage
|
||||
cleanupAfterDays: 30
|
||||
|
||||
# Prompts - image generation and editing examples shown as clickable buttons in WebUI
|
||||
prompts:
|
||||
- type: inline
|
||||
id: generate-landscape
|
||||
title: "🎨 Generate Landscape"
|
||||
description: "Create a scenic image from text"
|
||||
prompt: "Generate a stunning image of a majestic mountain landscape at sunset with vibrant colors and dramatic clouds."
|
||||
category: generation
|
||||
priority: 10
|
||||
showInStarters: true
|
||||
- type: inline
|
||||
id: generate-portrait
|
||||
title: "👤 Generate Portrait"
|
||||
description: "Create portrait images"
|
||||
prompt: "Generate a professional portrait of a person in business attire with a clean studio background."
|
||||
category: generation
|
||||
priority: 9
|
||||
showInStarters: true
|
||||
- type: inline
|
||||
id: generate-abstract
|
||||
title: "🌀 Generate Abstract Art"
|
||||
description: "Create abstract artistic images"
|
||||
prompt: "Generate an abstract art piece with swirling colors and geometric patterns inspired by Kandinsky."
|
||||
category: generation
|
||||
priority: 8
|
||||
showInStarters: true
|
||||
- type: inline
|
||||
id: generate-product
|
||||
title: "📦 Generate Product Image"
|
||||
description: "Create product photography"
|
||||
prompt: "Generate a professional product photo of a sleek modern smartphone on a minimalist white background."
|
||||
category: generation
|
||||
priority: 7
|
||||
showInStarters: true
|
||||
Reference in New Issue
Block a user