19 KiB
GLM Tools, Skills, and Agents Documentation
This document provides a comprehensive overview of all agents, skills, and tools available in the Super Z system.
Table of Contents
Subagents (Task Tool)
The Task tool launches specialized agents (subprocesses) that autonomously handle complex tasks. Each agent type has specific capabilities and tools available to it.
Agent Types
1. general-purpose
Description: General-purpose agent for researching complex questions, searching for code, and executing multi-step tasks. Use this when you need to perform search for a keyword or file and are not confident that you will find the right match in the first few tries.
Tools Available: All tools (*)
Use Cases:
- Complex research questions
- Multi-step tasks requiring various tools
- Code searching across multiple files
- Open-ended exploration tasks
2. statusline-setup
Description: Use this agent to configure the user's Claude Code status line setting.
Tools Available: Read, Edit
Use Cases:
- Status line configuration
- Settings management
3. Explore
Description: Fast agent specialized for exploring codebases. Use this when you need to quickly find files by patterns (e.g., "src/components/**/*.tsx"), search code for keywords (e.g., "API endpoints"), or answer questions about the codebase (e.g., "how do API endpoints work?").
Tools Available: All tools
Thoroughness Levels:
- quick: Basic searches
- medium: Moderate exploration
- very thorough: Comprehensive analysis across multiple locations and naming conventions
Use Cases:
- Finding files by patterns
- Searching code for keywords
- Understanding codebase structure
- Answering questions about code architecture
4. Plan
Description: Software architect agent for designing implementation plans. Use this when you need to plan the implementation strategy for a task. Returns step-by-step plans, identifies critical files, and considers architectural trade-offs.
Tools Available: All tools
Use Cases:
- Implementation planning
- Architecture design
- Identifying critical files
- Evaluating trade-offs
5. frontend-styling-expert
Description: Use this agent when you need help with CSS, styling frameworks, responsive design, UI/UX implementation, animations, layout systems, or any visual/presentational aspects of web development.
Tools Available: All tools
Example Use Cases:
- User: "I need to create a responsive navigation menu that collapses on mobile" → Use this agent to help design responsive navigation component
- User: "The button hover effects aren't smooth enough" → Use this agent to optimize animations
- User: "Can you help me center this div?" → Use this agent to provide the best centering solution
- After writing HTML structure: Use this agent to add professional styling to components
Specialties:
- CSS frameworks
- Responsive design
- UI/UX implementation
- Animations
- Layout systems
- Visual/presentational aspects
6. full-stack-developer
Description: Full-stack Next.js 15 development agent that builds production-ready web applications with React, API routes, and Prisma databases.
Strongly Recommended For:
- Building complete websites
- Creating data visualization dashboards
- Developing blog/CMS systems
- Implementing responsive web pages
- Integrating AI features (chat/image generation/web search)
- Real-time collaborative applications with WebSocket
Tools Available: All tools
Key Strengths:
- Frontend-first development approach
- Feature-complete modules combining UI components (shadcn/ui)
- Backend APIs
- Database operations
- Work documentation in /agent-ctx directory
Skills System
Skills provide specialized capabilities and domain knowledge. Before starting any task, the system must follow a workflow to identify and invoke appropriate skills.
Available Skills
1. ASR (Automatic Speech Recognition)
Command: ASR
Description: Implement speech-to-text (ASR/automatic speech recognition) capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Transcribe audio files
- Convert speech to text
- Build voice input features
- Process audio recordings
Technical Details:
- Supports base64 encoded audio files
- Returns accurate text transcriptions
- Uses z-ai-web-dev-sdk
2. LLM (Large Language Model)
Command: LLM
Description: Implement large language model (LLM) chat completions using the z-ai-web-dev-sdk.
Use Cases:
- Build conversational AI applications
- Create chatbots
- Develop AI assistants
- Text generation features
Technical Details:
- Supports multi-turn conversations
- System prompts
- Context management
- Uses z-ai-web-dev-sdk
3. TTS (Text-to-Speech)
Command: TTS
Description: Implement text-to-speech (TTS) capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Convert text into natural-sounding speech
- Create audio content
- Build voice-enabled applications
- Generate spoken audio files
Technical Details:
- Supports multiple voices
- Adjustable speed
- Various audio formats
- Uses z-ai-web-dev-sdk
4. VLM (Vision Language Model)
Command: VLM
Description: Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Analyze images
- Describe visual content
- Create applications combining image understanding with conversational AI
Technical Details:
- Supports image URLs
- Supports base64 encoded images
- Multimodal interactions
- Uses z-ai-web-dev-sdk
5. csv-data-summarizer
Command: csv-data-summarizer
Description: Automatically analyzes CSV files with comprehensive statistical summaries, data quality checks, and intelligent visualizations.
Features:
- Detects data types (sales, customer, financial, operational, survey)
- Adapts analysis accordingly
- Generates correlation heatmaps, time-series plots, distribution charts
- Categorical breakdowns
- Missing data analysis
- Automatic date column detection
- Industry-specific insights
Technical Details:
- Built with pandas, matplotlib, and seaborn
- NO user prompting required - immediately runs full analysis
- Supports all relevant visualizations based on detected data patterns
Use Cases:
- User uploads CSV
- Data analysis requests
- Dataset structure understanding
6. deep-research
Command: deep-research
Description: Conduct enterprise-grade research with multi-source synthesis, citation tracking, and verification.
Use Cases:
- Comprehensive analysis requiring 10+ sources
- Verified claims
- Comparison of approaches
- Research reports
- Trend analysis
Triggers:
- "deep research"
- "comprehensive analysis"
- "research report"
- "compare X vs Y"
- "analyze trends"
NOT For:
- Simple lookups
- Debugging
- Questions answerable with 1-2 searches
7. docx
Command: docx
Description: Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction.
Use Cases:
- Creating new documents
- Modifying or editing content
- Working with tracked changes
- Adding comments
- Any other document tasks
File Types: .docx files
8. finance
Command: finance
Description: Comprehensive Finance API integration skill for real-time and historical financial data analysis, market research, and investment decision-making.
Priority Use Cases:
- Stock price queries
- Market data analysis
- Company financial information
- Portfolio tracking
- Market news retrieval
- Stock screening
- Technical analysis
- Any financial market-related requests
Note: This skill should be the primary choice for all Finance API interactions and financial data needs.
9. frontend-design
Command: frontend-design
Description: Transform UI style requirements into production-ready frontend code with systematic design tokens, accessibility compliance, and creative execution.
Use Cases:
- Building websites
- Web applications
- React/Vue components
- Dashboards
- Landing pages
- Any web UI requiring design consistency and aesthetic quality
10. get-fortune-analysis
Command: get-fortune-analysis
Description: Fortune analysis skill for specialized use cases.
11. gift-evaluator
Command: gift-evaluator
Description: The PRIMARY tool for Spring Festival gift analysis and social interaction generation.
Use Cases:
- Users upload photos of gifts (alcohol, tea, supplements, etc.)
- Inquire about gift value
- Authenticity verification
- Social response generation
Features:
- Integrates visual perception
- Market valuation
- HTML card generation
12. image-edit
Command: image-edit
Description: Implement AI image editing and modification capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Edit existing images
- Create variations
- Modify visual content
- Redesign assets
- Transform images based on text descriptions
Technical Details:
- Supports multiple image sizes
- Returns base64 encoded results
- Includes CLI tool for quick image editing
- Uses z-ai-web-dev-sdk
13. image-generation
Command: image-generation
Description: Implement AI image generation capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Create images from text descriptions
- Generate visual content
- Create artwork
- Design assets
- Build applications with AI-powered image creation
Technical Details:
- Supports multiple image sizes
- Returns base64 encoded images
- Includes CLI tool for quick image generation
- Uses z-ai-web-dev-sdk
CLI Usage:
# Generate image
z-ai-generate --prompt "A beautiful landscape" --output "./image.png"
# Short form
z-ai-generate -p "A cute cat" -o "./cat.png" -s 1024x1024
Supported Sizes:
- 1024x1024
- 768x1344
- 864x1152
- 1344x768
- 1152x864
- 1440x720
- 720x1440
14. image-understand
Command: image-understand
Description: Implement specialized image understanding capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Analyze static images
- Extract visual information
- Perform OCR
- Detect objects
- Classify images
- Understand visual content
Supported Formats: PNG, JPEG, GIF, WebP, BMP
15. pdf
Command: pdf
Description: Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms.
Use Cases:
- Fill in PDF forms
- Programmatically process PDFs
- Generate PDF documents
- Analyze PDF documents at scale
16. podcast-generate
Command: podcast-generate
Description: Generate podcast episodes from user-provided content or by searching the web for specified topics.
Modes:
- User uploads text file/article: Creates dual-host dialogue podcast (or single-host upon request)
- No content provided: Searches web for information about user-specified topic and generates podcast
Features:
- Duration scales with content size (3-20 minutes, ~240 chars/min)
- Uses z-ai-web-dev-sdk for LLM script generation and TTS audio synthesis
- Outputs podcast script (Markdown) and complete audio file (WAV)
17. pptx
Command: pptx
Description: Presentation editing, creation, and analysis.
Use Cases:
- Editing existing presentations
- Adding slides to existing files
- Creating new presentations
- Working with layouts
- Adding comments or speaker notes
File Types: .pptx files
18. story-video-generation
Command: story-video-generation
Description: 根据用户输入的一句话,自动拓展为小故事,自动生成分场景图片,并将图片合成为视频。
Triggers:
- "生成故事"
- "故事视频"
- "把一句话变成视频"
- "制作故事视频"
19. video-generation
Command: video-generation
Description: Implement AI-powered video generation capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Generate videos from text prompts or images
- Create video content programmatically
- Build applications that produce video outputs
Technical Details:
- Supports asynchronous task management
- Status polling
- Result retrieval
- Uses z-ai-web-dev-sdk
20. video-understand
Command: video-understand
Description: Implement specialized video understanding capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Analyze video content
- Understand motion and temporal sequences
- Extract information from video frames
- Describe video scenes
- Perform video-based AI analysis
Supported Formats: MP4, AVI, MOV, and other common video formats
21. web-reader
Command: web-reader
Description: Implement web page content extraction capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Scrape web pages
- Extract article content
- Retrieve page metadata
- Build applications that process web content
Features:
- Automatic content extraction
- Title retrieval
- HTML retrieval
- Publication time retrieval
22. web-search
Command: web-search
Description: Implement web search capabilities using the z-ai-web-dev-sdk.
Use Cases:
- Search for real-time information from the web
- Retrieve up-to-date content beyond knowledge cutoff
- Find latest news and data
Returns: Structured search results with URLs, snippets, and metadata
23. xlsx
Command: xlsx
Description: Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization.
Use Cases:
- Creating new spreadsheets with formulas and formatting
- Reading or analyzing data
- Modifying existing spreadsheets while preserving formulas
- Data analysis and visualization in spreadsheets
- Recalculating formulas
Supported Formats: .xlsx, .xlsm, .csv, .tsv, etc.
Tools Reference
1. Task Tool
Purpose: Launch a new agent to handle complex, multi-step tasks autonomously.
Parameters:
subagent_type(Required): Type of specialized agentprompt(Required): Task for the agent to performdescription(Optional): Short 3-5 word descriptionmodel(Optional): Model to use (sonnet, opus, haiku)resume(Optional): Agent ID to resume from
2. Bash Tool
Purpose: Execute bash commands in a persistent shell session.
Parameters:
command(Required): Command to executedescription(Optional): Description of the commandtimeout(Optional): Timeout in milliseconds (max 600000ms)
3. Glob Tool
Purpose: Fast file pattern matching tool.
Parameters:
pattern(Required): Glob pattern (e.g., "**/*.js")path(Optional): Directory to search (defaults to current)
4. Grep Tool
Purpose: Powerful search tool built on ripgrep.
Parameters:
pattern(Required): Regular expression patternpath(Optional): Directory to searchoutput_mode: content, files_with_matches, countglob: File pattern filterfile_type: File type filter (js, py, rust, go, java, etc.)-A,-B,-C: Context lines-i: Case insensitive-n: Show line numbersmultiline: Enable multiline mode
5. LS Tool
Purpose: List files and directories.
Parameters:
path(Required): Absolute path to listignore: Glob patterns to ignore
6. Read Tool
Purpose: Read files from the filesystem.
Parameters:
filepath(Required): Absolute path to the fileoffset(Optional): Starting line numberlimit(Optional): Number of lines to read
7. Edit Tool
Purpose: Perform exact string replacements in files.
Parameters:
filepath(Required): Absolute pathold_str(Required): Text to replacenew_str(Required): Replacement textreplace_all: Replace all occurrences
8. MultiEdit Tool
Purpose: Multiple edits to a single file in one operation.
Parameters:
filepath(Required): Absolute pathedits(Required): Array of edit operations
9. Write Tool
Purpose: Write a file to the filesystem.
Parameters:
filepath(Required): Absolute pathcontent(Required): File content
10. TodoWrite / TodoRead Tools
Purpose: Manage task lists for coding sessions.
Parameters (TodoWrite):
todos(Required): Array of todo items with id, content, status, priority
11. Skill Tool
Purpose: Execute a skill within the main conversation.
Parameters:
command(Required): Skill name (no arguments)
12. Complete Tool
Purpose: Mark completion of code development or PPT creation tasks.
Parameters:
project_type(Required): Type of project ("web_dev")summary(Optional): Summary of completed project
SDK Integration (z-ai-web-dev-sdk)
Chat Completions Example
import ZAI from 'z-ai-web-dev-sdk';
async function main() {
try {
const zai = await ZAI.create()
const completion = await zai.chat.completions.create({
messages: [
{
role: 'system',
content: 'You are a helpful assistant.'
},
{
role: 'user',
content: 'Hello, who are you?'
}
],
// Other parameters like temperature, max_tokens, etc.
});
const messageContent = completion.choices[0]?.message?.content;
console.log('Assistant says:', messageContent);
} catch (error) {
console.error('An error occurred:', error.message);
}
}
Image Generation Example
import ZAI from 'z-ai-web-dev-sdk';
async function generateImage() {
try {
const zai = await ZAI.create();
const response = await zai.images.generations.create({
prompt: 'A cute cat playing in the garden',
size: '1024x1024'
});
const imageBase64 = response.data[0].base64;
console.log('Generated image base64:', imageBase64);
} catch (error) {
console.error('Image generation failed:', error.message);
}
}
Web Search Example
import ZAI from 'z-ai-web-dev-sdk';
async function testSearch() {
try {
const zai = await ZAI.create()
const searchResult = await zai.functions.invoke("web_search", {
query: "What is the capital of France?",
num: 10
})
console.log('Full API Response:', searchResult)
} catch (error: any) {
console.error('An error occurred:', error.message);
}
}
Search Result Type
interface SearchFunctionResultItem {
url: string;
name: string;
snippet: string;
host_name: string;
rank: number;
date: string;
favicon: string;
}
Project Environment Notes
- Next.js 15 with App Router
- Port: 3000 (auto dev server)
bun run devruns automatically - do not run manually- Use
bun run lintto check code quality - z-ai-web-dev-sdk MUST be used in the backend only
- User can only see the / route defined in src/app/page.tsx
File Output Directory
All generated files must be saved to:
/home/z/my-project/download/
Document generated automatically from Super Z system configuration