Files
SuperCharged-Claude-Code-Up…/dexto/agents/talk2pdf-agent
admin b52318eeae feat: Add intelligent auto-router and enhanced integrations
- Add intelligent-router.sh hook for automatic agent routing
- Add AUTO-TRIGGER-SUMMARY.md documentation
- Add FINAL-INTEGRATION-SUMMARY.md documentation
- Complete Prometheus integration (6 commands + 4 tools)
- Complete Dexto integration (12 commands + 5 tools)
- Enhanced Ralph with access to all agents
- Fix /clawd command (removed disable-model-invocation)
- Update hooks.json to v5 with intelligent routing
- 291 total skills now available
- All 21 commands with automatic routing

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
b52318eeae · 2026-01-28 00:27:56 +04:00
History
..

Talk2PDF Agent

A comprehensive AI agent for parsing and analyzing PDF documents using the Talk2PDF MCP Server.

This agent provides intelligent PDF document processing through a TypeScript-based MCP server that can extract text, metadata, and search for specific content within PDF files.

Features

📄 PDF Parsing & Text Extraction

  • Full Document Parsing: Extract complete text content from PDF files
  • Metadata Extraction: Get document information (title, author, page count, creation date)
  • Format Support: Handle various PDF versions and structures
  • Error Handling: Graceful handling of corrupted or protected PDFs

🔍 Content Search & Analysis

  • Section Extraction: Search for and extract specific content sections
  • Intelligent Filtering: Find content containing specific terms or patterns
  • Context Preservation: Maintain document structure and formatting
  • Multi-page Support: Process documents of any length

🧠 AI-Powered Analysis

  • Document Summarization: Generate intelligent summaries of PDF content
  • Key Information Extraction: Identify and extract important details
  • Question Answering: Answer questions about document content
  • Content Classification: Analyze document type and structure

Quick Start

Prerequisites

  • Node.js 20+: For the Dexto framework
  • TypeScript: Automatically managed by the MCP server

Installation

  1. Run the Agent:
    # From the dexto project root
    dexto --agent agents/talk2pdf-agent/talk2pdf-agent.yml
    

That's it! The MCP server will be automatically downloaded and installed via npx on first run.

Configuration

The agent is configured to use the published MCP server:

mcpServers:
  talk2pdf:
    type: stdio
    command: npx
    args:
      - "@truffle-ai/talk2pdf-mcp"
    timeout: 30000
    connectionMode: strict

MCP Server

This agent uses the Talk2PDF MCP Server, which is maintained separately at:

🔗 https://github.com/truffle-ai/mcp-servers/tree/main/src/talk2pdf

The MCP server repository provides:

  • Complete technical documentation
  • Development and contribution guidelines
  • Server implementation details
  • Advanced configuration options

Available Tools

PDF Processing Tools

parse_pdf

Extract complete text content and metadata from a PDF file.

Parameters:

  • filePath (string): Path to the PDF file to parse

Returns:

  • Full text content of the document
  • Document metadata (title, author, page count, creation date, etc.)
  • File information (size, format)

extract_section

Search for and extract specific content sections from a PDF.

Parameters:

  • filePath (string): Path to the PDF file
  • searchTerms (string): Terms or patterns to search for
  • maxResults (number, optional): Maximum number of results to return

Returns:

  • Matching content sections with context
  • Page numbers and locations
  • Relevance scoring

Supported PDF Features

  • Standard PDF formats: PDF 1.4 through 2.0
  • Text-based PDFs: Documents with extractable text content
  • Multi-page documents: No page limit restrictions
  • Metadata support: Title, author, creation date, modification date
  • Various encodings: UTF-8, Latin-1, and other standard encodings

Example Usage

Basic PDF Parsing

"Parse the PDF at /path/to/document.pdf and show me the full content"
"Extract all text and metadata from my research paper"
"What's in this PDF file?"
"Find all sections about 'machine learning' in the PDF"
"Extract the introduction and conclusion from this document" 
"Search for mentions of 'budget' in the financial report"

Document Analysis

"Summarize the main points from this PDF"
"What is this document about?"
"Extract the key findings from the research paper"
"List all the recommendations mentioned in the report"

Intelligent Q&A

"What are the main conclusions of this study?"
"Who are the authors of this document?"
"When was this document created?"
"How many pages does this PDF have?"

Troubleshooting

Common Issues

  1. Server Installation: The MCP server will be automatically installed via npx on first run. No manual setup required.

  2. PDF Access Issues: Ensure the PDF file path is correct and the file is readable. Protected or encrypted PDFs may require special handling.

  3. Memory Issues: For very large PDFs (100+ pages), processing may take longer. Consider breaking large documents into sections.

  4. Text Extraction: If text appears garbled, the PDF may use non-standard encoding or be scanned image-based (OCR not supported).

Error Handling

The agent provides clear error messages for common issues:

  • File not found or inaccessible
  • Invalid PDF format
  • Corrupted PDF files
  • Permission-protected documents

Getting Help

  • MCP Server Issues: Report at the mcp-servers repository
  • Agent Configuration: Report at the main Dexto repository
  • Feature Requests: Use the mcp-servers repository for tool-related requests

License

This project is part of the Dexto AI agent framework.