v2.1.3: fix Crof mimo-v2.5-pro reasoning_content token exhaustion
- Strip reasoning_content from proxy output (Codex doesn't use it) - Force max_tokens=64000 minimum for openai-compat providers - Prevents models that emit large reasoning from running out of tokens
This commit is contained in:
@@ -1,5 +1,13 @@
|
||||
# Changelog
|
||||
|
||||
## v2.1.3 (2026-05-19)
|
||||
|
||||
- **Fixed Crof mimo-v2.5-pro stopping mid-response (finish_reason=length)**
|
||||
- Root cause: model emits 600+ `reasoning_content` SSE chunks that exhaust `max_tokens` before any actual content is generated
|
||||
- Strip `reasoning_content` from proxy output — Codex doesn't use reasoning, avoids wasting output tokens on invisible text
|
||||
- Force `max_tokens` minimum of 64000 for openai-compat providers — gives models room for both reasoning and content
|
||||
- Works for all openai-compat providers (Crof, Z.AI, DeepSeek, OpenRouter, etc.)
|
||||
|
||||
## v2.1.2 (2026-05-19)
|
||||
|
||||
- **Fixed Crof.ai and providers stopping after first tool call (root cause: None tool IDs)**
|
||||
|
||||
Reference in New Issue
Block a user