v2.1.3: fix Crof mimo-v2.5-pro reasoning_content token exhaustion

- Strip reasoning_content from proxy output (Codex doesn't use it) - Force max_tokens=64000 minimum for openai-compat providers - Prevents models that emit large reasoning from running out of tokens
2026-05-19 21:59:38 +04:00
parent 662d8e961e
commit 77423c5c35
5 changed files with 18 additions and 10 deletions
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -1,5 +1,13 @@
 # Changelog

+## v2.1.3 (2026-05-19)
+
+- **Fixed Crof mimo-v2.5-pro stopping mid-response (finish_reason=length)**
+  - Root cause: model emits 600+ `reasoning_content` SSE chunks that exhaust `max_tokens` before any actual content is generated
+  - Strip `reasoning_content` from proxy output — Codex doesn't use reasoning, avoids wasting output tokens on invisible text
+  - Force `max_tokens` minimum of 64000 for openai-compat providers — gives models room for both reasoning and content
+  - Works for all openai-compat providers (Crof, Z.AI, DeepSeek, OpenRouter, etc.)
+
 ## v2.1.2 (2026-05-19)

 - **Fixed Crof.ai and providers stopping after first tool call (root cause: None tool IDs)**