v3.10.12: sticky endpoint, parallel discovery, anti-stall, smart errors, missing headers

2026-05-26 00:57:16 +04:00
parent fced1653f2
commit 0896ba5e55
6 changed files with 245 additions and 36 deletions
--- a/README.md
+++ b/README.md
@@ -554,6 +554,7 @@ The launcher generates model catalog JSON with dual field naming to satisfy both

 Codex Launcher includes special handling for Gemini 3 / Antigravity OAuth:

+- **Sticky endpoint with parallel discovery**: First request probes `cloudcode-pa.googleapis.com` and `daily-cloudcode-pa.googleapis.com` simultaneously — first 200 wins and is cached. All subsequent requests go straight to the cached endpoint. If it fails (429/502/503), cache is cleared and all endpoints are re-probed in parallel. Zero wasted time on rate-limited endpoints.
 - **Thought signature preservation**: Captures `thoughtSignature` from Gemini responses
  and reattaches them on follow-up requests to maintain tool-call continuity.
 - **Edit-intent detection**: When follow-up requests contain edit keywords, a tool-use
@@ -561,7 +562,7 @@ Codex Launcher includes special handling for Gemini 3 / Antigravity OAuth:
 - **User instruction enforcement**: The latest user message is guaranteed to be the
  final content turn sent to Gemini, even after compaction.
 - **Smart compaction**: Old tool outputs capped at 3000 chars, recent 6 at 20000 chars.
- **Context compaction**: Aggressive auto-trimming when approaching 60% of model context
+- **Context compaction**: Aggressive auto-trimming when approaching 80% of model context
  limit (1M tokens Gemini, 200K Claude, 128K GPT-OSS). Prevents token limit errors.
 - **Model ID mapping**: Display names (e.g. `Gemini 3.5 Flash (High)`) mapped to REST API
  slugs (e.g. `gemini-3-flash`). See `docs/ANTIGRAVITY.md` for details.