CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-05-11 00:11:36 +08:00

Author	SHA1	Message	Date
Luis Pater	ecf1c2590c	fix: preserve Antigravity cancellation errors	2026-05-04 21:18:18 +08:00
1137043480	bf0e5c23f7	fix: prevent goroutine leaks in streaming executors via context-aware channel sends All streaming executors use bare channel sends (out <- chunk) inside goroutines that process upstream SSE responses. When the downstream consumer disconnects (client timeout, network drop, etc.), these sends block indefinitely, causing the goroutine and all associated resources (HTTP response body, scanner buffers, translation state) to leak permanently. Over time, leaked goroutines accumulate monotonically, leading to RSS growth from ~30MB to 3.7GB+ and eventual OOM kills on resource-constrained VPS hosts. Fix: Replace all bare 'out <- ...' sends with: select { case out <- ...: case <-ctx.Done(): return } This ensures goroutines terminate promptly when the request context is canceled, allowing GC to reclaim all associated resources. Affected executors (9 files, 36+ send sites): - antigravity_executor.go (5 sites) - gemini_cli_executor.go (6 sites) - gemini_vertex_executor.go (6 sites) - aistudio_executor.go (4 sites) - gemini_executor.go (3 sites) - openai_compat_executor.go (3 sites) - claude_executor.go (4 sites) - codex_executor.go (2 sites) - kimi_executor.go (3 sites)	2026-05-03 11:25:04 -04:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	a1f0ed9575	Merge pull request #3071 from sususu98/fix/antigravity-credits-log Mark Antigravity credits requests in access logs	2026-04-29 22:56:41 +08:00
sususu98	0e1235122e	fix antigravity client agent headers	2026-04-28 19:04:40 +08:00
sususu98	e78d45acc9	fix antigravity user agent handling	2026-04-28 19:04:40 +08:00
xbang	a992dee4e8	fix(antigravity): use real antigravity UA when polling credits balance The loadCodeAssist polling call hardcoded the User-Agent to google-api-nodejs-client/9.15.1. Google Cloud Code returns the paidTier object WITHOUT the availableCredits array for that UA, so updateAntigravityCreditsBalance always saw "no credits", set the hint to Available=false for every Google One AI Ultra account, and the conductor-level credits fallback could never find a candidate. Switching to resolveUserAgent(auth) (the same UA used for streamGenerateContent / generateContent) makes the response include availableCredits, so the credits hint is populated correctly and the fallback can actually inject enabledCreditTypes:["GOOGLE_ONE_AI"] when free tier is exhausted.	2026-04-28 16:21:15 +08:00
sususu98	6fc23568df	logging: mark antigravity credits requests	2026-04-26 23:04:27 +08:00
sususu98	7ad1900041	perf(antigravity): async credits hint refresh for warm tokens	2026-04-23 23:58:10 +08:00
sususu98	14d46a0a5d	feat(antigravity): conductor-level credits fallback for Claude models Move credits handling from executor-level retry to conductor-level orchestration. When all free-tier auths are exhausted (429/503), the conductor discovers auths with available Google One AI credits and retries with enabledCreditTypes injected via context flag. Key changes: - Add AntigravityCreditsHint system for tracking per-auth credits state - Conductor tries credits fallback after all auths fail (Execute/Stream/Count) - Executor injects enabledCreditTypes only when conductor sets context flag - Credits fallback respects provider scope (requires antigravity in providers) - Add context cancellation check in credits fallback to avoid wasted requests - Remove executor-level attemptCreditsFallback and preferCredits machinery - Restructure 429 decision logic (parse details first, keyword fallback) - Expand shouldAbort to cover INVALID_ARGUMENT/FAILED_PRECONDITION/500+UNKNOWN - Support human-readable retry delay parsing (e.g. "1h43m56s")	2026-04-23 13:44:20 +08:00
sususu98	8fecd625d2	fix(antigravity): cap maxOutputTokens using registry max_completion_tokens Claude models on antigravity have a 64000 token output limit but max_tokens from downstream requests was passed through uncapped, causing 400 INVALID_ARGUMENT from Google when clients sent 128000.	2026-04-15 11:57:55 +08:00
sususu98	278a89824c	fix(antigravity): strip thinking blocks with empty signatures instead of rejecting Thinking blocks with empty signatures come from proxy-generated responses (Antigravity/Gemini routed as Claude). These should be silently dropped from the request payload before forwarding, not rejected with 400. Fixes 10 "missing thinking signature" errors.	2026-04-14 15:14:48 +08:00
sususu98	f5ed5c7453	fix(antigravity): skip full schema cleanup for empty tool requests Avoid whole-payload schema sanitization when translated Antigravity requests have no actual tool schemas, including missing and empty tools arrays. Add regression coverage so image-heavy no-tool requests keep bypassing the old memory amplification path. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-12 12:51:42 +08:00
sususu98	d801393841	feat(antigravity): prefer prod URL as first priority Promote cloudcode-pa.googleapis.com to the first position in the fallback order, with daily and sandbox URLs as fallbacks.	2026-04-10 19:37:56 +08:00
Luis Pater	39dc4557c1	Merge pull request #2412 from sususu98/feat/signature-cache-toggle feat: configurable signature cache toggle for Antigravity/Claude thinking blocks	2026-04-09 21:54:47 +08:00
ZTXBOSS666	30e94b6792	fix(antigravity): refine 429 handling and credits fallback Includes: restore SDK docs under docs/; update antigravity executor credits tests; gofmt.	2026-04-09 21:48:32 +08:00
sususu98	cf249586a9	feat(antigravity): configurable signature cache with bypass-mode validation Antigravity 的 Claude thinking signature 处理新增 cache/bypass 双模式，并为 bypass 模式实现按 SIGNATURE-CHANNEL-SPEC.md 的签名校验。新增 antigravity-signature-cache-enabled 配置项（默认 true）： - cache mode（true）：使用服务端缓存的签名，行为与原有逻辑完全一致 - bypass mode（false）：直接使用客户端提供的签名，经过校验和归一化支持配置热重载，运行时可切换模式。校验流程： 1. 剥离历史 cache-mode 的 'modelGroup#' 前缀（如 claude#Exxxx → Exxxx） 2. 首字符必须为 'E'（单层编码）或 'R'（双层编码），否则拒绝 3. R 开头：base64 解码 → 内层必须以 'E' 开头 → 继续单层校验 4. E 开头：base64 解码 → 首字节必须为 0x12（Claude protobuf 标识） 5. 所有合法签名归一化为 R 形式（双层 base64）发往 Antigravity 后端非法签名处理策略： - 非严格模式（默认）：translator 静默丢弃无签名的 thinking block - 严格模式（antigravity-signature-bypass-strict: true）： executor 层在请求发往上游前直接返回 HTTP 400 按 SIGNATURE-CHANNEL-SPEC.md 解析 Claude 签名的完整 protobuf 结构： - Top-level Field 2（容器）→ Field 1（渠道块） - 渠道块提取：channel_id (Field 1)、infrastructure (Field 2)、 model_text (Field 6)、field7 (Field 7) - 计算 routing_class、infrastructure_class、schema_features - 使用 google.golang.org/protobuf/encoding/protowire 解析 - resolveThinkingSignature 拆分为 resolveCacheModeSignature / resolveBypassModeSignature - hasResolvedThinkingSignature：mode-aware 签名有效性判断（cache: len>=50 via HasValidSignature，bypass: non-empty） - validateAntigravityRequestSignatures：executor 预检，仅在 bypass + strict 模式下拦截非法签名返回 400 - 响应侧签名缓存逻辑与 cache mode 集成 - Cache mode 行为完全保留：无 '#' 前缀的原生签名静默丢弃	2026-04-09 21:12:40 +08:00
Luis Pater	91e7591955	fix(executor): add transient 429 resource exhausted handling with retry logic	2026-04-08 02:48:53 +08:00
rensumo	8d5e470e1f	feat: dynamically fetch antigravity UA version from releases API Fetch the latest version from the antigravity auto-updater releases endpoint and cache it for 6 hours. Falls back to 1.21.9 if the API is unreachable or returns unexpected data.	2026-04-04 14:52:59 +08:00
rensumo	f3ab8f4bc5	chore: update antigravity UA version to 1.21.9	2026-04-04 07:35:08 +08:00
Luis Pater	09e480036a	feat(auth): add support for managing custom headers in auth files Closes #2457	2026-04-02 19:11:09 +08:00
Luis Pater	d2c7e4e96a	refactor(runtime): move executor utilities to `helps` package and update references	2026-04-01 03:08:20 +08:00
xixiwenxuanhe	a0bf33eca6	fix(antigravity): preserve fallback and honor config gate	2026-03-31 00:14:05 +08:00
xixiwenxuanhe	88dd9c715d	feat(antigravity): add AI credits quota fallback	2026-03-30 23:58:12 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
hkfires	9229708b6c	revert(executor): re-apply PR #1735 antigravity changes with cleanup	2026-03-02 19:30:32 +08:00
hkfires	b907d21851	revert(executor): revert antigravity_executor.go changes from PR #1735	2026-03-02 12:54:15 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
maplelove	2baf35b3ef	fix(executor): bump antigravity UA to 1.19.6 and align image_gen payload	2026-02-27 14:09:37 +08:00
maplelove	846e75b893	feat(gemini): route gemini-3.1-flash-image identically to gemini-3-pro-image	2026-02-27 13:32:06 +08:00
maplelove	f3c164d345	feat(antigravity): update to v1.19.5 with new models and Claude 4-6 migration	2026-02-27 10:34:27 +08:00
maplelove	4040b1e766	Merge remote-tracking branch 'upstream/dev' into dev # Conflicts: # internal/runtime/executor/antigravity_executor.go	2026-02-27 10:29:50 +08:00
comalot	514ae341c8	fix(antigravity): deep copy cached model metadata	2026-02-24 20:14:01 +08:00
comalot	8ce07f38dd	fix(antigravity): keep primary model list and backfill empty auths	2026-02-24 16:16:44 +08:00
maplelove	8f97a5f77c	feat(registry): expose input modalities, token limits, and generation methods for Antigravity models	2026-02-23 13:33:51 +08:00
maplelove	8b5af2ab84	fix(executor): match real Antigravity OAuth UA, remove redundant header scrubbing on new requests	2026-02-22 23:20:12 +08:00
maplelove	d887716ebd	refactor(executor): switch HttpRequest to whitelist-based header filtering	2026-02-22 21:00:12 +08:00
maplelove	5dc1848466	feat(scrub): add comprehensive browser fingerprint and client identity header scrubbing	2026-02-22 20:51:00 +08:00
maplelove	9491517b26	fix(executor): use singleton transport to prevent OOM from connection pool leaks	2026-02-22 20:17:30 +08:00
maplelove	9370b5bd04	fix(executor): completely scrub all proxy tracing headers in executor	2026-02-22 19:43:10 +08:00
maplelove	abb51a0d93	fix(executor): correctly disable http2 ALPN in Antigravity client to resolve connection reset errors	2026-02-22 19:23:48 +08:00
maplelove	c8d809131b	fix(executor): improve antigravity reverse proxy emulation - force http/1.1 instead of http/2 - explicit connection close - strip proxy headers X-Forwarded-For and X-Real-IP - add project id to fetch models payload	2026-02-22 18:41:58 +08:00
Kirill Turanskiy	1f8f198c45	feat: passthrough upstream response headers to clients CPA previously stripped ALL response headers from upstream AI provider APIs, preventing clients from seeing rate-limit info, request IDs, server-timing and other useful headers. Changes: - Add Headers field to Response and StreamResult structs - Add FilterUpstreamHeaders helper (hop-by-hop + security denylist) - Add WriteUpstreamHeaders helper (respects CPA-set headers) - ExecuteWithAuthManager/ExecuteCountWithAuthManager now return headers - ExecuteStreamWithAuthManager returns headers from initial connection - All 11 provider executors populate Response.Headers - All handler call sites write filtered upstream headers before response Filtered headers (not forwarded): - RFC 7230 hop-by-hop: Connection, Transfer-Encoding, Keep-Alive, etc. - Security: Set-Cookie - CPA-managed: Content-Length, Content-Encoding	2026-02-18 00:16:22 +03:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
Luis Pater	09ecfbcaed	refactor(executor): optimize payload cloning and streamline SDK translator usage - Replaced unnecessary `bytes.Clone` calls for `opts.OriginalRequest` throughout executors. - Introduced intermediate variable `originalPayloadSource` to simplify payload processing. - Ensured better clarity and structure in request translation logic.	2026-02-06 01:44:20 +08:00
Luis Pater	25c6b479c7	refactor(util, executor): optimize payload handling and schema processing - Replaced repetitive string operations with a centralized `escapeGJSONPathKey` function. - Streamlined handling of JSON schema cleaning for Gemini and Antigravity requests. - Improved payload management by transitioning from byte slices to strings for processing. - Removed unnecessary cloning of byte slices in several places.	2026-02-05 19:00:30 +08:00
Luis Pater	f887f9985d	Merge pull request #1248 from shekohex/feat/responses-compact feat(openai): add responses/compact support	2026-01-31 03:12:55 +08:00
Shady Khalifa	95096bc3fc	feat(openai): add responses/compact support	2026-01-26 16:36:01 +02:00

1 2 3

141 Commits