CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-05-10 15:57:52 +08:00

Author	SHA1	Message	Date
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	28b4b19e7e	Merge pull request #3208 from kdcokenny/codex-websocket-protocol-parity Align Codex websocket protocol semantics	2026-05-05 01:29:19 +08:00
Kenny	6b4bc0a9a8	Align Codex default identity and docs	2026-05-03 21:13:37 -07:00
Kenny	c19ae1d5be	Align Codex websocket protocol semantics	2026-05-03 15:56:39 -07:00
1137043480	bf0e5c23f7	fix: prevent goroutine leaks in streaming executors via context-aware channel sends All streaming executors use bare channel sends (out <- chunk) inside goroutines that process upstream SSE responses. When the downstream consumer disconnects (client timeout, network drop, etc.), these sends block indefinitely, causing the goroutine and all associated resources (HTTP response body, scanner buffers, translation state) to leak permanently. Over time, leaked goroutines accumulate monotonically, leading to RSS growth from ~30MB to 3.7GB+ and eventual OOM kills on resource-constrained VPS hosts. Fix: Replace all bare 'out <- ...' sends with: select { case out <- ...: case <-ctx.Done(): return } This ensures goroutines terminate promptly when the request context is canceled, allowing GC to reclaim all associated resources. Affected executors (9 files, 36+ send sites): - antigravity_executor.go (5 sites) - gemini_cli_executor.go (6 sites) - gemini_vertex_executor.go (6 sites) - aistudio_executor.go (4 sites) - gemini_executor.go (3 sites) - openai_compat_executor.go (3 sites) - claude_executor.go (4 sites) - codex_executor.go (2 sites) - kimi_executor.go (3 sites)	2026-05-03 11:25:04 -04:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	e3e60f914b	feat: support disabling image generation globally - Added `disable-image-generation` configuration flag to disable the `image_generation` tool globally. - Updated payload handling to remove `image_generation` tools from request payload arrays when the flag is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to return 404 when the feature is disabled. - Enhanced configuration diff logging to track changes for the `disable-image-generation` flag. - Added accompanying unit tests for the new feature in payload helpers and image handler logic.	2026-04-30 03:42:27 +08:00
Luis Pater	c5bea6f6f8	Merge pull request #3020 from Matthias319/fix/codex-error-classification fix(codex): classify context, thinking-signature, previous-response, and auth failures	2026-04-26 22:26:40 +08:00
Luis Pater	c7b28ba058	feat(executor): add support for Codex image generation tool usage tracking - Introduced `publishCodexImageToolUsage` to report image generation tool metrics. - Updated executor logic to handle image generation tool events and defaults. - Added parsing logic for `image_gen` tool usage details in `helps/usage_helpers.go`. - Updated `UsageReporter` for additional model-specific usage publishing. - Refactored usage detail normalizations. Closes: #3063	2026-04-26 22:19:03 +08:00
Luis Pater	a7e92e2639	feat(auth): disallow free-tier Codex auth during selection process - Introduced `disallowFreeAuthFromMetadata` and `isFreeCodexAuth` to enforce skipping free-tier credentials. - Modified scheduler logic to honor `DisallowFreeAuthMetadataKey` during auth selection. - Updated `ensureImageGenerationTool` to skip tool injection for free-tier Codex auth. - Added context utility `WithDisallowFreeAuth` and integrated with image handlers. - Augmented relevant tests to cover free-tier exclusion scenarios.	2026-04-24 23:18:56 +08:00
Matthias319	4056c2590b	fix(codex): classify known upstream failures Normalize Codex context, thinking-signature, previous-response, and auth failures to explicit error codes: context_too_large, thinking_signature_invalid, previous_response_not_found, auth_unavailable. Refs #2596.	2026-04-24 17:13:23 +02:00
Luis Pater	f1ba6151a9	feat(codex): pass base model to enable conditional image_generation tool injection - Modified `ensureImageGenerationTool` to accept `baseModel` for conditional logic. - Ensured `gpt-5.3-codex-spark` models bypass image_generation tool injection. - Updated relevant tests and executor logic to reflect changes.	2026-04-24 07:21:03 +08:00
MoYeRanQianZhi	31934ae04c	feat(codex): enable image generation for all Codex upstream requests Codex CLI gates the built-in image_generation tool behind AuthMode::Chatgpt (OAuth only). When clients connect via API key auth through CPA, the tool is absent from requests, making image generation unavailable through the reverse proxy. Changes: 1. Inject image_generation tool (codex_executor.go): Add ensureImageGenerationTool() that appends {"type":"image_generation","output_format":"png"} to the tools array if not already present. Applied to all three execution paths: Execute, executeCompact, and ExecuteStream. 2. Route aliases for Codex CLI direct access (server.go): Add /backend-api/codex/responses routes that map to the same OpenAI Responses API handlers as /v1/responses. This allows Codex CLI to connect via chatgpt_base_url config while keeping AuthMode::Chatgpt, which enables the built-in image_generation tool on the client side. 3. Unit tests (codex_executor_imagegen_test.go): Cover no-tools, existing tools, already-present, empty array, and mixed built-in tool scenarios.	2026-04-23 01:24:40 +08:00
stringer07	b6781d69be	perf(codex): avoid repeated output patch writes	2026-04-21 16:29:54 +08:00
stringer07	bb8408cef5	fix(codex): backfill streaming response output	2026-04-21 16:03:56 +08:00
Luis Pater	d949921143	feat(auth): add proxy URL override support to auth constructors and executors - Introduced `WithProxyURL` variants for `CodexAuth`, `ClaudeAuth`, `IFlowAuth`, and `DeviceFlowClient`. - Updated executors to use proxy-aware constructors for improved configurability. - Added unit tests to validate proxy override precedence and functionality. Closes: #2823	2026-04-16 22:11:39 +08:00
Luis Pater	c8b7e2b8d6	fix(executor): ensure empty stream completions use output_item.done as fallback Fixed: #2583	2026-04-07 18:21:12 +08:00
Luis Pater	330e12d3c2	fix(codex): conditionally set `Session_id` header for Mac OS user agents and clean up redundant logic	2026-04-01 11:11:45 +08:00
Luis Pater	d2c7e4e96a	refactor(runtime): move executor utilities to `helps` package and update references	2026-04-01 03:08:20 +08:00
Luis Pater	51fd58d74f	fix(codex): use normalizeCodexInstructions to set default instructions	2026-03-31 12:16:57 +08:00
Luis Pater	faae9c2f7c	Merge pull request #2422 from MonsterQiu/fix/codex-compact-instructions fix(codex): add default instructions for /responses/compact	2026-03-31 12:14:20 +08:00
MonsterQiu	39b9a38fbc	fix(codex): normalize null instructions across responses paths	2026-03-31 10:32:39 +08:00
MonsterQiu	bd855abec9	fix(codex): normalize null instructions for responses requests	2026-03-31 10:29:02 +08:00
MonsterQiu	d3b94c9241	fix(codex): normalize null instructions for compact requests	2026-03-30 22:58:05 +08:00
MonsterQiu	d11936f292	fix(codex): add default instructions for /responses/compact	2026-03-30 22:44:46 +08:00
Luis Pater	13aa5b3375	Revert "fix(codex): restore prompt cache continuity for Codex requests"	2026-03-29 22:18:14 +08:00
Luis Pater	55271403fb	Merge pull request #2374 from VooDisss/codex-cache-clean fix(codex): restore prompt cache continuity for Codex requests	2026-03-28 21:16:51 +08:00
Luis Pater	b9b127a7ea	Merge pull request #2347 from edlsh/fix/codex-strip-stream-options fix(codex): strip stream_options from Responses API requests	2026-03-28 21:03:01 +08:00
VooDisss	26eca8b6ba	fix(codex): preserve continuity and safe affinity fallback Restore Claude continuity after the continuity refactor, keep auth-affinity keys out of upstream Codex session identifiers, and only persist affinity after successful execution so retries can still rotate to healthy credentials when the first auth fails.	2026-03-27 18:27:33 +02:00
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
edlsh	754f3bcbc3	fix(codex): strip stream_options from Responses API requests The Codex/OpenAI Responses API does not support the stream_options parameter. When clients (e.g. Amp CLI) include stream_options in their requests, CLIProxyAPI forwards it as-is, causing a 400 error: {"detail":"Unsupported parameter: stream_options"} Strip stream_options alongside the other unsupported parameters (previous_response_id, prompt_cache_retention, safety_identifier) in Execute, ExecuteStream, and CountTokens.	2026-03-25 11:58:36 -04:00
pjpj	36973d4a6f	Handle Codex capacity errors as retryable	2026-03-25 23:25:31 +08:00
hkfires	528b1a2307	feat(codex): pass through codex client identity headers	2026-03-25 08:48:18 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
lang-911	70988d387b	Add Codex websocket header defaults	2026-03-11 00:34:57 -07:00
Luis Pater	ac135fc7cb	Fixed: #1815 test(executor): add unit tests for prompt cache key generation in OpenAI `cacheHelper`	2026-03-05 22:49:23 +08:00
lyd123qw2008	a99522224f	refactor(codex): make retry-after parsing deterministic for tests	2026-02-21 14:13:38 +08:00
lyd123qw2008	f5d46b9ca2	fix(codex): honor usage_limit_reached resets_at for retry_after	2026-02-21 13:50:23 +08:00
Luis Pater	61da7bd981	Merge PR #1626 into codex/pr-1626	2026-02-19 04:49:14 +08:00
Luis Pater	e5b5dc870f	chore(executor): remove unused Openai-Beta header from Codex executor	2026-02-19 02:19:48 +08:00
Kirill Turanskiy	1f8f198c45	feat: passthrough upstream response headers to clients CPA previously stripped ALL response headers from upstream AI provider APIs, preventing clients from seeing rate-limit info, request IDs, server-timing and other useful headers. Changes: - Add Headers field to Response and StreamResult structs - Add FilterUpstreamHeaders helper (hop-by-hop + security denylist) - Add WriteUpstreamHeaders helper (respects CPA-set headers) - ExecuteWithAuthManager/ExecuteCountWithAuthManager now return headers - ExecuteStreamWithAuthManager returns headers from initial connection - All 11 provider executors populate Response.Headers - All handler call sites write filtered upstream headers before response Filtered headers (not forwarded): - RFC 7230 hop-by-hop: Connection, Transfer-Encoding, Keep-Alive, etc. - Security: Set-Cookie - CPA-managed: Content-Length, Content-Encoding	2026-02-18 00:16:22 +03:00
Luis Pater	ae1e8a5191	chore(runtime, registry): update Codex client version and GPT-5.3 model creation date	2026-02-13 12:47:48 +08:00
Luis Pater	b4e034be1c	refactor(executor): centralize Codex client version and user agent constants - Introduced `codexClientVersion` and `codexUserAgent` constants for better maintainability. - Updated `EnsureHeader` calls to use the new constants.	2026-02-06 05:30:28 +08:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
Luis Pater	09ecfbcaed	refactor(executor): optimize payload cloning and streamline SDK translator usage - Replaced unnecessary `bytes.Clone` calls for `opts.OriginalRequest` throughout executors. - Introduced intermediate variable `originalPayloadSource` to simplify payload processing. - Ensured better clarity and structure in request translation logic.	2026-02-06 01:44:20 +08:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00
Shady Khalifa	04b2290927	fix(codex): avoid empty prompt_cache_key	2026-01-27 19:06:42 +02:00
Shady Khalifa	53920b0399	fix(openai): drop stream for responses/compact	2026-01-27 18:27:34 +02:00
Shady Khalifa	95096bc3fc	feat(openai): add responses/compact support	2026-01-26 16:36:01 +02:00
hkfires	f30ffd5f5e	feat(executor): add request_id to error logs Extract error.message from JSON error responses when summarizing error bodies for debug logs	2026-01-25 21:31:46 +08:00


114 Commits