CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-05-22 20:29:40 +08:00

Author	SHA1	Message	Date
Luis Pater	8300ee8bbe	feat(auth): enhance home auth session reuse with scoped caching and ref counting - Added `homeRuntimeAuthSessions` and `homeRuntimeAuthRefs` for scoped caching of home auths per session. - Updated `pickNextViaHome` to prevent reuse of already-tried pinned auths during session retries. - Implemented reference counting for shared auths across multiple sessions to improve memory management. - Enhanced session cleanup logic to clear cached auths only when all referencing sessions are closed. - Added unit tests to validate scoped caching, retry logic, and session cleanup behavior.	2026-05-10 14:00:13 +08:00
Luis Pater	dc1cc7f115	feat(auth): add websocket session reuse for home auths with caching support - Introduced `homeRuntimeAuths` to cache home auths for websocket session reuse. - Updated `pickNextViaHome` to prioritize cached auths for pinned websocket sessions. - Implemented automatic clearing of cached home auths when home mode is disabled. - Added unit tests to validate caching behavior, clearing logic, and fallback scenarios.	2026-05-10 13:39:14 +08:00
Luis Pater	a44e5eb1ab	Merge branch 'v7' into dev v7.0.0	2026-05-10 02:33:42 +08:00
Luis Pater	67fb4eb98e	feat(auth): add `shouldReturnLastErrorOnPickFailure` helper and improve error handling in home mode - Introduced `shouldReturnLastErrorOnPickFailure` to streamline error return logic during provider selection. - Added `isHomeRequestRetryExceededError` for better home-specific error classification. - Updated fallback conditions to enhance error handling clarity in `pickNextMixed`.	2026-05-10 02:09:53 +08:00
Luis Pater	66c3dae06b	feat(home): implement `count` for home auth dispatch requests and enable usage statistics - Added `count` attribute to `homeAuthCount` requests to improve home message batching. - Enabled usage statistics for home mode by default and added config-level enforcement. - Adjusted failure logging to include detailed metadata in `UsageReporter`. - Updated multiple executors to pass error details to `PublishFailure` for better debugging. - Enhanced unit tests to validate `count` behavior and usage statistics enforcement across components.	2026-05-10 01:30:43 +08:00
Luis Pater	1abf8625d8	feat(logging): add home request-log forwarding support - Introduced `SetHomeEnabled` to enable/disable request-log forwarding to the home control plane. - Implemented `forwardRequestLogToHome` for non-streaming logs and `homeStreamingLogWriter` for real-time streaming logs. - Enhanced `FileRequestLogger` to bypass local logging when home forwarding is enabled. - Updated server configuration to dynamically toggle home request-log forwarding based on changes. - Added corresponding unit tests to ensure correct forwarding behavior and fallback mechanisms.	2026-05-09 23:39:59 +08:00
Luis Pater	3cbd587b2c	Merge pull request #3283 from wuchulonly/fix/responses-ws-tool-output-context Fix Responses WebSocket tool output context repair	2026-05-09 21:08:43 +08:00
Luis Pater	41f4ee7c7d	feat(auth): enhance auth index generation with improved file path handling - Updated `EnsureIndex` logic to incorporate absolute and cleaned file paths when generating auth indexes. - Refined metadata handling to include OAuth type in auth index seed. - Improved compatibility for `json` file paths as sources in auth attributes. - Added unit tests to validate correct auth index behavior for various path and type scenarios.	2026-05-09 21:03:11 +08:00
Luis Pater	c69ff49758	feat(auth): add support for persisting `disabled` flag in token storage - Updated `FileTokenStore` and related stores (`objectstore`, `gitstore`, `postgresstore`) to include the `disabled` flag in metadata for token storage. - Adjusted `Auth` metadata handling to initialize empty maps when absent. - Refined logic in `auto_refresh_loop` and `conductor` to exclude `disabled` tokens from refresh checks. - Added comprehensive unit tests to verify proper handling of the `disabled` flag in storage and retrieval operations.	2026-05-09 19:48:42 +08:00
Luis Pater	68fddaa319	Merge pull request #3292 from lihan3238/fix-3272 fix: apply default auth-dir when config value is empty	2026-05-09 17:34:05 +08:00
Luis Pater	09ac8a1165	Merge pull request #3221 from mochenya/main fix(executor): ignore null OpenAI stream usage chunks	2026-05-09 11:53:26 +08:00
Luis Pater	0dcb8bd714	refactor(cliproxy): remove `ClaudeCodeSessionAffinity` support and simplify session affinity logic	2026-05-09 10:51:49 +08:00
Luis Pater	0f0fcd2304	feat(config): add per-auth `disable_cooling` override support - Introduced `disable_cooling` metadata field for fine-grained control over cooldown scheduling. - Updated `Auth` object to include `Metadata` with conditional logic for handling empty states. - Added YAML configuration support for `disable_cooling` in API key definitions across providers. - Enhanced unit tests to validate `disable_cooling` behavior in various scenarios.	2026-05-09 10:51:27 +08:00
Luis Pater	c67096b687	feat(server): add support for loading configuration from a remote home control plane - Introduced `-home` and `-home-password` flags for specifying home control plane address and authentication. - Implemented fetching and parsing configuration from the home control plane when `-home` is used. - Adjusted server configuration handling to bypass local config files when loading from home. - Ensured compatibility with cloud deploy mode and validation of home configurations.	2026-05-09 07:14:44 +08:00
Luis Pater	1721994111	feat(management): expose additional OAuth and configuration helpers - Added new helper methods for OAuth session management (`RegisterOAuthSession`, `CompleteOAuthSession`, etc.). - Introduced `WriteConfig` for persisting management configurations. - Exported `Handler` type and `NewHandler` constructors for SDK consumers.	2026-05-09 00:23:45 +08:00
lihan3238	4cbe172934	refactor: extract DefaultAuthDir constant per review feedback	2026-05-08 22:28:38 +08:00
lihan3238	4071fdef84	fix: apply default auth-dir when config value is empty When auth-dir is not specified in config.yaml, ResolveAuthDir returns an empty string which causes os.MkdirAll to fail with no path. Use the documented default ~/.cli-proxy-api instead. Fixes #3272	2026-05-08 21:47:41 +08:00
Codex	c883114a4d	fix responses websocket tool output context	2026-05-08 05:12:30 +00:00
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	785b00c312	Merge pull request #3237 from seakee/docs/add-cpa-manager-usage-statistics docs: add CPA-Manager to usage statistics recommendations v6.10.9	2026-05-07 09:32:32 +08:00
Luis Pater	a034cf8b8d	Merge pull request #3247 from edlsh/fix/amp-thread-actors-route fix(amp): proxy thread actors route	2026-05-07 09:31:16 +08:00
edlsh	01171742a6	fix(amp): proxy thread actors route	2026-05-06 13:12:35 -04:00
Luis Pater	fb08b92402	feat(executor): add upstream disconnect handling for Codex WebSocket sessions - Introduced `UpstreamDisconnectChan` for Codex WebSocket sessions to notify downstream connections of upstream disconnections. - Implemented `notifyUpstreamDisconnect` to signal errors and close channels on disconnect events. - Added integration tests to validate WebSocket session behavior on upstream disconnect. - Updated OpenAI WebSocket response handlers to properly close connections upon upstream disconnect notifications.	2026-05-06 22:09:33 +08:00
seakee	ad3f4f2ce5	📝 docs(readme): add CPA-Manager usage statistics recommendation Add CPA-Manager to the Usage Statistics recommendations across English, Chinese, and Japanese READMEs. Highlight request-level monitoring, cost estimation, LiteLLM price sync, SQLite persistence, and Codex account-pool operations for multi-account maintenance.	2026-05-06 15:49:57 +08:00
Luis Pater	ed1458aa6d	chore(docs): update sponsor details in README - Replaced sponsor `z.ai` with `PackyCode` and updated related descriptions, images, and links in `README.md`, `README_CN.md`, and `README_JA.md`. - Removed outdated sponsor entries for `Poixe AI` in all README files. - Added new image assets for PackyCode (`packycode-cn.png` and `packycode-en.png`).	2026-05-06 00:41:50 +08:00
mochenya	99dfbaef61	fix(executor): ignore null OpenAI stream usage chunks - Added validation so OpenAI-style usage parsing only accepts object payloads with token fields. - Prevented streaming usage:null chunks from publishing zero-token records before the final usage chunk arrives. - Reused the shared OpenAI-style parser for stream usage to support both chat completions and responses token field names. - Added tests covering null usage chunks and input/output token usage fields in streaming responses.	2026-05-05 12:31:33 +08:00
Luis Pater	da6c599efd	refactor(management): rename `GetUsage` to `GetUsageQueue` and update routes/tests - Renamed handler and test methods for better clarity on functionality. - Updated route from `/v0/management/usage` to `/v0/management/usage-queue`. - Adjusted integration and unit tests to reflect new naming and routes. v6.10.8	2026-05-05 03:02:25 +08:00
Luis Pater	61b39d49bd	feat(management): add usage record retrieval endpoint - Implemented `/v0/management/usage` endpoint for fetching queued usage records from Redis. - Included validation for `count` parameter to ensure positive integers. - Added unit tests for queue retrieval and validation, with authentication validation in integration tests. - Updated management routing to include the new endpoint.	2026-05-05 02:53:04 +08:00
Luis Pater	ba5d8ca733	feat(usage): add support for requested model alias handling - Introduced methods for setting and retrieving model aliases in execution and usage contexts. - Enhanced `UsageReporter` and related structures to include client-requested aliases. - Updated tests to validate alias propagation and ensure correct usage reporting. - Adjusted metadata handling in CLIProxyAPI executors to address alias integration. v6.10.7	2026-05-05 01:47:53 +08:00
Luis Pater	28b4b19e7e	Merge pull request #3208 from kdcokenny/codex-websocket-protocol-parity Align Codex websocket protocol semantics	2026-05-05 01:29:19 +08:00
Luis Pater	bdc424007e	Merge pull request #2896 from edlsh/fix/oauth-tool-rename-per-request-map fix(amp): smart-mode tool name fixes + deep-mode response repair v6.10.6	2026-05-05 00:58:39 +08:00
Luis Pater	e4a93c02c5	fix(executor): enhance parsing of OpenAI stream data lines - Added trimming for stream input lines to prevent processing of unnecessary whitespace. - Improved handling of unsupported prefixes and malformed JSON responses, ensuring errors are recorded and propagated appropriately. Fixed: #2690 v6.10.5	2026-05-04 23:42:26 +08:00
Luis Pater	8262a03f29	Merge PR #2568 : fix Claude refresh backoff	2026-05-04 21:44:11 +08:00
Luis Pater	ecf1c2590c	fix: preserve Antigravity cancellation errors	2026-05-04 21:18:18 +08:00
Luis Pater	162897e02a	Merge remote-tracking branch 'origin/pr/3205' into dev	2026-05-04 21:17:01 +08:00
Luis Pater	c1caa454b3	fix(translator): handle empty tool function names in OpenAI Claude responses - Added check to prevent processing of empty `function.name` values, ensuring valid data is handled. Fixed: #2557	2026-05-04 21:00:33 +08:00
Luis Pater	bf6fa402e2	fix(executor): strip Vertex OpenAI response tool call IDs for consistency - Integrated `StripVertexOpenAIResponsesToolCallIDs` to remove tool call ID data from request bodies and translated requests. - Ensures uniformity and avoids unnecessary payload data propagation. Fixed: #2549	2026-05-04 17:54:16 +08:00
Luis Pater	85c0150653	feat(translator): add token usage tracking and improve usage handling - Introduced `claudeUsageTokens` struct for detailed token usage tracking. - Replaced `calculateClaudeUsageTokens` with `Merge` and `OpenAIUsage` methods for better modularity. - Enhanced integration of usage tokens into response processing, enabling more accurate reporting of token details. Fixed: #2419	2026-05-04 16:57:50 +08:00
Luis Pater	89d80bfff4	fix(executor): adjust ApplyThinking order and add payload override test - Moved `ApplyThinking` logic earlier in `openai_compat_executor` to align with configuration application sequence. - Added test to verify payload override precedence over Thinking suffix configuration.	2026-05-04 16:45:25 +08:00
Luis Pater	a1eba112f3	Merge pull request #2416 from kslamph/fix/gemini-cli-projectid fix(gemini-cli): use backend project ID from onboarding response	2026-05-04 16:08:31 +08:00
Kenny	6b4bc0a9a8	Align Codex default identity and docs	2026-05-03 21:13:37 -07:00
Kenny	08b0fe6816	Fix Codex websocket retry metadata	2026-05-03 19:01:44 -07:00
Kenny	c19ae1d5be	Align Codex websocket protocol semantics	2026-05-03 15:56:39 -07:00
Luis Pater	17be6442a8	fix(translator): improve tool response handling for non-string content - Added `setToolCallOutputContent` to process various content types, including arrays and fallback cases. - Implemented robust handling for specific tool output types like text, image URLs, and files, ensuring proper serialization. - Improved fallback logic to handle unexpected or missing data. Fixed: #2313 Closes: #2349 v6.10.4	2026-05-04 05:50:01 +08:00
Luis Pater	38dad2afdf	chore(docker): upgrade base image to alpine 3.23 Fixed: #2265 v6.10.3	2026-05-04 05:36:09 +08:00
Luis Pater	8e6ef3fa64	fix(websocket): ensure state consistency on auth errors in streaming - Added logic to reset `pinnedAuthID` and replay transcript on unauthorized, forbidden, or throttling errors. - Enhanced error handling in `forwardResponsesWebsocket` with detailed status inspection. - Introduced `shouldReleaseResponsesWebsocketPinnedAuth` to determine auth reset conditions. - Updated state management to preserve prior request and response data during forced replay. Fixed: #2230	2026-05-04 05:23:23 +08:00
Luis Pater	a1487b0958	fix(translator): handle non-string types in tools result processing - Skip setting values for non-string `type` fields to prevent runtime errors. Closes: #2226	2026-05-04 05:08:31 +08:00
Luis Pater	82ebe24b9e	Merge pull request #2266 from DragonFSKY/fix/ws-compact-tool-output-mismatch fix(websocket): skip stale state merge after client-side compact	2026-05-04 04:40:43 +08:00
Luis Pater	2753d9fb71	feat: add validation for Claude streaming responses - Implemented `validateClaudeStreamingResponse` to ensure upstream streaming data integrity. - Added new tests to verify response validation, including empty streams, error events, incomplete streams, and valid streams. - Integrated validation logic into the Claude executor's streaming handler, returning detailed errors for malformed upstream data. Fixed: #2193	2026-05-04 03:37:31 +08:00
1137043480	bf0e5c23f7	fix: prevent goroutine leaks in streaming executors via context-aware channel sends All streaming executors use bare channel sends (out <- chunk) inside goroutines that process upstream SSE responses. When the downstream consumer disconnects (client timeout, network drop, etc.), these sends block indefinitely, causing the goroutine and all associated resources (HTTP response body, scanner buffers, translation state) to leak permanently. Over time, leaked goroutines accumulate monotonically, leading to RSS growth from ~30MB to 3.7GB+ and eventual OOM kills on resource-constrained VPS hosts. Fix: Replace all bare 'out <- ...' sends with: select { case out <- ...: case <-ctx.Done(): return } This ensures goroutines terminate promptly when the request context is canceled, allowing GC to reclaim all associated resources. Affected executors (9 files, 36+ send sites): - antigravity_executor.go (5 sites) - gemini_cli_executor.go (6 sites) - gemini_vertex_executor.go (6 sites) - aistudio_executor.go (4 sites) - gemini_executor.go (3 sites) - openai_compat_executor.go (3 sites) - claude_executor.go (4 sites) - codex_executor.go (2 sites) - kimi_executor.go (3 sites)	2026-05-03 11:25:04 -04:00

1 2 3 4 5 ...

2454 Commits