CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-05-10 07:46:25 +08:00

Author	SHA1	Message	Date
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	a034cf8b8d	Merge pull request #3247 from edlsh/fix/amp-thread-actors-route fix(amp): proxy thread actors route	2026-05-07 09:31:16 +08:00
edlsh	01171742a6	fix(amp): proxy thread actors route	2026-05-06 13:12:35 -04:00
Luis Pater	fb08b92402	feat(executor): add upstream disconnect handling for Codex WebSocket sessions - Introduced `UpstreamDisconnectChan` for Codex WebSocket sessions to notify downstream connections of upstream disconnections. - Implemented `notifyUpstreamDisconnect` to signal errors and close channels on disconnect events. - Added integration tests to validate WebSocket session behavior on upstream disconnect. - Updated OpenAI WebSocket response handlers to properly close connections upon upstream disconnect notifications.	2026-05-06 22:09:33 +08:00
Luis Pater	da6c599efd	refactor(management): rename `GetUsage` to `GetUsageQueue` and update routes/tests - Renamed handler and test methods for better clarity on functionality. - Updated route from `/v0/management/usage` to `/v0/management/usage-queue`. - Adjusted integration and unit tests to reflect new naming and routes.	2026-05-05 03:02:25 +08:00
Luis Pater	61b39d49bd	feat(management): add usage record retrieval endpoint - Implemented `/v0/management/usage` endpoint for fetching queued usage records from Redis. - Included validation for `count` parameter to ensure positive integers. - Added unit tests for queue retrieval and validation, with authentication validation in integration tests. - Updated management routing to include the new endpoint.	2026-05-05 02:53:04 +08:00
Luis Pater	ba5d8ca733	feat(usage): add support for requested model alias handling - Introduced methods for setting and retrieving model aliases in execution and usage contexts. - Enhanced `UsageReporter` and related structures to include client-requested aliases. - Updated tests to validate alias propagation and ensure correct usage reporting. - Adjusted metadata handling in CLIProxyAPI executors to address alias integration.	2026-05-05 01:47:53 +08:00
Luis Pater	28b4b19e7e	Merge pull request #3208 from kdcokenny/codex-websocket-protocol-parity Align Codex websocket protocol semantics	2026-05-05 01:29:19 +08:00
Luis Pater	bdc424007e	Merge pull request #2896 from edlsh/fix/oauth-tool-rename-per-request-map fix(amp): smart-mode tool name fixes + deep-mode response repair	2026-05-05 00:58:39 +08:00
Luis Pater	e4a93c02c5	fix(executor): enhance parsing of OpenAI stream data lines - Added trimming for stream input lines to prevent processing of unnecessary whitespace. - Improved handling of unsupported prefixes and malformed JSON responses, ensuring errors are recorded and propagated appropriately. Fixed: #2690	2026-05-04 23:42:26 +08:00
Luis Pater	8262a03f29	Merge PR #2568 : fix Claude refresh backoff	2026-05-04 21:44:11 +08:00
Luis Pater	ecf1c2590c	fix: preserve Antigravity cancellation errors	2026-05-04 21:18:18 +08:00
Luis Pater	162897e02a	Merge remote-tracking branch 'origin/pr/3205' into dev	2026-05-04 21:17:01 +08:00
Luis Pater	c1caa454b3	fix(translator): handle empty tool function names in OpenAI Claude responses - Added check to prevent processing of empty `function.name` values, ensuring valid data is handled. Fixed: #2557	2026-05-04 21:00:33 +08:00
Luis Pater	bf6fa402e2	fix(executor): strip Vertex OpenAI response tool call IDs for consistency - Integrated `StripVertexOpenAIResponsesToolCallIDs` to remove tool call ID data from request bodies and translated requests. - Ensures uniformity and avoids unnecessary payload data propagation. Fixed: #2549	2026-05-04 17:54:16 +08:00
Luis Pater	85c0150653	feat(translator): add token usage tracking and improve usage handling - Introduced `claudeUsageTokens` struct for detailed token usage tracking. - Replaced `calculateClaudeUsageTokens` with `Merge` and `OpenAIUsage` methods for better modularity. - Enhanced integration of usage tokens into response processing, enabling more accurate reporting of token details. Fixed: #2419	2026-05-04 16:57:50 +08:00
Luis Pater	89d80bfff4	fix(executor): adjust ApplyThinking order and add payload override test - Moved `ApplyThinking` logic earlier in `openai_compat_executor` to align with configuration application sequence. - Added test to verify payload override precedence over Thinking suffix configuration.	2026-05-04 16:45:25 +08:00
Luis Pater	a1eba112f3	Merge pull request #2416 from kslamph/fix/gemini-cli-projectid fix(gemini-cli): use backend project ID from onboarding response	2026-05-04 16:08:31 +08:00
Kenny	6b4bc0a9a8	Align Codex default identity and docs	2026-05-03 21:13:37 -07:00
Kenny	08b0fe6816	Fix Codex websocket retry metadata	2026-05-03 19:01:44 -07:00
Kenny	c19ae1d5be	Align Codex websocket protocol semantics	2026-05-03 15:56:39 -07:00
Luis Pater	17be6442a8	fix(translator): improve tool response handling for non-string content - Added `setToolCallOutputContent` to process various content types, including arrays and fallback cases. - Implemented robust handling for specific tool output types like text, image URLs, and files, ensuring proper serialization. - Improved fallback logic to handle unexpected or missing data. Fixed: #2313 Closes: #2349	2026-05-04 05:50:01 +08:00
Luis Pater	a1487b0958	fix(translator): handle non-string types in tools result processing - Skip setting values for non-string `type` fields to prevent runtime errors. Closes: #2226	2026-05-04 05:08:31 +08:00
Luis Pater	2753d9fb71	feat: add validation for Claude streaming responses - Implemented `validateClaudeStreamingResponse` to ensure upstream streaming data integrity. - Added new tests to verify response validation, including empty streams, error events, incomplete streams, and valid streams. - Integrated validation logic into the Claude executor's streaming handler, returning detailed errors for malformed upstream data. Fixed: #2193	2026-05-04 03:37:31 +08:00
1137043480	bf0e5c23f7	fix: prevent goroutine leaks in streaming executors via context-aware channel sends All streaming executors use bare channel sends (out <- chunk) inside goroutines that process upstream SSE responses. When the downstream consumer disconnects (client timeout, network drop, etc.), these sends block indefinitely, causing the goroutine and all associated resources (HTTP response body, scanner buffers, translation state) to leak permanently. Over time, leaked goroutines accumulate monotonically, leading to RSS growth from ~30MB to 3.7GB+ and eventual OOM kills on resource-constrained VPS hosts. Fix: Replace all bare 'out <- ...' sends with: select { case out <- ...: case <-ctx.Done(): return } This ensures goroutines terminate promptly when the request context is canceled, allowing GC to reclaim all associated resources. Affected executors (9 files, 36+ send sites): - antigravity_executor.go (5 sites) - gemini_cli_executor.go (6 sites) - gemini_vertex_executor.go (6 sites) - aistudio_executor.go (4 sites) - gemini_executor.go (3 sites) - openai_compat_executor.go (3 sites) - claude_executor.go (4 sites) - codex_executor.go (2 sites) - kimi_executor.go (3 sites)	2026-05-03 11:25:04 -04:00
Luis Pater	672fdd14ed	feat: filter and drop empty assistant messages in Kimi executor - Added `filterKimiEmptyAssistantMessages` to identify and remove empty assistant messages with no content, tool links, or reasoning. - Integrated logging to track the number of dropped messages. - Updated tests to validate the filtering logic for both empty and valid assistant messages. Fixed: #1730	2026-05-03 22:40:42 +08:00
Luis Pater	af65908cb0	feat: enhance tool mapping with namespace and web search support - Added functions to handle tool conversion, including namespace-based tools and web search tools. - Improved parameter normalization and tool input schema standardization. - Integrated logic to handle qualified tool names and map override functionality. - Refactored existing tool processing for better extensibility and maintainability. Fixed: #3199	2026-05-03 22:26:23 +08:00
Luis Pater	56df36895a	feat: add configurable retention period for Redis usage queue - Introduced `redis-usage-queue-retention-seconds` config parameter with a default of 60 seconds and a max of 3600 seconds. - Updated logic in `redisqueue` to honor configurable retention periods for enqueued usage data. - Modified config validation and initialization to support and enforce retention limits. - Enhanced change tracking in `config_diff` to detect updates to this parameter.	2026-05-02 20:43:16 +08:00
Luis Pater	18bb9c315f	chore: remove usage tracking and logging functionality - Deleted the `LoggerPlugin` along with associated usage tracking and in-memory statistics logic. - Removed all related tests (`logger_plugin_test.go`, `usage_tab_test.go`) and external-facing handler (`usage.go`) for usage statistics export/import. - Cleaned up TUI integration by deleting `usage_tab.go`.	2026-05-02 04:50:58 +08:00
Luis Pater	b8bba053fc	feat: add tracking for auth request success and failure counts - Introduced `Success` and `Failed` fields in auth records to track request outcomes. - Updated `/v0/management/auth-files` and `/v0/management/api-key-usage` responses to include success and failure counts. - Enhanced tests to validate tracking logic and API responses.	2026-05-02 03:40:00 +08:00
Luis Pater	8c2f1a80d3	feat: enhance API key usage grouping with base URL inclusion - Updated `GetAPIKeyUsage` to group API key usage by "base_url|api_key" composite keys. - Adjusted logic to handle `base_url` extraction from auth attributes. - Revised unit tests to validate "base_url|api_key" grouping behavior.	2026-05-02 02:20:49 +08:00
Luis Pater	b0dc9df887	feat: add API key usage endpoint with provider and key grouping - Implemented `GetAPIKeyUsage` to expose recent request data grouped by provider and API key. - Added supporting function `mergeRecentRequestBuckets` for bucket aggregation. - Registered new endpoint `/v0/management/api-key-usage` in the management API. - Included extensive unit tests for provider and key-based grouping validation. - Updated `formatRecentRequestBucketLabel` to support configurable bucket duration.	2026-05-01 23:34:18 +08:00
Luis Pater	6187919000	feat: add support for recent request tracking in auth records - Implemented `RecentRequestsSnapshot` in `Auth` to capture bucketed recent request data. - Added new fields and methods to `Auth` for tracking request success and failure counts over time. - Updated `/v0/management/auth-files` response to include recent request data for each auth record. - Introduced unit tests to validate request tracking and snapshot generation logic.	2026-05-01 22:55:22 +08:00
Luis Pater	4035abc0cd	refactor(logging): replace gin-specific context handling with generic context-based request metadata utilities - Introduced reusable utilities in `requestmeta` to manage endpoint and response status in request contexts. - Refactored plugins and handlers to use context-based metadata, removing direct dependency on `gin`. - Updated tests to validate new context utilities and replaced `gin`-based context handling. Fixed: #3166	2026-04-30 23:36:07 +08:00
Luis Pater	6ba7c810a7	feat: apply image_generation filtering before payload rules - Updated `ApplyPayloadConfigWithRoot` to prioritize `disable-image-generation` filtering before applying payload rules. - Ensured payload overrides can explicitly re-enable `image_generation` when required. - Added unit tests to validate `image_generation` restoration through overrides.	2026-04-30 12:42:08 +08:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	46018417ad	feat: remove `tool_choice` for `image_generation` when disabled - Added logic to remove `tool_choice` entries of type `image_generation` from payloads when `disable-image-generation` is enabled. - Updated `ApplyPayloadConfigWithRoot` to handle new removal logic. - Added unit tests to verify `tool_choice` removal behavior.	2026-04-30 08:24:14 +08:00
Luis Pater	e3e60f914b	feat: support disabling image generation globally - Added `disable-image-generation` configuration flag to disable the `image_generation` tool globally. - Updated payload handling to remove `image_generation` tools from request payload arrays when the flag is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to return 404 when the feature is disabled. - Enhanced configuration diff logging to track changes for the `disable-image-generation` flag. - Added accompanying unit tests for the new feature in payload helpers and image handler logic.	2026-04-30 03:42:27 +08:00
Luis Pater	a1f0ed9575	Merge pull request #3071 from sususu98/fix/antigravity-credits-log Mark Antigravity credits requests in access logs	2026-04-29 22:56:41 +08:00
Luis Pater	b3b92f334e	Merge pull request #3125 from sususu98/fix/gemini-cli-usage-metadata-variants fix: parse gemini cli usage metadata variants	2026-04-29 22:56:14 +08:00
sususu98	0d107dd566	fix: respect declared claude web search tool names	2026-04-29 19:24:53 +08:00
sususu98	1c0c426b85	fix: align claude codex translation	2026-04-29 18:47:03 +08:00
sususu98	4982512da2	fix: parse gemini cli usage metadata variants	2026-04-29 13:10:53 +08:00
Luis Pater	2ea8f77efb	feat(models): add GPT-5.5 to the registry with support for advanced tasks	2026-04-29 09:49:26 +08:00
Luis Pater	f95ab4cdf1	Merge pull request #3089 from XYenon/feat/session-affinity feat: support Codex/PI session headers for session affinity	2026-04-28 23:54:16 +08:00
sususu98	0e1235122e	fix antigravity client agent headers	2026-04-28 19:04:40 +08:00
sususu98	e78d45acc9	fix antigravity user agent handling	2026-04-28 19:04:40 +08:00
xbang	a992dee4e8	fix(antigravity): use real antigravity UA when polling credits balance The loadCodeAssist polling call hardcoded the User-Agent to google-api-nodejs-client/9.15.1. Google Cloud Code returns the paidTier object WITHOUT the availableCredits array for that UA, so updateAntigravityCreditsBalance always saw "no credits", set the hint to Available=false for every Google One AI Ultra account, and the conductor-level credits fallback could never find a candidate. Switching to resolveUserAgent(auth) (the same UA used for streamGenerateContent / generateContent) makes the response include availableCredits, so the credits hint is populated correctly and the fallback can actually inject enabledCreditTypes:["GOOGLE_ONE_AI"] when free tier is exhausted.	2026-04-28 16:21:15 +08:00
XYenon	3ac39dcc7d	feat: support Codex/PI session headers for session affinity Amp-Thread-ID: https://ampcode.com/threads/T-019dce25-c070-773a-ac52-11c541220b30 Co-authored-by: Amp <amp@ampcode.com>	2026-04-27 17:10:50 +08:00
sususu	c523101439	Preserve Codex reasoning signatures for Claude	2026-04-27 16:51:35 +08:00

1770 Commits