CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-23 02:36:49 +08:00

Author	SHA1	Message	Date
sususu98	1f2504ebcc	fix(claude): bypass signature sanitizer for non-Claude models (#3946 ) * fix(claude): bypass signature sanitizer for non-Claude models * test(claude): use subtests for sanitizer gate model cases v7.2.27	2026-06-21 23:06:38 +08:00
Luis Pater	369e560f1d	feat(api): refactor provider key logic for API key usage and add test for compatibility grouping - Extracted provider key determination into `apiKeyUsageProviderKey` for reuse and better readability. - Updated `GetAPIKeyUsage` to utilize the new function. - Added a new test case to validate compatibility grouping logic via `compat_name` attribute. Closes: #3940 #3941 v7.2.26	2026-06-21 11:37:52 +08:00
Luis Pater	babef2a1ef	feat(cliproxy): add `unregisterOpenAICompatExecutor` and sync runtime configuration - Implemented `unregisterOpenAICompatExecutor` to remove OpenAI-compatible executors dynamically. - Updated `ensureExecutorsForAuth` to unregister incompatible executors while handling plugin candidates. - Adjusted runtime configuration logic to ensure proper synchronization during updates with `syncPluginModelRuntime`. v7.2.25	2026-06-21 11:21:05 +08:00
Luis Pater	5bc0c68284	feat(pluginhost): improve error handling with HTTP status codes for plugin calls - Added `rpcPluginError` to encapsulate plugin errors with HTTP status codes. - Enhanced `decodeEnvelopeResult` to preserve and return detailed plugin errors with status codes. - Introduced `isPluginErrorEnvelope` to identify plugin error envelopes. - Updated plugin call logic in Unix and Windows loaders to differentiate plugin errors from system errors. - Added unit tests to verify error handling and status code preservation. v7.2.24	2026-06-21 10:12:10 +08:00
Luis Pater	31549af180	fix(watcher): update Gemini provider name to "gemini-cli" in file synthesizer logic	2026-06-21 09:38:10 +08:00
Luis Pater	1f21f946ba	feat(api): implement support for multi-auth expansion in plugin systems - Added logic to expand single auth JSON payloads into multiple plugin virtual auth records. - Updated related API endpoints such as `PatchAuthFileStatus` and `DeleteAuthFile` to handle plugin virtual auths with rollback mechanisms. - Introduced `NormalizePluginOAuthCallbackProvider` and other normalization functions for better handling of OAuth callbacks. - Enhanced tests to validate multi-auth parsing, rollback behavior, and API response consistency. v7.2.23	2026-06-21 07:24:29 +08:00
Luis Pater	9a8098d2d5	feat(api): prioritize non-template Codex client models and adjust priority calculation logic - Added `applyCodexClientNonTemplatePriorities` to assign higher priorities to non-template Codex client models dynamically. - Implemented `maxCodexClientTemplatePriority` to set base priority for non-template models relative to template models. - Updated unit tests to validate priority calculation for custom models.	2026-06-21 00:10:03 +08:00
Luis Pater	bb414de33f	feat(api): add "max" reasoning depth and `service_tiers` to Codex client models - Introduced a new "max" level for reasoning depth in Codex client model configuration, providing maximum problem-solving capability. - Added `service_tiers` field to model responses for better tier categorization. - Updated unit tests to validate the inclusion and default behavior of `service_tiers` and the new "max" reasoning depth.	2026-06-20 22:49:43 +08:00
Luis Pater	eb8d0d0684	Merge pull request #3900 from sususu98/fix/antigravity-replay-fc-order-upstream-dev fix(antigravity): HOME Gemini replay and functionCall ordering before functionResponse	2026-06-20 14:55:48 +08:00
Luis Pater	35c3d80ab9	feat(translator): add support for handling video URLs in Gemini requests - Updated `ConvertOpenAIRequestToGemini` to process `video_url` message types, extracting and embedding video metadata (MIME type and base64 data) into Gemini request parts. - Added corresponding unit tests to validate `video_url` handling, ensuring proper extraction and data preservation. Closes: #3920 v7.2.22	2026-06-20 14:46:39 +08:00
Luis Pater	09179a707f	feat(registry): add "max" level and remove deprecated Gemini models - Added a new "max" level to model `thinking.levels` for enhanced customization. - Removed deprecated Gemini 3 Pro (High/Low) models and associated fields from the registry. - Simplified `thinking` attribute structure for `grok-build-0.1` and `grok-composer-2.5-fast` models. Closes: #3901	2026-06-20 14:22:29 +08:00
Luis Pater	57e1bf97a5	feat(translator): ensure preservation of tool and call IDs in Gemini request and response translations - Updated `ConvertGeminiRequestToClaude`, `ConvertGeminiRequestToCodex`, and their respective response counterparts to include logic for retaining and using tool/call IDs when present from gateway-provided inputs. - Enhanced pairing logic between function calls and responses to handle custom and auto-generated IDs consistently. - Introduced tests validating ID preservation and proper behavior in both streaming and non-streaming flows. Closes: #3878	2026-06-20 14:12:14 +08:00
Luis Pater	011ffe1dd5	feat(translator): enforce FIFO order in tool call ID consumption for Gemini requests - Updated `ConvertGeminiRequestToOpenAI` to consume `tool_call_id`s in FIFO order, ensuring consistent mapping between calls and responses. - Added fallback logic for unmatched or extra function responses to generate unique `tool_call_id`s with a `call_` prefix. - Introduced comprehensive unit tests to validate order enforcement, distinct ID assignment, and fallback behavior. Closes: #3874	2026-06-20 14:01:50 +08:00
Luis Pater	5771abbc81	feat(management): add `ResetQuota` endpoint and auth manager quota reset functionality - Introduced a new `/reset-quota` API endpoint in the management handler to clear quota and cooldown state for auth records. - Implemented `ResetQuota` method in the auth manager to handle runtime and registry state resets for affected models. - Added tests to validate quota reset behavior, including proper state updates and registry consistency. - Refactored utility functions to support deduplication and registered models handling in quota resets. Closes: #3866	2026-06-20 13:49:40 +08:00
Luis Pater	b4bec344f6	feat(translator): sanitize `parametersJsonSchema` in OpenAI to Gemini request handling - Applied `CleanJSONSchemaForGemini` to ensure compatibility by removing unsupported fields and cleaning schema requirements. - Added test cases to validate schema transformation, ensuring only necessary `required` fields are retained. - Enhanced both request and response flows to consistently clean and sanitize tool parameter schemas. Closes: #3863	2026-06-20 13:34:05 +08:00
Luis Pater	a79ae80f6e	feat(registry): improve model fallback logic and refactor Claude model handling - Replaced internal model-specific constants with exported defaults (`DefaultClaudeMaxInputTokens`, `DefaultClaudeMaxOutputTokens`). - Refactored Claude model formatting to centralize schema application in `formatHomeClaudeModel`, with fields like `type`, `display_name`, and token limits. - Added new unit tests for fallback behavior, metadata fields, and token handling in Claude and Gemini models. - Removed deprecated test coverage for Amp provider model routes. Closes: #3833	2026-06-20 13:09:44 +08:00
Luis Pater	b0ca3794ad	Merge pull request #3834 from dcrdev/main	2026-06-20 12:56:25 +08:00
Luis Pater	790ec307f9	feat(config): add support for `rebuild_mid_system_message` configuration - Introduced `RebuildMidSystemMessage` field in config to move system messages into the top-level Claude system field. - Updated executor to handle mid-system message rebuilding when enabled via config or auth attributes. - Added unit tests to verify rebuilding behavior and default behavior when disabled. - Updated configuration example and API handlers to support the new field. Closes: #3792	2026-06-20 12:15:06 +08:00
Luis Pater	f66376f0b9	feat(auth): add per-auth OAuth model alias support - Introduced `SetOAuthModelAliasesAttribute` and `OAuthModelAliasesFromAttributes` for managing per-auth model aliases. - Enhanced OAuth model resolution logic to prioritize per-auth aliases over global aliases. - Updated metadata handling to extract and sanitize per-account model aliases. - Added tests to validate alias precedence, empty attributes, and conflict scenarios. Closes: #3764	2026-06-20 11:31:57 +08:00
Luis Pater	379167c992	feat(translator): add benchmarking for `convertSystemRoleToDeveloper` with large inputs - Introduced `BenchmarkConvertSystemRoleToDeveloperLargeInput` to evaluate performance on various input sizes and configurations. - Improved `convertSystemRoleToDeveloper` logic to optimize JSON rebuilding and reduce unnecessary operations. - Created helper `makeLargeResponsesInputForBenchmark` to generate test data for performance scenarios. Closes: #3751	2026-06-20 11:10:53 +08:00
Luis Pater	28e2f9798c	feat(executor): add session isolation for `grok-composer` models - Introduced `xaiRequiresIsolatedConversation` to enforce session ID generation for `grok-composer` models. - Updated request preparation logic to handle isolated conversations by setting `prompt_cache_key` and `x-grok-conv-id`. - Added unit tests with coverage for session isolation, stateless models, and explicit `prompt_cache_key` scenarios. Closes: #3750	2026-06-20 10:54:16 +08:00
Luis Pater	bc652c7bf0	feat(translator): add support for `text.format` conversion in OpenAI to Gemini requests - Implemented `applyOpenAIResponsesTextFormatToGemini` to handle `json_object` and `json_schema` formats. - Updated generation config to set appropriate `responseMimeType` and optional `responseJsonSchema`. - Introduced unit tests to validate correct handling of `text.format` conversion scenarios. Closes: #3721	2026-06-20 10:14:39 +08:00
Luis Pater	75fa62653f	feat(executor): normalize `parallel_tool_calls` based on `tools` presence - Added `normalizeCodexParallelToolCallsForTools` to conditionally remove `parallel_tool_calls` when `tools` are missing or empty. - Integrated normalization into Codex executor workflows for improved request handling. - Introduced unit tests to validate behavior across different tool scenarios. Closes: #3903	2026-06-20 09:47:45 +08:00
Luis Pater	4c78e40da4	feat(auth): unify provider key handling with OpenAI compatibility support - Replaced direct `strings.ToLower` usage with `util.OpenAICompatibleProviderKey` for generating provider keys. - Updated auth and executor workflows to use namespaced keys for OpenAI-compatible providers. - Adjusted tests to validate namespaced key handling, including new test cases for provider registration and execution logic. - Added `OpenAICompatibleProviderKey` helper in `util` for consistent key transformations. Closes: #3600 v7.2.21	2026-06-20 00:58:20 +08:00
Luis Pater	c44d4fcc7c	feat(schema): add removal of `$comment` and `enumDescriptions` in JSON schema processing - Updated JSON schema handling to remove `$comment` and `enumDescriptions` fields during schema transformations. - Adjusted test cases to validate the removal of these fields both at root and nested levels. - Expanded unsupported schema keywords to include `$comment` and `enumDescriptions` for Gemini compatibility. Closes: #3512	2026-06-20 00:22:08 +08:00
Luis Pater	34639c3cf9	feat(translator): defer Codex function call starts until function name is available - Updated `ConvertCodexResponseToClaude` to delay emitting `function_call` start events until the `name` field is resolved. - Introduced `pendingCodexFunctionCall` for buffering incomplete function calls. - Added tests to ensure proper behavior for deferred starts, including argument buffering and finalization. Closes: #3471	2026-06-20 00:11:12 +08:00
Luis Pater	51aa5ba925	feat(translator): preserve `input_audio` fields in OpenAI request conversions - Updated `ConvertOpenAIRequestToGemini` and `ConvertOpenAIRequestToCodex` to handle `input_audio`, retaining `data` and `format` fields. - Added helper `openAIInputAudioMimeType` for determining MIME types from audio formats. - Introduced unit tests to validate correct preservation of `input_audio` data and format. Closes: #3447	2026-06-20 00:03:54 +08:00
Luis Pater	1581700646	chore(deps): bump `github.com/jackc/pgx/v5` from v5.7.6 to v5.9.2 Closes: #3420	2026-06-19 23:50:48 +08:00
Luis Pater	1b849b6d8f	feat(translator): attach `reasoning_content` to assistant and tool messages in OpenAI response conversion - Enhanced `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to include `reasoning_content` in assistant and tool call messages. - Introduced `collectOpenAIResponsesReasoningContent` for aggregating reasoning summaries. - Added tests to validate reasoning attachment in various scenarios, including empty reasoning, tool calls, and reasoning followed by user messages. Closes: #3397	2026-06-19 21:36:55 +08:00
Luis Pater	041a065b2f	Merge branch 'remove-gemini-cli' into dev # Conflicts: # internal/api/handlers/management/auth_files.go # internal/thinking/provider/geminicli/apply.go v7.2.20	2026-06-19 14:40:29 +08:00
Luis Pater	aa2ad995a5	feat(translator): preserve `input_image` details in OpenAI response conversion - Updated handling in `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to retain `input_image` detail fields such as `image_url` and `detail`. - Added `TestConvertOpenAIResponsesRequestToOpenAIChatCompletions_PreservesInputImageDetail` to verify preservation of image details during transformation. Closes: #3385	2026-06-19 05:56:25 +08:00
Luis Pater	aed54adbea	feat(translator): preserve structured `tool_choice` in OpenAI response conversions - Updated `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to retain `tool_choice` with raw byte handling. - Added `TestConvertOpenAIResponsesRequestToOpenAIChatCompletions_PreservesStructuredToolChoice` to ensure function and type fields are preserved in transformations. Closes: #3384	2026-06-19 05:48:57 +08:00
Luis Pater	07c297a51b	feat(auth): add persistent cooldown state management with file-backed store - Introduced `CooldownStateStore` interface for managing independent cooldown state persistence. - Implemented `FileCooldownStateStore` for storing cooldown states as per-auth `.cds` files with atomic writes and stale file cleanup. - Enhanced `Manager` to support restoring state from `CooldownStateStore` and persisting state changes during auth updates. - Updated tests to validate cooldown state saving, loading, concurrency handling, and error scenarios. Closes: #3368 v7.2.19	2026-06-19 05:34:44 +08:00
Luis Pater	d33ac5e1e9	feat(auth): add transient error cooldown configuration and adjust retry logic - Introduced `SetTransientErrorCooldownSeconds` to enable configurable cooldowns for transient errors (e.g., 408/500/502/503/504). - Updated retry scheduling logic to use the new `nextTransientErrorRetryAfter` function. - Modified config parsing to include `transient-error-cooldown-seconds` with support for disabling or defaulting to legacy behavior. - Expanded tests to validate transient cooldown logic with various configurations and edge cases. Closes: #3315	2026-06-19 04:26:04 +08:00
Luis Pater	4926630a60	feat(translator): support namespace tools in OpenAI response transformations - Added `convertResponsesToolToOpenAIChatTools` and helper methods to handle namespace tools during request conversions. - Enhanced response handling to restore namespace context for function calls using `applyResponsesFunctionCallNamespaceFields` and related utilities. - Updated tests to validate namespace flattening, function call restoration, and non-stream response handling. Closes: #3298	2026-06-19 04:12:51 +08:00
Luis Pater	893412e965	feat(translator): normalize `service_tier` in Codex requests and add tests - Added `normalizeCodexServiceTier` to standardize `service_tier` values (`fast` to `priority`, omit unsupported tiers). - Updated `ConvertClaudeRequestToCodex` to apply normalization logic. - Introduced `TestConvertClaudeRequestToCodex_ServiceTier` to validate behavior across various `service_tier` inputs. Closes: #3276 #3294	2026-06-19 03:58:55 +08:00
Luis Pater	c020e2d03f	feat(translator): drop `apply_patch` custom tool in OpenAI responses - Added logic in `ConvertOpenAIResponsesRequestToClaude` to exclude `apply_patch` custom tools. - Introduced `isOpenAIResponsesApplyPatchCustomTool` helper function to identify and filter the tool. - Added `TestConvertOpenAIResponsesRequestToClaude_DropsApplyPatchCustomTool` to validate the behavior. Closes: #3243	2026-06-19 03:43:26 +08:00
Luis Pater	1d0551a991	feat(config): improve config reload handling and introduce async management save hook - Refactored `ConfigReloadHook` to use `reloadConfigFromWatcher` for consistency. - Added async `reloadConfigAfterManagementSaveAsync` to handle post-save operations. - Introduced `ReloadConfigIfChanged` in watcher for manual trigger support. - Enhanced config reload paths to separate auth synthesis from standard updates. - Updated `applyConfigUpdate` logic to allow more granular reload behaviors. Closes: #3235	2026-06-19 03:35:09 +08:00
Luis Pater	052f1934b8	fix(auth): classify transport errors as `home_unavailable` with retryable flag - Updated error handling in `RPopAuth` to distinguish `auth_not_found` from transport errors. - Added a new test, `TestPickNextViaHomeClassifiesTransportErrorsAsHomeUnavailable`, to validate correct error classification and retryable property. v7.2.18	2026-06-19 02:54:48 +08:00
Luis Pater	ae6c5eaea5	feat(runtime): add support for `gpt-image-1.5` and direct image API proxying - Introduced the `gpt-image-2` model in Codex built-ins and updated visibility logic in the registry. - Added direct proxy support for OpenAI image generation and editing endpoints. - Implemented new execution paths for `/images/generations` and `/images/edit`, ensuring seamless handling for both JSON and multipart payloads. - Expanded test coverage to validate the new model and direct proxy features, including streaming scenarios and error handling. v7.2.17	2026-06-19 00:06:12 +08:00
Luis Pater	41c52b9df6	test(management): add concurrency test for Codex OAuth session handling - Introduced `TestRequestCodexTokenCompletionKeepsConcurrentSessionPending` to validate proper handling of concurrent OAuth sessions. - Refactored Codex OAuth logic to use `newCodexOAuthService` for improved testability. Closes: #3171	2026-06-18 23:15:43 +08:00
Luis Pater	c13dbcc24e	feat(translator): add test and logic to ensure `object` schemas include `properties` field - Added `TestConvertClaudeRequestToOpenAI_ToolSchemaAddsMissingObjectProperties` to validate automatic addition of missing `properties` in `object` schemas. - Introduced `normalizeObjectSchemaProperties` to recursively ensure schemas of type `object` include an empty `properties` field if absent. - Updated `ConvertClaudeRequestToOpenAI` to apply schema normalization for improved compatibility with OpenAI schema expectations. Closes: #3165	2026-06-18 22:54:20 +08:00
Luis Pater	ac8fb9706f	feat(thinking): remove `thinkingConfig` for `ModeNone` with zero budget and no level - Updated Gemini, Gemini CLI, and Antigravity logic to delete `thinkingConfig` when `ModeNone` is set, `Budget=0`, and `Level` is empty. - Adjusted tests to validate this behavior across multiple scenarios and models with zero-allowed configurations. - Extended test cases for additional coverage of mixed-model behavior. Closes: #3138	2026-06-18 22:39:02 +08:00
sususu98	ec8c2c2913	test(antigravity): cover invalid-signature replay cache clear Add executor httptest for upstream 400 clearing HOME reasoning replay items, and a whitespace call_id matcher regression test for replay filtering.	2026-06-18 22:31:50 +08:00
sususu98	c55157dc2e	fix(antigravity): PR review replay scope, signature merge, and tool keys - Derive replay session from generateStableSessionID when sessionId is absent - Merge cached thoughtSignature into existing functionCall by call id - Allow filter to pass function_call_part when only signature replay is needed - Include native functionCall id in replay dedupe keys - Add unit tests for signature merge, stable scope, and parallel tool ids	2026-06-18 20:29:44 +08:00
sususu98	ef19f5fc81	fix(antigravity): address review on replay call_id and args parsing Trim call_id once for matching-function-response checks; use args.Value() in synthetic model functionCall insertion; guard functionResponse lookup when call_id is empty.	2026-06-18 19:52:54 +08:00
sususu98	b17d29ad35	fix(antigravity): insert replayed functionCall before matching functionResponse When HOME replay restores a cached function_call_part and the request already contains the matching functionResponse, insert a synthetic role=model content with functionCall (and thoughtSignature) immediately before that response content instead of writing into the same index. Add regression tests for user/model functionResponse shapes.	2026-06-18 19:50:46 +08:00
sususu98	292456a884	feat(antigravity): HOME reasoning replay for Gemini models Add executor-scoped replay cache aligned with Codex HOME replay: Scope, observe SSE/non-stream responses, store normalized thought_signature and function_call_part items, apply on the next streamGenerateContent request, and invalidate on invalid signature responses. Gemini/flash/agent models use HOME replay; native per-part signature replay is not wired on upstream/dev. Wire non-stream and stream paths in antigravity_executor and purge expired entries from signature_cache. Includes unit tests and HOME-provider-replay documentation.	2026-06-18 19:50:28 +08:00
sususu98	62c4b377dd	Revert "feat(antigravity): HOME reasoning replay for Gemini models" This reverts commit `365e8fc2ca`.	2026-06-18 16:02:50 +08:00
sususu98	365e8fc2ca	feat(antigravity): HOME reasoning replay for Gemini models Add executor-scoped replay cache aligned with Codex HOME replay: Scope, observe SSE/non-stream responses, store normalized thought_signature and function_call_part items, apply on the next streamGenerateContent request, and invalidate on invalid signature responses. Gemini/flash/agent models use HOME replay; native per-part signature replay is not wired on upstream/dev. Wire non-stream and stream paths in antigravity_executor and purge expired entries from signature_cache. Includes unit tests and HOME-provider-replay documentation.	2026-06-18 14:37:11 +08:00

1 2 3 4 5 ...

2832 Commits