CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-23 00:35:09 +08:00

Author	SHA1	Message	Date
hkfires	1a0784ae49	feat(registry): enhance model ID resolution and add case-insensitive matching for model lookups	2026-06-20 17:16:45 +08:00
Luis Pater	eb8d0d0684	Merge pull request #3900 from sususu98/fix/antigravity-replay-fc-order-upstream-dev fix(antigravity): HOME Gemini replay and functionCall ordering before functionResponse	2026-06-20 14:55:48 +08:00
Luis Pater	35c3d80ab9	feat(translator): add support for handling video URLs in Gemini requests - Updated `ConvertOpenAIRequestToGemini` to process `video_url` message types, extracting and embedding video metadata (MIME type and base64 data) into Gemini request parts. - Added corresponding unit tests to validate `video_url` handling, ensuring proper extraction and data preservation. Closes: #3920	2026-06-20 14:46:39 +08:00
Luis Pater	09179a707f	feat(registry): add "max" level and remove deprecated Gemini models - Added a new "max" level to model `thinking.levels` for enhanced customization. - Removed deprecated Gemini 3 Pro (High/Low) models and associated fields from the registry. - Simplified `thinking` attribute structure for `grok-build-0.1` and `grok-composer-2.5-fast` models. Closes: #3901	2026-06-20 14:22:29 +08:00
Luis Pater	57e1bf97a5	feat(translator): ensure preservation of tool and call IDs in Gemini request and response translations - Updated `ConvertGeminiRequestToClaude`, `ConvertGeminiRequestToCodex`, and their respective response counterparts to include logic for retaining and using tool/call IDs when present from gateway-provided inputs. - Enhanced pairing logic between function calls and responses to handle custom and auto-generated IDs consistently. - Introduced tests validating ID preservation and proper behavior in both streaming and non-streaming flows. Closes: #3878	2026-06-20 14:12:14 +08:00
Luis Pater	011ffe1dd5	feat(translator): enforce FIFO order in tool call ID consumption for Gemini requests - Updated `ConvertGeminiRequestToOpenAI` to consume `tool_call_id`s in FIFO order, ensuring consistent mapping between calls and responses. - Added fallback logic for unmatched or extra function responses to generate unique `tool_call_id`s with a `call_` prefix. - Introduced comprehensive unit tests to validate order enforcement, distinct ID assignment, and fallback behavior. Closes: #3874	2026-06-20 14:01:50 +08:00
Luis Pater	5771abbc81	feat(management): add `ResetQuota` endpoint and auth manager quota reset functionality - Introduced a new `/reset-quota` API endpoint in the management handler to clear quota and cooldown state for auth records. - Implemented `ResetQuota` method in the auth manager to handle runtime and registry state resets for affected models. - Added tests to validate quota reset behavior, including proper state updates and registry consistency. - Refactored utility functions to support deduplication and registered models handling in quota resets. Closes: #3866	2026-06-20 13:49:40 +08:00
Luis Pater	b4bec344f6	feat(translator): sanitize `parametersJsonSchema` in OpenAI to Gemini request handling - Applied `CleanJSONSchemaForGemini` to ensure compatibility by removing unsupported fields and cleaning schema requirements. - Added test cases to validate schema transformation, ensuring only necessary `required` fields are retained. - Enhanced both request and response flows to consistently clean and sanitize tool parameter schemas. Closes: #3863	2026-06-20 13:34:05 +08:00
Luis Pater	a79ae80f6e	feat(registry): improve model fallback logic and refactor Claude model handling - Replaced internal model-specific constants with exported defaults (`DefaultClaudeMaxInputTokens`, `DefaultClaudeMaxOutputTokens`). - Refactored Claude model formatting to centralize schema application in `formatHomeClaudeModel`, with fields like `type`, `display_name`, and token limits. - Added new unit tests for fallback behavior, metadata fields, and token handling in Claude and Gemini models. - Removed deprecated test coverage for Amp provider model routes. Closes: #3833	2026-06-20 13:09:44 +08:00
Luis Pater	b0ca3794ad	Merge pull request #3834 from dcrdev/main	2026-06-20 12:56:25 +08:00
Luis Pater	790ec307f9	feat(config): add support for `rebuild_mid_system_message` configuration - Introduced `RebuildMidSystemMessage` field in config to move system messages into the top-level Claude system field. - Updated executor to handle mid-system message rebuilding when enabled via config or auth attributes. - Added unit tests to verify rebuilding behavior and default behavior when disabled. - Updated configuration example and API handlers to support the new field. Closes: #3792	2026-06-20 12:15:06 +08:00
Luis Pater	f66376f0b9	feat(auth): add per-auth OAuth model alias support - Introduced `SetOAuthModelAliasesAttribute` and `OAuthModelAliasesFromAttributes` for managing per-auth model aliases. - Enhanced OAuth model resolution logic to prioritize per-auth aliases over global aliases. - Updated metadata handling to extract and sanitize per-account model aliases. - Added tests to validate alias precedence, empty attributes, and conflict scenarios. Closes: #3764	2026-06-20 11:31:57 +08:00
Luis Pater	379167c992	feat(translator): add benchmarking for `convertSystemRoleToDeveloper` with large inputs - Introduced `BenchmarkConvertSystemRoleToDeveloperLargeInput` to evaluate performance on various input sizes and configurations. - Improved `convertSystemRoleToDeveloper` logic to optimize JSON rebuilding and reduce unnecessary operations. - Created helper `makeLargeResponsesInputForBenchmark` to generate test data for performance scenarios. Closes: #3751	2026-06-20 11:10:53 +08:00
Luis Pater	28e2f9798c	feat(executor): add session isolation for `grok-composer` models - Introduced `xaiRequiresIsolatedConversation` to enforce session ID generation for `grok-composer` models. - Updated request preparation logic to handle isolated conversations by setting `prompt_cache_key` and `x-grok-conv-id`. - Added unit tests with coverage for session isolation, stateless models, and explicit `prompt_cache_key` scenarios. Closes: #3750	2026-06-20 10:54:16 +08:00
Luis Pater	bc652c7bf0	feat(translator): add support for `text.format` conversion in OpenAI to Gemini requests - Implemented `applyOpenAIResponsesTextFormatToGemini` to handle `json_object` and `json_schema` formats. - Updated generation config to set appropriate `responseMimeType` and optional `responseJsonSchema`. - Introduced unit tests to validate correct handling of `text.format` conversion scenarios. Closes: #3721	2026-06-20 10:14:39 +08:00
Luis Pater	75fa62653f	feat(executor): normalize `parallel_tool_calls` based on `tools` presence - Added `normalizeCodexParallelToolCallsForTools` to conditionally remove `parallel_tool_calls` when `tools` are missing or empty. - Integrated normalization into Codex executor workflows for improved request handling. - Introduced unit tests to validate behavior across different tool scenarios. Closes: #3903	2026-06-20 09:47:45 +08:00
Luis Pater	4c78e40da4	feat(auth): unify provider key handling with OpenAI compatibility support - Replaced direct `strings.ToLower` usage with `util.OpenAICompatibleProviderKey` for generating provider keys. - Updated auth and executor workflows to use namespaced keys for OpenAI-compatible providers. - Adjusted tests to validate namespaced key handling, including new test cases for provider registration and execution logic. - Added `OpenAICompatibleProviderKey` helper in `util` for consistent key transformations. Closes: #3600	2026-06-20 00:58:20 +08:00
Luis Pater	c44d4fcc7c	feat(schema): add removal of `$comment` and `enumDescriptions` in JSON schema processing - Updated JSON schema handling to remove `$comment` and `enumDescriptions` fields during schema transformations. - Adjusted test cases to validate the removal of these fields both at root and nested levels. - Expanded unsupported schema keywords to include `$comment` and `enumDescriptions` for Gemini compatibility. Closes: #3512	2026-06-20 00:22:08 +08:00
Luis Pater	34639c3cf9	feat(translator): defer Codex function call starts until function name is available - Updated `ConvertCodexResponseToClaude` to delay emitting `function_call` start events until the `name` field is resolved. - Introduced `pendingCodexFunctionCall` for buffering incomplete function calls. - Added tests to ensure proper behavior for deferred starts, including argument buffering and finalization. Closes: #3471	2026-06-20 00:11:12 +08:00
Luis Pater	51aa5ba925	feat(translator): preserve `input_audio` fields in OpenAI request conversions - Updated `ConvertOpenAIRequestToGemini` and `ConvertOpenAIRequestToCodex` to handle `input_audio`, retaining `data` and `format` fields. - Added helper `openAIInputAudioMimeType` for determining MIME types from audio formats. - Introduced unit tests to validate correct preservation of `input_audio` data and format. Closes: #3447	2026-06-20 00:03:54 +08:00
Luis Pater	1b849b6d8f	feat(translator): attach `reasoning_content` to assistant and tool messages in OpenAI response conversion - Enhanced `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to include `reasoning_content` in assistant and tool call messages. - Introduced `collectOpenAIResponsesReasoningContent` for aggregating reasoning summaries. - Added tests to validate reasoning attachment in various scenarios, including empty reasoning, tool calls, and reasoning followed by user messages. Closes: #3397	2026-06-19 21:36:55 +08:00
Luis Pater	041a065b2f	Merge branch 'remove-gemini-cli' into dev # Conflicts: # internal/api/handlers/management/auth_files.go # internal/thinking/provider/geminicli/apply.go	2026-06-19 14:40:29 +08:00
Luis Pater	aa2ad995a5	feat(translator): preserve `input_image` details in OpenAI response conversion - Updated handling in `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to retain `input_image` detail fields such as `image_url` and `detail`. - Added `TestConvertOpenAIResponsesRequestToOpenAIChatCompletions_PreservesInputImageDetail` to verify preservation of image details during transformation. Closes: #3385	2026-06-19 05:56:25 +08:00
Luis Pater	aed54adbea	feat(translator): preserve structured `tool_choice` in OpenAI response conversions - Updated `ConvertOpenAIResponsesRequestToOpenAIChatCompletions` to retain `tool_choice` with raw byte handling. - Added `TestConvertOpenAIResponsesRequestToOpenAIChatCompletions_PreservesStructuredToolChoice` to ensure function and type fields are preserved in transformations. Closes: #3384	2026-06-19 05:48:57 +08:00
Luis Pater	07c297a51b	feat(auth): add persistent cooldown state management with file-backed store - Introduced `CooldownStateStore` interface for managing independent cooldown state persistence. - Implemented `FileCooldownStateStore` for storing cooldown states as per-auth `.cds` files with atomic writes and stale file cleanup. - Enhanced `Manager` to support restoring state from `CooldownStateStore` and persisting state changes during auth updates. - Updated tests to validate cooldown state saving, loading, concurrency handling, and error scenarios. Closes: #3368	2026-06-19 05:34:44 +08:00
Luis Pater	d33ac5e1e9	feat(auth): add transient error cooldown configuration and adjust retry logic - Introduced `SetTransientErrorCooldownSeconds` to enable configurable cooldowns for transient errors (e.g., 408/500/502/503/504). - Updated retry scheduling logic to use the new `nextTransientErrorRetryAfter` function. - Modified config parsing to include `transient-error-cooldown-seconds` with support for disabling or defaulting to legacy behavior. - Expanded tests to validate transient cooldown logic with various configurations and edge cases. Closes: #3315	2026-06-19 04:26:04 +08:00
Luis Pater	4926630a60	feat(translator): support namespace tools in OpenAI response transformations - Added `convertResponsesToolToOpenAIChatTools` and helper methods to handle namespace tools during request conversions. - Enhanced response handling to restore namespace context for function calls using `applyResponsesFunctionCallNamespaceFields` and related utilities. - Updated tests to validate namespace flattening, function call restoration, and non-stream response handling. Closes: #3298	2026-06-19 04:12:51 +08:00
Luis Pater	893412e965	feat(translator): normalize `service_tier` in Codex requests and add tests - Added `normalizeCodexServiceTier` to standardize `service_tier` values (`fast` to `priority`, omit unsupported tiers). - Updated `ConvertClaudeRequestToCodex` to apply normalization logic. - Introduced `TestConvertClaudeRequestToCodex_ServiceTier` to validate behavior across various `service_tier` inputs. Closes: #3276 #3294	2026-06-19 03:58:55 +08:00
Luis Pater	c020e2d03f	feat(translator): drop `apply_patch` custom tool in OpenAI responses - Added logic in `ConvertOpenAIResponsesRequestToClaude` to exclude `apply_patch` custom tools. - Introduced `isOpenAIResponsesApplyPatchCustomTool` helper function to identify and filter the tool. - Added `TestConvertOpenAIResponsesRequestToClaude_DropsApplyPatchCustomTool` to validate the behavior. Closes: #3243	2026-06-19 03:43:26 +08:00
Luis Pater	1d0551a991	feat(config): improve config reload handling and introduce async management save hook - Refactored `ConfigReloadHook` to use `reloadConfigFromWatcher` for consistency. - Added async `reloadConfigAfterManagementSaveAsync` to handle post-save operations. - Introduced `ReloadConfigIfChanged` in watcher for manual trigger support. - Enhanced config reload paths to separate auth synthesis from standard updates. - Updated `applyConfigUpdate` logic to allow more granular reload behaviors. Closes: #3235	2026-06-19 03:35:09 +08:00
Luis Pater	ae6c5eaea5	feat(runtime): add support for `gpt-image-1.5` and direct image API proxying - Introduced the `gpt-image-2` model in Codex built-ins and updated visibility logic in the registry. - Added direct proxy support for OpenAI image generation and editing endpoints. - Implemented new execution paths for `/images/generations` and `/images/edit`, ensuring seamless handling for both JSON and multipart payloads. - Expanded test coverage to validate the new model and direct proxy features, including streaming scenarios and error handling.	2026-06-19 00:06:12 +08:00
Luis Pater	41c52b9df6	test(management): add concurrency test for Codex OAuth session handling - Introduced `TestRequestCodexTokenCompletionKeepsConcurrentSessionPending` to validate proper handling of concurrent OAuth sessions. - Refactored Codex OAuth logic to use `newCodexOAuthService` for improved testability. Closes: #3171	2026-06-18 23:15:43 +08:00
Luis Pater	c13dbcc24e	feat(translator): add test and logic to ensure `object` schemas include `properties` field - Added `TestConvertClaudeRequestToOpenAI_ToolSchemaAddsMissingObjectProperties` to validate automatic addition of missing `properties` in `object` schemas. - Introduced `normalizeObjectSchemaProperties` to recursively ensure schemas of type `object` include an empty `properties` field if absent. - Updated `ConvertClaudeRequestToOpenAI` to apply schema normalization for improved compatibility with OpenAI schema expectations. Closes: #3165	2026-06-18 22:54:20 +08:00
Luis Pater	ac8fb9706f	feat(thinking): remove `thinkingConfig` for `ModeNone` with zero budget and no level - Updated Gemini, Gemini CLI, and Antigravity logic to delete `thinkingConfig` when `ModeNone` is set, `Budget=0`, and `Level` is empty. - Adjusted tests to validate this behavior across multiple scenarios and models with zero-allowed configurations. - Extended test cases for additional coverage of mixed-model behavior. Closes: #3138	2026-06-18 22:39:02 +08:00
sususu98	ec8c2c2913	test(antigravity): cover invalid-signature replay cache clear Add executor httptest for upstream 400 clearing HOME reasoning replay items, and a whitespace call_id matcher regression test for replay filtering.	2026-06-18 22:31:50 +08:00
sususu98	c55157dc2e	fix(antigravity): PR review replay scope, signature merge, and tool keys - Derive replay session from generateStableSessionID when sessionId is absent - Merge cached thoughtSignature into existing functionCall by call id - Allow filter to pass function_call_part when only signature replay is needed - Include native functionCall id in replay dedupe keys - Add unit tests for signature merge, stable scope, and parallel tool ids	2026-06-18 20:29:44 +08:00
sususu98	ef19f5fc81	fix(antigravity): address review on replay call_id and args parsing Trim call_id once for matching-function-response checks; use args.Value() in synthetic model functionCall insertion; guard functionResponse lookup when call_id is empty.	2026-06-18 19:52:54 +08:00
sususu98	b17d29ad35	fix(antigravity): insert replayed functionCall before matching functionResponse When HOME replay restores a cached function_call_part and the request already contains the matching functionResponse, insert a synthetic role=model content with functionCall (and thoughtSignature) immediately before that response content instead of writing into the same index. Add regression tests for user/model functionResponse shapes.	2026-06-18 19:50:46 +08:00
sususu98	292456a884	feat(antigravity): HOME reasoning replay for Gemini models Add executor-scoped replay cache aligned with Codex HOME replay: Scope, observe SSE/non-stream responses, store normalized thought_signature and function_call_part items, apply on the next streamGenerateContent request, and invalidate on invalid signature responses. Gemini/flash/agent models use HOME replay; native per-part signature replay is not wired on upstream/dev. Wire non-stream and stream paths in antigravity_executor and purge expired entries from signature_cache. Includes unit tests and HOME-provider-replay documentation.	2026-06-18 19:50:28 +08:00
sususu98	62c4b377dd	Revert "feat(antigravity): HOME reasoning replay for Gemini models" This reverts commit `365e8fc2ca`.	2026-06-18 16:02:50 +08:00
sususu98	365e8fc2ca	feat(antigravity): HOME reasoning replay for Gemini models Add executor-scoped replay cache aligned with Codex HOME replay: Scope, observe SSE/non-stream responses, store normalized thought_signature and function_call_part items, apply on the next streamGenerateContent request, and invalidate on invalid signature responses. Gemini/flash/agent models use HOME replay; native per-part signature replay is not wired on upstream/dev. Wire non-stream and stream paths in antigravity_executor and purge expired entries from signature_cache. Includes unit tests and HOME-provider-replay documentation.	2026-06-18 14:37:11 +08:00
Luis Pater	78ba8ba731	chore: remove Gemini CLI-related translator packages and logic - Deleted `geminicli` provider and related `Apply` logic. - Removed all translator packages specific to Gemini CLI (Claude, Codex integrations). - Purged associated test files for Gemini CLI translation. - Removed `GeminiAuthenticator` and all associated authentication logic (OAuth flows, token handling, refresh logic). - Deleted internal/executor Gemini OAuth support, including bearer token handling and runtime API logic. - Purged all tests, configs, and command-line flags specific to Gemini OAuth flows. - Updated documentation and aliases to reflect Gemini removal. - Renamed `parseRetryDelay` to `ParseRetryDelay` and `deleteJSONField` to `DeleteJSONField`. - Updated references in `antigravity_executor` and tests to use the new `helps` package. - Adjusted import paths and test cases to ensure compatibility with the new location. - Updated README files to reflect changes in the retry logic references. - Updated `.github/ISSUE_TEMPLATE/bug_report.md` to remove deprecated Gemini CLI mention.	2026-06-18 13:33:10 +08:00
Luis Pater	dd49a52003	feat(translator): add tests to validate trailing assistant prefill stripping and sanitize tool call IDs - Introduced tests for `ConvertOpenAIRequestToGemini`, `ConvertOpenAIResponsesRequestToGemini`, and related Claude functions to ensure trailing model-prefill turns are removed. - Enhanced tool call ID handling with `util.SanitizeClaudeToolID` to standardize IDs in Claude-related conversions and tests. - Updated logic in Gemini and Claude translators to handle edge cases for trailing assistant prefill and tool ID sanitization, ensuring compatibility across input variants. Closes: #3113	2026-06-18 04:13:04 +08:00
Luis Pater	cde5081e94	test(translator): add tests to validate omission of top-level `output_text` in OpenAI responses - Added `TestConvertOpenAIChatCompletionsResponseToOpenAIResponses_CompletedOmitsTopLevelOutputText` to ensure `output_text` is excluded in streamed responses. - Added `TestConvertOpenAIChatCompletionsResponseToOpenAIResponses_ToolCallCompletedOmitsTopLevelOutputText` to validate behavior during tool call completions. - Introduced `TestConvertOpenAIChatCompletionsResponseToOpenAIResponsesNonStream_OmitsTopLevelOutputText` to confirm the omission of `output_text` in non-streamed responses. - Expanded test coverage to ensure consistency with native OpenAI responses.	2026-06-18 03:43:03 +08:00
Luis Pater	acaf250fa8	feat(management): add test to validate priority preservation in auth file uploads - Implemented `TestUploadAuthFile_PreservesPriorityAttributes` to ensure priority attributes and metadata are preserved during auth file uploads. - Updated `UploadAuthFile` logic to utilize `SynthesizeAuthFile` for better handling of generated auth attributes and metadata. Closes: #2924	2026-06-17 21:48:34 +08:00
Luis Pater	f23fb122e7	feat(translator): ensure tool uses stay adjacent to tool results in message generation - Refactored `ConvertOpenAIResponsesRequestToClaude` logic to align tool use with corresponding tool results. - Introduced helper functions for appending and flushing pending reasoning and tool use messages. - Expanded tests to validate message order and content consistency when processing tool calls and results.	2026-06-17 13:23:51 +08:00
Luis Pater	96a8b0cfe2	feat(executor): normalize reasoning text events and enhance handling logic - Introduced `xaiNormalizeReasoningSummaryData` and related functions to normalize `reasoning_text` events into `reasoning_summary` shapes for standardization. - Updated WebSocket and streaming logic to process normalized reasoning summary events correctly. - Enhanced tests to validate normalization, order of events, and output structure in both stream and non-stream scenarios.	2026-06-17 13:00:00 +08:00
sususu98	c296790801	feat(misc): align Antigravity runtime UA with agy CLI version sources Use the agy CLI User-Agent family (antigravity/cli/{version} darwin/arm64) on CPA macOS/arm64 hosts instead of the legacy hub-style antigravity/{version} string. Resolve the cached version from the CLI auto-updater manifest (darwin_arm64.json), then the GCS latest pointer, then antigravity-cli GCS prefix listing, with fallback 1.0.8 when all sources fail. Update AntigravityUserAgent helpers and executor default UA comment to match.	2026-06-17 10:30:52 +08:00
Luis Pater	0e81dee7e2	Merge pull request #3873 from router-for-me/thinking	2026-06-17 09:16:04 +08:00
hkfires	8c6f279f0a	refactor(tests): remove obsolete test files and update reasoning effort logic	2026-06-17 08:11:25 +08:00

1 2 3 4 5 ...

2063 Commits