CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-23 05:26:20 +08:00

Author	SHA1	Message	Date
Luis Pater	75fa62653f	feat(executor): normalize `parallel_tool_calls` based on `tools` presence - Added `normalizeCodexParallelToolCallsForTools` to conditionally remove `parallel_tool_calls` when `tools` are missing or empty. - Integrated normalization into Codex executor workflows for improved request handling. - Introduced unit tests to validate behavior across different tool scenarios. Closes: #3903	2026-06-20 09:47:45 +08:00
Luis Pater	c44d4fcc7c	feat(schema): add removal of `$comment` and `enumDescriptions` in JSON schema processing - Updated JSON schema handling to remove `$comment` and `enumDescriptions` fields during schema transformations. - Adjusted test cases to validate the removal of these fields both at root and nested levels. - Expanded unsupported schema keywords to include `$comment` and `enumDescriptions` for Gemini compatibility. Closes: #3512	2026-06-20 00:22:08 +08:00
Luis Pater	041a065b2f	Merge branch 'remove-gemini-cli' into dev # Conflicts: # internal/api/handlers/management/auth_files.go # internal/thinking/provider/geminicli/apply.go	2026-06-19 14:40:29 +08:00
Luis Pater	ae6c5eaea5	feat(runtime): add support for `gpt-image-1.5` and direct image API proxying - Introduced the `gpt-image-2` model in Codex built-ins and updated visibility logic in the registry. - Added direct proxy support for OpenAI image generation and editing endpoints. - Implemented new execution paths for `/images/generations` and `/images/edit`, ensuring seamless handling for both JSON and multipart payloads. - Expanded test coverage to validate the new model and direct proxy features, including streaming scenarios and error handling.	2026-06-19 00:06:12 +08:00
sususu98	62c4b377dd	Revert "feat(antigravity): HOME reasoning replay for Gemini models" This reverts commit `365e8fc2ca`.	2026-06-18 16:02:50 +08:00
sususu98	365e8fc2ca	feat(antigravity): HOME reasoning replay for Gemini models Add executor-scoped replay cache aligned with Codex HOME replay: Scope, observe SSE/non-stream responses, store normalized thought_signature and function_call_part items, apply on the next streamGenerateContent request, and invalidate on invalid signature responses. Gemini/flash/agent models use HOME replay; native per-part signature replay is not wired on upstream/dev. Wire non-stream and stream paths in antigravity_executor and purge expired entries from signature_cache. Includes unit tests and HOME-provider-replay documentation.	2026-06-18 14:37:11 +08:00
Luis Pater	78ba8ba731	chore: remove Gemini CLI-related translator packages and logic - Deleted `geminicli` provider and related `Apply` logic. - Removed all translator packages specific to Gemini CLI (Claude, Codex integrations). - Purged associated test files for Gemini CLI translation. - Removed `GeminiAuthenticator` and all associated authentication logic (OAuth flows, token handling, refresh logic). - Deleted internal/executor Gemini OAuth support, including bearer token handling and runtime API logic. - Purged all tests, configs, and command-line flags specific to Gemini OAuth flows. - Updated documentation and aliases to reflect Gemini removal. - Renamed `parseRetryDelay` to `ParseRetryDelay` and `deleteJSONField` to `DeleteJSONField`. - Updated references in `antigravity_executor` and tests to use the new `helps` package. - Adjusted import paths and test cases to ensure compatibility with the new location. - Updated README files to reflect changes in the retry logic references. - Updated `.github/ISSUE_TEMPLATE/bug_report.md` to remove deprecated Gemini CLI mention.	2026-06-18 13:33:10 +08:00
Luis Pater	96a8b0cfe2	feat(executor): normalize reasoning text events and enhance handling logic - Introduced `xaiNormalizeReasoningSummaryData` and related functions to normalize `reasoning_text` events into `reasoning_summary` shapes for standardization. - Updated WebSocket and streaming logic to process normalized reasoning summary events correctly. - Enhanced tests to validate normalization, order of events, and output structure in both stream and non-stream scenarios.	2026-06-17 13:00:00 +08:00
sususu98	c296790801	feat(misc): align Antigravity runtime UA with agy CLI version sources Use the agy CLI User-Agent family (antigravity/cli/{version} darwin/arm64) on CPA macOS/arm64 hosts instead of the legacy hub-style antigravity/{version} string. Resolve the cached version from the CLI auto-updater manifest (darwin_arm64.json), then the GCS latest pointer, then antigravity-cli GCS prefix listing, with fallback 1.0.8 when all sources fail. Update AntigravityUserAgent helpers and executor default UA comment to match.	2026-06-17 10:30:52 +08:00
Luis Pater	b9d024af49	feat(executor): handle usage limit errors and enhance retry logic - Added `isCodexUsageLimitError` to detect and handle `usage_limit_reached` errors from Codex responses. - Updated `newCodexStatusErr` to treat usage limit errors as HTTP 429 with proper `RetryAfter` handling. - Enhanced test coverage to validate usage limit error handling, including reset time parsing and retry behavior. Closes: #2886	2026-06-17 07:19:02 +08:00
Luis Pater	8fad0d0325	feat(config+executor): add global Claude cloak mode toggle and improve credential fallback logic - Introduced `disable-claude-cloak-mode` configuration to globally disable Claude cloak mode with credential-level overrides. - Enhanced `getCloakConfigFromAuth` to support fallback to metadata for cloak settings. - Updated cloak configuration precedence logic, integrating global, credential, and default modes. - Updated config and watcher diff handling to include `disable-claude-cloak-mode`. Closes: #2789	2026-06-16 13:03:16 +08:00
Luis Pater	844b855974	feat(executor): sanitize web search tool domains to meet Anthropic requirements - Added `sanitizeClaudeWebSearchDomains` to remove empty `allowed_domains` and `blocked_domains` fields for built-in web_search tools, addressing ambiguity errors from Anthropic. - Integrated domain sanitization into the Claude message preparation pipeline. - Added test cases to validate correct handling of empty and non-empty domain fields across various tool types. Closes: #2681	2026-06-16 03:29:44 +08:00
Luis Pater	f33bc56bb9	feat(websockets): add transcript state tracking and compaction trigger support - Added methods for managing and tracking WebSocket transcript state, including recording, prepending, and replacing transcript inputs. - Implemented `executeCompactionTriggerFromWebsocketContext` to support compaction triggers using recorded transcript context. - Enhanced upstream-downstream ID mapping with additional utilities and state synchronization. - Expanded test coverage to validate transcript state management, compaction payload generation, and WebSocket response handling.	2026-06-15 10:41:35 +08:00
Luis Pater	ea90ab6f77	feat(websockets): implement XAIWebsocketsExecutor with enhanced execution and ID mapping - Developed `XAIWebsocketsExecutor` for handling xAI Responses via WebSocket transport. - Introduced session and state management with `codexWebsocketSessionStore` and `xaiWebsocketIDStateStore`. - Added robust ID mapping for upstream and downstream request/response sequences. - Enhanced error propagation and handling of WebSocket terminal events. - Included utility methods for WebSocket request preparation, connection management, and state tracking. - Added foundational support for compact and streamed responses via enhanced session tracking.	2026-06-15 08:22:07 +08:00
Luis Pater	3b96119050	feat(websockets): handle terminal events and improve error propagation - Enhanced Codex Websockets Executor to capture `response.done` as a terminal event, alongside `response.completed` and `error`. - Improved error propagation for upstream websocket errors with comprehensive message handling. - Introduced utility functions for recognizing terminal events and extracting error messages. - Expanded tests to validate new websocket event logic, including terminal event handling and upstream error propagation.	2026-06-15 01:03:19 +08:00
Luis Pater	529d9e92c9	feat(executor): add support for compact response handling in XAIExecutor - Introduced `executeCompact` to handle non-streaming compact responses via the `/responses/compact` endpoint. - Added `executeCompactionTriggerStream` for streaming responses triggered by `compaction_trigger`. - Enhanced request preparation with `prepareResponsesRequestTo` for dynamic response formats. - Updated logic to bypass streaming for `/responses/compact` and added fallback behaviors. - Added comprehensive tests for compact response handling and event streaming validations.	2026-06-15 00:29:38 +08:00
Luis Pater	2a050dc95d	feat: enhance fault tolerance for kv-based caching and introduce additional tests - Updated Antigravity Credits fallback to handle KV store unavailability as a service error. - Enhanced signature caching mechanisms with request-time KV access and sliding expiration. - Added and improved tests for KV client interactions, including error handling and expiration behaviors. - Introduced `CacheSignatureBestEffort` for non-critical signature caching and clarified function flows with required context. - Ensured consistent error reporting for missing or unavailable KV stores in various scenarios. - Replaced direct `homekv` calls with injectable KV client interfaces for `antigravity` and `codex_reasoning_replay` modules. - Improved error reporting and handling for KV operations, including `KVGet`, `KVSet`, `KVDel`, and `KVExpire`. - Introduced dedicated fake KV clients for expanded and granular test coverage. - Added new unit tests to validate KV client behaviors and error scenarios, ensuring robustness and sliding expiration functionality.	2026-06-14 21:11:35 +08:00
hkfires	8122b9fe4b	feat!: remove amp integration support BREAKING CHANGE: ampcode configuration, management endpoints, provider routing, and X-Amp-Thread-Id session affinity are no longer supported	2026-06-14 20:31:00 +08:00
Hao Wang	8d4a7f1f2e	feat(config): add "passthrough" mode for disable-image-generation Adds a fourth value for the disable-image-generation setting: - false: inject image_generation (unchanged) - true: strip everywhere + 404 on /v1/images/* (unchanged) - chat: strip on non-images endpoints, keep /v1/images/* (unchanged) - passthrough: never inject and never strip on non-images endpoints (the client payload is forwarded unchanged); behaves like "chat" on /v1/images/* endpoints. image_generation injection (codex executors) is already gated on the Off mode, and the /v1/images/* 404 gate is already gated on the All mode, so passthrough only required a change to the payload strip logic in payload_helpers.go, now expressed via shouldStripImageGeneration(). Closes #3831 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-13 22:41:15 +08:00
sususu98	48dcadd9ef	feat(antigravity): bridge Claude WebSearch to native googleSearch Add a native Antigravity WebSearch path for Claude typed WebSearch requests. Detect Claude Messages requests whose tools are only typed WebSearch tools (web_search_20250305 / web_search_20260209), and convert them into an Antigravity requestType=web_search payload instead of sending the request through the normal tool-calling path. Preserve the user's requested model. The native path is enabled only when that Antigravity model is known to support Google Search. Capability data fetched from Antigravity model info is used only as an enhancement to the local model registry, not as a replacement for the existing registry fallback behavior. Unsupported models keep the existing Antigravity request behavior and are not silently rerouted to another web-search-capable model. Translate Claude WebSearch request options to the verified Antigravity googleSearch shape: - max_uses -> googleSearch.enhancedContent.imageSearch.maxResultCount - allowed_domains -> googleSearch.includedDomains Leave blocked_domains and user_location unmapped because the Antigravity googleSearch request shape has no verified equivalent for them. This avoids sending speculative fields or pretending unsupported Claude WebSearch options are enforced upstream. Translate Antigravity web-search responses back into Claude-compatible output: server_tool_use blocks, web_search_tool_result blocks, cited text blocks, grounding URLs, and usage-compatible stream/non-stream responses. Cover the behavior with tests for request conversion, response conversion, grounding URL resolution, domain filter mapping, fetched capability hints, excluded-model handling, and unsupported-model behavior.	2026-06-13 10:07:48 +08:00
Luis Pater	8e39db2ec7	feat(plugin, api): introduce host model callback support with Go example and API handlers - Added an example plugin `host-model-callback` in Go to summarize host model callbacks. - Implemented `cliproxy_plugin_init`, `cliproxyPluginCall`, and other plugin functions for callback handling. - Introduced API handlers for `ModelExecution` and `ModelExecutionStream` with support for both streaming and non-streaming requests. - Included unit tests (`model_execution_test.go`) to validate execution logic and streaming responses.	2026-06-12 02:22:23 +08:00
Luis Pater	8e52c403f7	feat(auth): deduplicate concurrent refresh token requests with `singleflight` - Introduced `singleflight.Group` to prevent redundant token refresh calls across multiple auth implementations (`antigravity`, `kimi`, `xai`, `codex`). - Added tests to verify shared upstream calls during concurrent refresh requests. - Refactored token refresh logic to centralize and standardize deduplication mechanisms.	2026-06-10 03:19:26 +08:00
Luis Pater	5e41e079e5	fix(runtime): update formatting in codex image extraction comment	2026-06-09 02:20:57 +08:00
Folyd	2e81766c92	perf(codex): preallocate results and skip empty index sort Apply review feedback on codexExtractImageResults: preallocate the results slice to its known maximum capacity to avoid growth reallocations, and guard the itemsByIndex index-build/sort with a length check so no empty slice is allocated or sorted when only the fallback items are present.	2026-06-08 15:55:42 +00:00
Folyd	4330b92612	perf(codex): avoid rebuilding completed JSON when extracting generated images The OpenAI images path (/v1/images/*) previously called patchCodexCompletedOutput to concatenate collected output_item.done items back into the completed event and then re-parsed that rebuilt JSON to pull out the image results. For multi-megabyte base64 image payloads this produced two extra full-size copies per request (the concatenated output array plus the rebuilt completed event), inflating peak memory under concurrent image generation. Add codexExtractImageResults, which extracts image_generation_call results directly from either the completed event's response.output or the collected items, without the concatenate-and-reparse step. Semantics are preserved: completed output is preferred and collected items are used only when it is empty, matching the original patchCodexCompletedOutput behaviour. patchCodexCompletedOutput remains in use by the text/responses path, which still forwards the patched event downstream. Adds unit tests covering the completed-output path, the ordered fallback to collected items, output preference, fallback list, and the wrong-event-type guard.	2026-06-08 15:47:14 +00:00
Luis Pater	5753d1a089	feat(logging): enable file-backed request/response sources for enhanced API logging - Introduced support for file-backed logging of API requests and responses to handle large payloads efficiently. - Refactored `attachWebsocketLogSources` to `attachRequestLogSources` for broader request and response handling. - Added new methods for appending request/response data to file-backed sources and updated existing logging workflows for compatibility. - Improved cleanup and merge logic for file-backed sources during request processing. - Updated tests to cover newly introduced file-backed logging functionality.	2026-06-05 01:48:05 +08:00
Luis Pater	387c783b32	Merge pull request #3649 from intcua/fix/xai-empty-tools-orphan-tool-choice fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty	2026-06-04 13:11:23 +08:00
sususu98	17af089189	fix(codex): avoid replaying orphan tool calls	2026-06-03 09:52:17 +08:00
Luis Pater	35ab084fc3	refactor(runtime): enhance `NewUtlsHTTPClient` with context-based RoundTripper - Updated `NewUtlsHTTPClient` to support context-aware RoundTrippers for protected hosts (e.g., Cloudflare bypass). - Replaced `anthropicHosts` with `utlsProtectedHosts` to generalize host handling logic. - Added unit test to validate context-based RoundTripper behavior. - Replaced `NewProxyAwareHTTPClient` with `NewUtlsHTTPClient` in relevant executors for improved TLS fingerprinting. Closes: #3680	2026-06-03 06:58:26 +08:00
Luis Pater	02d0d92a8e	Merge pull request #3677 from sususu98/codex/home-auth-loop-upstream-dev Fix Home auth refresh retry handling	2026-06-02 19:30:14 +08:00
sususu98	603a08fc1a	feat(codex): cache reasoning replay items	2026-06-02 16:08:40 +08:00
sususu98	c9dc6bd628	Fix Home auth refresh retry handling Parse Home refresh auth envelopes so refreshed access tokens are used instead of returning missing access token. Stop retrying when Home dispatch returns an auth that already failed within the same request.	2026-06-02 13:43:07 +08:00
Luis Pater	959067edfb	feat(usage): introduce executor type tracking in usage reporting - Replaced `NewUsageReporter` with `NewExecutorUsageReporter` to include executor type in usage records. - Updated all executors to use the new reporter implementation. - Extended `UsageReporter` to track and publish executor type. - Added tests to validate proper executor type recording and handling. - Enhanced RedisQueue plugin and payload schema with executor type support.	2026-06-02 00:43:16 +08:00
Luis Pater	05b972479a	feat(executor): refine session and conversation header handling for Codex - Updated session handling to replace `Session_id` and `Conversation_id` headers with new logic ensuring consistent use of `Cache.ID` and prompt keys. - Restored `Session_id` as a priority extraction source for `ExtractSessionID`. - Added tests to validate case-sensitive and case-insensitive headers, canonical account header usage, and session key preservation. - Removed legacy support for deprecated `Conversation_id` header to clean up API.	2026-06-01 11:27:10 +08:00
Luis Pater	fb4f39d300	test(models, executor): add XAI video model test and fix Codex User-Agent assertions	2026-06-01 02:59:31 +08:00
Luis Pater	bbcdaab79d	feat(executor): enhance Codex identity obfuscation with turn and window metadata handling - Modified `applyCodexIdentityConfuse*` functions to include `turn_id` and `window_id` in metadata transformations. - Updated test cases to validate the inclusion and restoration of these fields. - Removed deprecated `Conversation_id` header support and related logic for cleaner implementation.	2026-06-01 00:50:46 +08:00
lamtran	303685c230	fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty When Claude Code sends a stop-hook evaluator request (or any request without tools), the payload includes "tools": [] (empty array). The claude->codex translator unconditionally emits tools: [] + tool_choice: "auto" + parallel_tool_calls: true into the Codex Responses shape. When that payload is routed to xAI, the upstream rejects with HTTP 400: "A tool_choice was set on the request but no tools were specified." Fix entirely in the xAI executor (translator package is policy-locked): add normalizeXAIToolChoiceForTools() after normalizeXAITools() to drop tool_choice and parallel_tool_calls whenever tools end up absent or empty (covering both the empty-from-source case and the all-filtered-out case where every tool was an unsupported type such as tool_search or image_generation). Per code-review feedback: always remove parallel_tool_calls when tools are missing (not gated on tool_choice presence) and existence-check each key before sjson delete to avoid unnecessary JSON parse/copy. Verification: - go build -o test-output ./cmd/server - go test ./internal/runtime/executor/... -count=1 - 5 new regression tests cover empty / missing / present / orphaned parallel_tool_calls / no-op-when-both-absent.	2026-05-31 23:13:15 +07:00
Luis Pater	0f24cafbdd	feat(executor): implement identity obfuscation for Codex requests and responses - Added `applyCodexIdentityConfuse*` functions for remapping request and response payloads and headers to enhance security. - Updated WebSocket and HTTP logic to handle identity state transformations seamlessly. - Introduced unit tests to verify remapping and restoration of identity-related fields.	2026-05-31 23:31:35 +08:00
Luis Pater	33983b6f3e	refactor(executor): consolidate Codex request translation logic - Introduced `translateCodexRequestPair` to simplify and reuse translation logic for handling original and modified payloads. - Updated relevant methods to use the new function. - Added unit tests to cover payload reuse and differentiation scenarios.	2026-05-31 14:38:54 +08:00
sususu98	aee7a5fbc5	feat: intercept incompatible signature replay	2026-05-29 15:22:57 +08:00
Luis Pater	71c185f614	feat(usage): add service tier tracking and defaults in usage reporting - Introduced `service_tier` metadata key to capture client-requested service tiers. - Updated usage records, context propagation, and plugins to include service tier data. - Added default handling logic for cases where `service_tier` is absent. - Implemented tests for `service_tier` extraction, defaults, and updates across components.	2026-05-28 22:15:54 +08:00
Luis Pater	65e760aa1a	feat(usage): include cache tokens in total token calculation and add tests - Updated `TotalTokens` calculation to account for `CacheReadTokens` and `CacheCreationTokens`. - Added tests to validate accurate token aggregation and fallback behavior for `CachedTokens`.	2026-05-28 21:34:54 +08:00
Luis Pater	94c1b25146	feat(executor): add TTFT tracking and reporting for enhanced performance metrics - Introduced Time-To-First-Token (TTFT) measurement and reporting across major executors. - Added TTFT calculation to `UsageReporter`, including support for HTTP clients and WebSocket communication. - Updated tests to validate TTFT tracking in streamed and non-streamed scenarios. - Ensured integration with `usage` plugin and augmented usage records with TTFT data.	2026-05-28 02:59:24 +08:00
Luis Pater	11f0f906bd	feat(logging): add `SetTranslatedReasoningEffort` to track reasoning levels in usage reporting - Introduced `SetTranslatedReasoningEffort` method in `UsageReporter` to capture and log reasoning efforts from translated payloads. - Updated executors to incorporate the new reporting functionality for handling reasoning efforts across various providers. - Enhanced logging for thinking level extraction with new helper function `ExtractTranslatedReasoningEffort`.	2026-05-28 02:19:45 +08:00
Luis Pater	e399edd3cc	feat(images): add support for configurable GPT Image 2 base model and improved SSE handling - Introduced `GPTImage2BaseModel` configuration for hosted image generation tools with validation for "gpt-" prefix. - Added logic to dynamically resolve and apply the base model in Codex executor workflows. - Enhanced server-sent events (SSE) implementation with keep-alive tickers and error events for stream reliability. - Updated configuration file examples and internal documentation.	2026-05-27 00:47:02 +08:00
sususu98	4a85b6b97e	fix: log gemini cli schema cleanup errors	2026-05-26 10:52:53 +08:00
sususu98	70a8cf026f	fix: clean gemini cli request schemas	2026-05-26 10:39:37 +08:00
Luis Pater	a0bb1f3a2b	feat(logging): add file-backed sources for request logging - Introduced `FileBodySource` to support large request log sections stored in temp files. - Added file-backed support for WebSocket timeline and API WebSocket timeline logging. - Updated `LogRequest` and middleware to integrate optional file-backed sources. - Implemented clean-up mechanisms to manage temporary log files after processing.	2026-05-25 21:55:16 +08:00
Luis Pater	48a1c88115	Merge pull request #3476 from sususu98/fix/codex-context-length-stream-errors-dev fix codex context length stream errors	2026-05-21 02:53:54 +08:00
Luis Pater	42e9605871	Merge pull request #3254 from sususu98/fix/antigravity-project-id-onboard fix: require antigravity project id	2026-05-21 02:52:32 +08:00

1 2 3 4 5 ...

626 Commits