CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-10 16:22:59 +08:00

Author	SHA1	Message	Date
Luis Pater	8e52c403f7	feat(auth): deduplicate concurrent refresh token requests with `singleflight` - Introduced `singleflight.Group` to prevent redundant token refresh calls across multiple auth implementations (`antigravity`, `kimi`, `xai`, `codex`). - Added tests to verify shared upstream calls during concurrent refresh requests. - Refactored token refresh logic to centralize and standardize deduplication mechanisms.	2026-06-10 03:19:26 +08:00
Luis Pater	5e41e079e5	fix(runtime): update formatting in codex image extraction comment	2026-06-09 02:20:57 +08:00
Folyd	2e81766c92	perf(codex): preallocate results and skip empty index sort Apply review feedback on codexExtractImageResults: preallocate the results slice to its known maximum capacity to avoid growth reallocations, and guard the itemsByIndex index-build/sort with a length check so no empty slice is allocated or sorted when only the fallback items are present.	2026-06-08 15:55:42 +00:00
Folyd	4330b92612	perf(codex): avoid rebuilding completed JSON when extracting generated images The OpenAI images path (/v1/images/*) previously called patchCodexCompletedOutput to concatenate collected output_item.done items back into the completed event and then re-parsed that rebuilt JSON to pull out the image results. For multi-megabyte base64 image payloads this produced two extra full-size copies per request (the concatenated output array plus the rebuilt completed event), inflating peak memory under concurrent image generation. Add codexExtractImageResults, which extracts image_generation_call results directly from either the completed event's response.output or the collected items, without the concatenate-and-reparse step. Semantics are preserved: completed output is preferred and collected items are used only when it is empty, matching the original patchCodexCompletedOutput behaviour. patchCodexCompletedOutput remains in use by the text/responses path, which still forwards the patched event downstream. Adds unit tests covering the completed-output path, the ordered fallback to collected items, output preference, fallback list, and the wrong-event-type guard.	2026-06-08 15:47:14 +00:00
Luis Pater	5753d1a089	feat(logging): enable file-backed request/response sources for enhanced API logging - Introduced support for file-backed logging of API requests and responses to handle large payloads efficiently. - Refactored `attachWebsocketLogSources` to `attachRequestLogSources` for broader request and response handling. - Added new methods for appending request/response data to file-backed sources and updated existing logging workflows for compatibility. - Improved cleanup and merge logic for file-backed sources during request processing. - Updated tests to cover newly introduced file-backed logging functionality.	2026-06-05 01:48:05 +08:00
Luis Pater	387c783b32	Merge pull request #3649 from intcua/fix/xai-empty-tools-orphan-tool-choice fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty	2026-06-04 13:11:23 +08:00
sususu98	17af089189	fix(codex): avoid replaying orphan tool calls	2026-06-03 09:52:17 +08:00
Luis Pater	35ab084fc3	refactor(runtime): enhance `NewUtlsHTTPClient` with context-based RoundTripper - Updated `NewUtlsHTTPClient` to support context-aware RoundTrippers for protected hosts (e.g., Cloudflare bypass). - Replaced `anthropicHosts` with `utlsProtectedHosts` to generalize host handling logic. - Added unit test to validate context-based RoundTripper behavior. - Replaced `NewProxyAwareHTTPClient` with `NewUtlsHTTPClient` in relevant executors for improved TLS fingerprinting. Closes: #3680	2026-06-03 06:58:26 +08:00
Luis Pater	02d0d92a8e	Merge pull request #3677 from sususu98/codex/home-auth-loop-upstream-dev Fix Home auth refresh retry handling	2026-06-02 19:30:14 +08:00
sususu98	603a08fc1a	feat(codex): cache reasoning replay items	2026-06-02 16:08:40 +08:00
sususu98	c9dc6bd628	Fix Home auth refresh retry handling Parse Home refresh auth envelopes so refreshed access tokens are used instead of returning missing access token. Stop retrying when Home dispatch returns an auth that already failed within the same request.	2026-06-02 13:43:07 +08:00
Luis Pater	959067edfb	feat(usage): introduce executor type tracking in usage reporting - Replaced `NewUsageReporter` with `NewExecutorUsageReporter` to include executor type in usage records. - Updated all executors to use the new reporter implementation. - Extended `UsageReporter` to track and publish executor type. - Added tests to validate proper executor type recording and handling. - Enhanced RedisQueue plugin and payload schema with executor type support.	2026-06-02 00:43:16 +08:00
Luis Pater	05b972479a	feat(executor): refine session and conversation header handling for Codex - Updated session handling to replace `Session_id` and `Conversation_id` headers with new logic ensuring consistent use of `Cache.ID` and prompt keys. - Restored `Session_id` as a priority extraction source for `ExtractSessionID`. - Added tests to validate case-sensitive and case-insensitive headers, canonical account header usage, and session key preservation. - Removed legacy support for deprecated `Conversation_id` header to clean up API.	2026-06-01 11:27:10 +08:00
Luis Pater	fb4f39d300	test(models, executor): add XAI video model test and fix Codex User-Agent assertions	2026-06-01 02:59:31 +08:00
Luis Pater	bbcdaab79d	feat(executor): enhance Codex identity obfuscation with turn and window metadata handling - Modified `applyCodexIdentityConfuse*` functions to include `turn_id` and `window_id` in metadata transformations. - Updated test cases to validate the inclusion and restoration of these fields. - Removed deprecated `Conversation_id` header support and related logic for cleaner implementation.	2026-06-01 00:50:46 +08:00
lamtran	303685c230	fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty When Claude Code sends a stop-hook evaluator request (or any request without tools), the payload includes "tools": [] (empty array). The claude->codex translator unconditionally emits tools: [] + tool_choice: "auto" + parallel_tool_calls: true into the Codex Responses shape. When that payload is routed to xAI, the upstream rejects with HTTP 400: "A tool_choice was set on the request but no tools were specified." Fix entirely in the xAI executor (translator package is policy-locked): add normalizeXAIToolChoiceForTools() after normalizeXAITools() to drop tool_choice and parallel_tool_calls whenever tools end up absent or empty (covering both the empty-from-source case and the all-filtered-out case where every tool was an unsupported type such as tool_search or image_generation). Per code-review feedback: always remove parallel_tool_calls when tools are missing (not gated on tool_choice presence) and existence-check each key before sjson delete to avoid unnecessary JSON parse/copy. Verification: - go build -o test-output ./cmd/server - go test ./internal/runtime/executor/... -count=1 - 5 new regression tests cover empty / missing / present / orphaned parallel_tool_calls / no-op-when-both-absent.	2026-05-31 23:13:15 +07:00
Luis Pater	0f24cafbdd	feat(executor): implement identity obfuscation for Codex requests and responses - Added `applyCodexIdentityConfuse*` functions for remapping request and response payloads and headers to enhance security. - Updated WebSocket and HTTP logic to handle identity state transformations seamlessly. - Introduced unit tests to verify remapping and restoration of identity-related fields.	2026-05-31 23:31:35 +08:00
Luis Pater	33983b6f3e	refactor(executor): consolidate Codex request translation logic - Introduced `translateCodexRequestPair` to simplify and reuse translation logic for handling original and modified payloads. - Updated relevant methods to use the new function. - Added unit tests to cover payload reuse and differentiation scenarios.	2026-05-31 14:38:54 +08:00
sususu98	aee7a5fbc5	feat: intercept incompatible signature replay	2026-05-29 15:22:57 +08:00
Luis Pater	71c185f614	feat(usage): add service tier tracking and defaults in usage reporting - Introduced `service_tier` metadata key to capture client-requested service tiers. - Updated usage records, context propagation, and plugins to include service tier data. - Added default handling logic for cases where `service_tier` is absent. - Implemented tests for `service_tier` extraction, defaults, and updates across components.	2026-05-28 22:15:54 +08:00
Luis Pater	65e760aa1a	feat(usage): include cache tokens in total token calculation and add tests - Updated `TotalTokens` calculation to account for `CacheReadTokens` and `CacheCreationTokens`. - Added tests to validate accurate token aggregation and fallback behavior for `CachedTokens`.	2026-05-28 21:34:54 +08:00
Luis Pater	94c1b25146	feat(executor): add TTFT tracking and reporting for enhanced performance metrics - Introduced Time-To-First-Token (TTFT) measurement and reporting across major executors. - Added TTFT calculation to `UsageReporter`, including support for HTTP clients and WebSocket communication. - Updated tests to validate TTFT tracking in streamed and non-streamed scenarios. - Ensured integration with `usage` plugin and augmented usage records with TTFT data.	2026-05-28 02:59:24 +08:00
Luis Pater	11f0f906bd	feat(logging): add `SetTranslatedReasoningEffort` to track reasoning levels in usage reporting - Introduced `SetTranslatedReasoningEffort` method in `UsageReporter` to capture and log reasoning efforts from translated payloads. - Updated executors to incorporate the new reporting functionality for handling reasoning efforts across various providers. - Enhanced logging for thinking level extraction with new helper function `ExtractTranslatedReasoningEffort`.	2026-05-28 02:19:45 +08:00
Luis Pater	e399edd3cc	feat(images): add support for configurable GPT Image 2 base model and improved SSE handling - Introduced `GPTImage2BaseModel` configuration for hosted image generation tools with validation for "gpt-" prefix. - Added logic to dynamically resolve and apply the base model in Codex executor workflows. - Enhanced server-sent events (SSE) implementation with keep-alive tickers and error events for stream reliability. - Updated configuration file examples and internal documentation.	2026-05-27 00:47:02 +08:00
sususu98	4a85b6b97e	fix: log gemini cli schema cleanup errors	2026-05-26 10:52:53 +08:00
sususu98	70a8cf026f	fix: clean gemini cli request schemas	2026-05-26 10:39:37 +08:00
Luis Pater	a0bb1f3a2b	feat(logging): add file-backed sources for request logging - Introduced `FileBodySource` to support large request log sections stored in temp files. - Added file-backed support for WebSocket timeline and API WebSocket timeline logging. - Updated `LogRequest` and middleware to integrate optional file-backed sources. - Implemented clean-up mechanisms to manage temporary log files after processing.	2026-05-25 21:55:16 +08:00
Luis Pater	48a1c88115	Merge pull request #3476 from sususu98/fix/codex-context-length-stream-errors-dev fix codex context length stream errors	2026-05-21 02:53:54 +08:00
Luis Pater	42e9605871	Merge pull request #3254 from sususu98/fix/antigravity-project-id-onboard fix: require antigravity project id	2026-05-21 02:52:32 +08:00
yavon007	0de0ad0d36	Add reasoning effort to usage events	2026-05-19 22:10:48 +08:00
sususu98	ad868308c0	fix codex context length stream errors	2026-05-19 16:05:40 +08:00
Luis Pater	feebe6c7f2	feat(api): add OpenAI compatibility for image models - Introduced OpenAI-compatible image model support in the API, enabling integration through image generation and editing endpoints. - Added registry type for OpenAIImageModelType to classify and validate compatibility. - Implemented request handling for OpenAI-compatible image models, including JSON and multipart formats. - Enhanced executor methods to support OpenAI-compatible image streaming and non-streaming requests. - Included tests to validate model registration, streaming behavior, and multipart payload formatting.	2026-05-19 10:13:26 +08:00
sususu98	644823529f	Merge pull request #3469 from sususu98/fix/gemini-max-output-token-cap Cap Gemini max output tokens	2026-05-19 09:48:08 +08:00
Luis Pater	bac006e72b	feat(thinking): add xAI provider support with reasoning.effort implementation - Implemented `xAI` provider for thinking configurations with support for reasoning.effort levels. - Registered `xAI` in available providers and updated relevant APIs for compatibility. - Added unit tests for `xAI` provider functionality, including fallback logic for unsupported levels. - Integrated `xAI` with executor handling and ensured conformance with OpenAI-compatible standards.	2026-05-19 03:09:53 +08:00
Luis Pater	ad98c9549a	feat(runtime): track upstream response headers in logging and usage reporting - Added APIs to store, retrieve, and clone upstream response headers in context for detailed logging. - Updated `RecordAPIResponseMetadata`, `RecordAPIWebsocketHandshake`, and related methods to capture response headers. - Extended `UsageReporter` to include response headers in published usage records. - Enhanced payload tests to validate response headers' integrity and persistence. - Refactored `usage.Record` to support optional `ResponseHeaders` field.	2026-05-19 01:29:23 +08:00
sususu98	1583cb4ef0	Cap Gemini max output tokens	2026-05-18 18:41:45 +08:00
sususu98	ec79951e7f	fix(proxy): support HTTP CONNECT dialer	2026-05-18 12:20:41 +08:00
Luis Pater	9ef99aa766	refactor(runtime): rename `FormProtocol` to `FromProtocol` across payload handling logic - Updated variable, function, and struct names from `FormProtocol` to `FromProtocol` for clarity. - Adjusted related payload matching and normalization logic. - Updated tests and examples to align with the new naming convention.	2026-05-17 23:39:07 +08:00
Luis Pater	2007a89594	feat(runtime): enhance payload rule resolution with dynamic path support - Introduced `resolvePayloadRulePaths` function to dynamically resolve rule paths supporting array queries and complex logic. - Updated payload processing logic (`apply defaults`, `overrides`, `filters`) to handle resolved paths for better flexibility. - Added helper functions for path parsing, query matching, and logical resolution to improve modularity and reusability. - Introduced payload condition match logic, including `match`, `not-match`, `exist`, and `not-exist` rules in `PayloadConfig`. - Enhanced `payloadModelRulesMatch` function to support conditional checks at various levels. - Added helper methods for evaluating JSON path conditions and values. - Updated tests to validate new conditional rules against different payload scenarios.	2026-05-17 23:06:43 +08:00
Luis Pater	26d13af28f	feat(runtime): enhance payload rule resolution with dynamic path support - Introduced `resolvePayloadRulePaths` function to dynamically resolve rule paths supporting array queries and complex logic. - Updated payload processing logic (`apply defaults`, `overrides`, `filters`) to handle resolved paths for better flexibility. - Added helper functions for path parsing, query matching, and logical resolution to improve modularity and reusability.	2026-05-17 16:42:35 +08:00
Luis Pater	4b13f9c255	Merge pull request #3439 from ben-vargas/fix-grok-tool-params fix(xai): default missing function tool parameters	2026-05-17 15:04:28 +08:00
Luis Pater	74cb53dee1	feat(xai): support namespace tools and enhance tool normalization logic - Added `namespace` tool type support, enabling nested tools to be normalized and moved to the top level. - Refactored tool normalization logic into `normalizeXAITool` for reusability and clarity. - Updated `xai_executor` test cases to validate namespace tool handling and nested tool normalization.	2026-05-17 15:02:36 +08:00
Ben Vargas	2607888a97	fix(xai): default missing function tool parameters	2026-05-16 17:57:40 -06:00
Luis Pater	8b3670b8dd	feat(xai): support namespace tools and enhance tool normalization logic - Added `namespace` tool type support, enabling nested tools to be normalized and moved to the top level. - Refactored tool normalization logic into `normalizeXAITool` for reusability and clarity. - Updated `xai_executor` test cases to validate namespace tool handling and nested tool normalization.	2026-05-17 05:22:57 +08:00
Luis Pater	ddd10539ad	feat(xai): normalize xAI input reasoning items and enhance test cases - Added `normalizeXAIInputReasoningItems` to clean up `input` reasoning items, removing null `content` and `encrypted_content` fields. - Updated `xai_executor` test cases to validate input normalization and reasoning item handling.	2026-05-17 04:51:17 +08:00
Luis Pater	088ab33df8	feat(api): add Codex client models support for OpenAI API - Introduced Codex client models framework in `openai` package. - Added JSON-based model definitions (`codex_client_models.json`) for Codex, including metadata, reasoning levels, and configuration options. - Implemented handlers to load, clone, and build Codex client models with support for visibility overrides and metadata application. - Enabled sorting and prioritization of models based on configuration or runtime criteria. - Added utility functions for managing and validating model attributes.	2026-05-17 04:48:34 +08:00
Luis Pater	53d1fd6c5c	feat(api, xai): add xAI Grok video model support with API integration - Introduced new xAI `grok-imagine-video` model for video generation with configurable options (e.g., duration, size, resolution). - Implemented video-specific API endpoints (`/v1/videos`, `/v1/videos/generations`, `/v1/videos/edits`, `/v1/videos/extensions`), including request validation and model handling. - Enhanced model registry with `xaiBuiltinVideoModelID` and metadata for video capabilities. - Added unit tests to validate video model support, request structures, and API response handling. - Extended `XAIExecutor` to integrate video generation and retrieval via runtime requests.	2026-05-17 02:53:50 +08:00
Luis Pater	2ff9e33e26	feat(api, xai): integrate xAI Grok image models and extend API endpoints for image support - Added new xAI Grok image models (`grok-imagine-image`, `grok-imagine-image-quality`) with high-fidelity and aspect ratio configurations. - Extended `isSupportedImagesModel` logic to validate xAI models. - Implemented API request builders for image generation/editing with customizable options (e.g., resolution, aspect ratio, response format). - Enhanced `/v1/images` endpoints to handle xAI model capabilities, including response normalization and model-specific handlers. - Updated unit tests to validate xAI model validation, request structure, and API integration.	2026-05-17 01:30:23 +08:00
Luis Pater	e4c957078c	feat(auth): add OAuth2 support for xAI with PKCE and token persistence - Implemented xAI OAuth2 integration with PKCE (Proof Key for Code Exchange) support. - Added logic for token exchange, refresh, and persistent storage in JSON format. - Created `xai` package with helpers for OAuth discovery, API token handling, and URL building. - Introduced `XAIExecutor` for integrating xAI credentials into runtime HTTP requests. - Added unit tests to validate OAuth flow, token persistence, and endpoint validation.	2026-05-17 01:02:35 +08:00
Luis Pater	6bfcb0ce79	feat(auth): improve unauthorized error handling for refresh and auto-refresh - Added `isUnauthorizedError` and `hasUnauthorizedAuthFailure` to classify and handle unauthorized errors. - Introduced `refreshErrorFromError` to map errors to standardized unauthorized responses. - Modified refresh logic to stop auto-refresh retries for unauthorized errors. - Updated tests to verify unauthorized error handling and refresh retry prevention.	2026-05-13 02:59:46 +08:00

1 2 3 4 5 ...

605 Commits