CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-07-02 04:54:42 +08:00

Author	SHA1	Message	Date
sususu98	53a21dfb0b	[codex] Drop foreign encrypted_content before xAI Grok upstream (#3961 ) * Drop foreign encrypted_content before xAI Grok upstream xAI Grok accepts provider-native encrypted_content as opaque replay state, but GPT/Codex reasoning signatures, Gemini thoughtSignature blobs, and Claude thinking signatures can all travel through OpenAI Responses-style reasoning.encrypted_content while remaining incompatible with xAI. Forwarding those foreign blobs to Grok causes upstream validation failures, especially when the foreign value is high-entropy enough to look ciphertext-like. Add a Grok encrypted_content transport validator that stays conservative and shape-oriented: - require unpadded standard base64 with no foreign characters - reject obvious GPT/Codex gAAAA reasoning signatures before decode - reject strict Claude thinking signatures in both official E-form and Antigravity R-form - reject known Gemini thoughtSignature envelopes by reusing the central Gemini validator, covering Gemini 2.5 field-1 and Gemini 3.x field-2 shapes - require decoded payloads to be long enough and high-entropy enough to look like native Grok ciphertext - avoid decrypting, protobuf-parsing, or otherwise interpreting native Grok payloads on the hot path Wire the validator into the xAI Responses request preparation path for reasoning and compaction input items. Invalid encrypted_content fields are deleted before the request is sent upstream, while the surrounding item is preserved and debug logging records only redacted metadata. Extend coverage with native Grok corpus preservation, Gemini field-1/field-2 rejection, Claude E-form and R-form rejection, invalid-blob sanitizer tests, and compact/websocket replay preservation. The foreign-provider checks are deliberately narrow so high-entropy Grok blobs are not rejected merely because they look random. * fix(xai): harden encrypted content sanitizer	2026-06-23 14:33:27 +08:00
Luis Pater	7c390a7a2e	feat(runtime): add Claude Code session handling with caching and tests - Introduced `ClaudeCodeSessionID` resolution logic, preferring headers over payload metadata. - Added `ClaudeCodePromptCache` to map sessions to stable prompt cache keys. - Refactored existing logic to integrate `ClaudeCodePromptCache` for session-based handling. - Included extensive unit tests to validate session ID extraction, cache reuse, and header prioritization.	2026-06-23 13:19:13 +08:00
Luis Pater	28e2f9798c	feat(executor): add session isolation for `grok-composer` models - Introduced `xaiRequiresIsolatedConversation` to enforce session ID generation for `grok-composer` models. - Updated request preparation logic to handle isolated conversations by setting `prompt_cache_key` and `x-grok-conv-id`. - Added unit tests with coverage for session isolation, stateless models, and explicit `prompt_cache_key` scenarios. Closes: #3750	2026-06-20 10:54:16 +08:00
Luis Pater	96a8b0cfe2	feat(executor): normalize reasoning text events and enhance handling logic - Introduced `xaiNormalizeReasoningSummaryData` and related functions to normalize `reasoning_text` events into `reasoning_summary` shapes for standardization. - Updated WebSocket and streaming logic to process normalized reasoning summary events correctly. - Enhanced tests to validate normalization, order of events, and output structure in both stream and non-stream scenarios.	2026-06-17 13:00:00 +08:00
Luis Pater	529d9e92c9	feat(executor): add support for compact response handling in XAIExecutor - Introduced `executeCompact` to handle non-streaming compact responses via the `/responses/compact` endpoint. - Added `executeCompactionTriggerStream` for streaming responses triggered by `compaction_trigger`. - Enhanced request preparation with `prepareResponsesRequestTo` for dynamic response formats. - Updated logic to bypass streaming for `/responses/compact` and added fallback behaviors. - Added comprehensive tests for compact response handling and event streaming validations.	2026-06-15 00:29:38 +08:00
Luis Pater	8e39db2ec7	feat(plugin, api): introduce host model callback support with Go example and API handlers - Added an example plugin `host-model-callback` in Go to summarize host model callbacks. - Implemented `cliproxy_plugin_init`, `cliproxyPluginCall`, and other plugin functions for callback handling. - Introduced API handlers for `ModelExecution` and `ModelExecutionStream` with support for both streaming and non-streaming requests. - Included unit tests (`model_execution_test.go`) to validate execution logic and streaming responses.	2026-06-12 02:22:23 +08:00
Luis Pater	387c783b32	Merge pull request #3649 from intcua/fix/xai-empty-tools-orphan-tool-choice fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty	2026-06-04 13:11:23 +08:00
Luis Pater	959067edfb	feat(usage): introduce executor type tracking in usage reporting - Replaced `NewUsageReporter` with `NewExecutorUsageReporter` to include executor type in usage records. - Updated all executors to use the new reporter implementation. - Extended `UsageReporter` to track and publish executor type. - Added tests to validate proper executor type recording and handling. - Enhanced RedisQueue plugin and payload schema with executor type support.	2026-06-02 00:43:16 +08:00
lamtran	303685c230	fix(executor/xai): drop orphaned tool_choice when Claude tools array is empty When Claude Code sends a stop-hook evaluator request (or any request without tools), the payload includes "tools": [] (empty array). The claude->codex translator unconditionally emits tools: [] + tool_choice: "auto" + parallel_tool_calls: true into the Codex Responses shape. When that payload is routed to xAI, the upstream rejects with HTTP 400: "A tool_choice was set on the request but no tools were specified." Fix entirely in the xAI executor (translator package is policy-locked): add normalizeXAIToolChoiceForTools() after normalizeXAITools() to drop tool_choice and parallel_tool_calls whenever tools end up absent or empty (covering both the empty-from-source case and the all-filtered-out case where every tool was an unsupported type such as tool_search or image_generation). Per code-review feedback: always remove parallel_tool_calls when tools are missing (not gated on tool_choice presence) and existence-check each key before sjson delete to avoid unnecessary JSON parse/copy. Verification: - go build -o test-output ./cmd/server - go test ./internal/runtime/executor/... -count=1 - 5 new regression tests cover empty / missing / present / orphaned parallel_tool_calls / no-op-when-both-absent.	2026-05-31 23:13:15 +07:00
Luis Pater	94c1b25146	feat(executor): add TTFT tracking and reporting for enhanced performance metrics - Introduced Time-To-First-Token (TTFT) measurement and reporting across major executors. - Added TTFT calculation to `UsageReporter`, including support for HTTP clients and WebSocket communication. - Updated tests to validate TTFT tracking in streamed and non-streamed scenarios. - Ensured integration with `usage` plugin and augmented usage records with TTFT data.	2026-05-28 02:59:24 +08:00
Luis Pater	11f0f906bd	feat(logging): add `SetTranslatedReasoningEffort` to track reasoning levels in usage reporting - Introduced `SetTranslatedReasoningEffort` method in `UsageReporter` to capture and log reasoning efforts from translated payloads. - Updated executors to incorporate the new reporting functionality for handling reasoning efforts across various providers. - Enhanced logging for thinking level extraction with new helper function `ExtractTranslatedReasoningEffort`.	2026-05-28 02:19:45 +08:00
Luis Pater	bac006e72b	feat(thinking): add xAI provider support with reasoning.effort implementation - Implemented `xAI` provider for thinking configurations with support for reasoning.effort levels. - Registered `xAI` in available providers and updated relevant APIs for compatibility. - Added unit tests for `xAI` provider functionality, including fallback logic for unsupported levels. - Integrated `xAI` with executor handling and ensured conformance with OpenAI-compatible standards.	2026-05-19 03:09:53 +08:00
Luis Pater	2007a89594	feat(runtime): enhance payload rule resolution with dynamic path support - Introduced `resolvePayloadRulePaths` function to dynamically resolve rule paths supporting array queries and complex logic. - Updated payload processing logic (`apply defaults`, `overrides`, `filters`) to handle resolved paths for better flexibility. - Added helper functions for path parsing, query matching, and logical resolution to improve modularity and reusability. - Introduced payload condition match logic, including `match`, `not-match`, `exist`, and `not-exist` rules in `PayloadConfig`. - Enhanced `payloadModelRulesMatch` function to support conditional checks at various levels. - Added helper methods for evaluating JSON path conditions and values. - Updated tests to validate new conditional rules against different payload scenarios.	2026-05-17 23:06:43 +08:00
Luis Pater	4b13f9c255	Merge pull request #3439 from ben-vargas/fix-grok-tool-params fix(xai): default missing function tool parameters	2026-05-17 15:04:28 +08:00
Luis Pater	74cb53dee1	feat(xai): support namespace tools and enhance tool normalization logic - Added `namespace` tool type support, enabling nested tools to be normalized and moved to the top level. - Refactored tool normalization logic into `normalizeXAITool` for reusability and clarity. - Updated `xai_executor` test cases to validate namespace tool handling and nested tool normalization.	2026-05-17 15:02:36 +08:00
Ben Vargas	2607888a97	fix(xai): default missing function tool parameters	2026-05-16 17:57:40 -06:00
Luis Pater	8b3670b8dd	feat(xai): support namespace tools and enhance tool normalization logic - Added `namespace` tool type support, enabling nested tools to be normalized and moved to the top level. - Refactored tool normalization logic into `normalizeXAITool` for reusability and clarity. - Updated `xai_executor` test cases to validate namespace tool handling and nested tool normalization.	2026-05-17 05:22:57 +08:00
Luis Pater	ddd10539ad	feat(xai): normalize xAI input reasoning items and enhance test cases - Added `normalizeXAIInputReasoningItems` to clean up `input` reasoning items, removing null `content` and `encrypted_content` fields. - Updated `xai_executor` test cases to validate input normalization and reasoning item handling.	2026-05-17 04:51:17 +08:00
Luis Pater	088ab33df8	feat(api): add Codex client models support for OpenAI API - Introduced Codex client models framework in `openai` package. - Added JSON-based model definitions (`codex_client_models.json`) for Codex, including metadata, reasoning levels, and configuration options. - Implemented handlers to load, clone, and build Codex client models with support for visibility overrides and metadata application. - Enabled sorting and prioritization of models based on configuration or runtime criteria. - Added utility functions for managing and validating model attributes.	2026-05-17 04:48:34 +08:00
Luis Pater	53d1fd6c5c	feat(api, xai): add xAI Grok video model support with API integration - Introduced new xAI `grok-imagine-video` model for video generation with configurable options (e.g., duration, size, resolution). - Implemented video-specific API endpoints (`/v1/videos`, `/v1/videos/generations`, `/v1/videos/edits`, `/v1/videos/extensions`), including request validation and model handling. - Enhanced model registry with `xaiBuiltinVideoModelID` and metadata for video capabilities. - Added unit tests to validate video model support, request structures, and API response handling. - Extended `XAIExecutor` to integrate video generation and retrieval via runtime requests.	2026-05-17 02:53:50 +08:00
Luis Pater	2ff9e33e26	feat(api, xai): integrate xAI Grok image models and extend API endpoints for image support - Added new xAI Grok image models (`grok-imagine-image`, `grok-imagine-image-quality`) with high-fidelity and aspect ratio configurations. - Extended `isSupportedImagesModel` logic to validate xAI models. - Implemented API request builders for image generation/editing with customizable options (e.g., resolution, aspect ratio, response format). - Enhanced `/v1/images` endpoints to handle xAI model capabilities, including response normalization and model-specific handlers. - Updated unit tests to validate xAI model validation, request structure, and API integration.	2026-05-17 01:30:23 +08:00
Luis Pater	e4c957078c	feat(auth): add OAuth2 support for xAI with PKCE and token persistence - Implemented xAI OAuth2 integration with PKCE (Proof Key for Code Exchange) support. - Added logic for token exchange, refresh, and persistent storage in JSON format. - Created `xai` package with helpers for OAuth discovery, API token handling, and URL building. - Introduced `XAIExecutor` for integrating xAI credentials into runtime HTTP requests. - Added unit tests to validate OAuth flow, token persistence, and endpoint validation.	2026-05-17 01:02:35 +08:00

22 Commits