CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-24 20:19:03 +08:00

Author	SHA1	Message	Date
LTbinglingfeng	7f026e1aab	Add runtime config clone	2026-06-17 02:39:04 +08:00
LTbinglingfeng	a65ced4a92	fix(management): reload plugins asynchronously after changes	2026-06-17 01:06:57 +08:00
LTbinglingfeng	13f51d96cb	fix(pluginhost): avoid holding host lock during plugin lifecycle	2026-06-17 01:05:21 +08:00
sususu98	30dc2e7f34	fix(translator): emit Claude server tool blocks for Codex web_search_call streams (#3868 ) * fix(translator): emit Claude server tool blocks for Codex web_search_call streams Map Codex Responses streaming web_search_call events to Claude SSE server_tool_use and web_search_tool_result blocks, with deduplication and a focused stream regression test. * fix(translator): stabilize Codex web_search fallback tool_use IDs Reuse the active fallback web_search tool_use ID across later stream events so tool_result blocks stay paired when upstream omits item IDs. This is defensive hardening; live Codex streams already provide ws_* IDs. * fix(translator): emit Codex web_search blocks from populated items Wait for output_item.done before emitting Claude web_search tool_use and tool_result blocks, and avoid deduping early added/completed events that arrive before action.query is available. Matches live Responses stream ordering seen in local tmux verification. * fix(translator): map Codex web_search_call items in non-stream Claude responses Emit server_tool_use and web_search_tool_result blocks from completed response.output web_search_call items, matching the streaming translator. * fix(translator): keep non-stream web_search on end_turn and dedupe output items Do not treat server web_search_call items as client tool_use for stop_reason. Skip duplicate or query-less open_page web_search output items in non-stream translation, matching spark live behavior.	2026-06-16 23:07:08 +08:00
Luis Pater	2884a67ed0	feat(videos): add support for video_url extraction and validation in handlers - Updated `openai_videos_handlers` to extract and set `video_url` from payloads when available. - Enhanced unit tests to validate correct `video_url` extraction and inclusion in responses. v7.2.9	2026-06-16 21:06:27 +08:00
Luis Pater	f63cf9820a	docs: add CatAPI sponsorship details to README files - Included CatAPI information in README files (EN, JA, CN) to acknowledge sponsorship. - Added CatAPI logo and sign-up link with credit claim details. - Updated project assets to include CatAPI logo. v7.2.8	2026-06-16 20:16:59 +08:00
sususu98	87132e54d7	feat(plugin): add ModelRouter before auth with single-slot routing targets (#3865 ) * feat(plugin): add ModelRouter before auth with single-slot routing targets ## Motivation Plugins that need to change execution based on the original inbound request (protocol format, raw body, headers, query, stream flag, metadata, etc.) often resorted to virtual/trampoline models or routing inside interceptors. This commit adds ModelRouter: a pluggable layer before model-to-provider resolution and AuthManager credential selection, so plugins can declare who executes a request without spoofing the client model name. This is a new capability, not a bugfix on the existing chain. With no ModelRouter plugins loaded, behavior matches upstream. ## Pipeline placement - `execute`, `stream`, and `count` (and image paths via AuthManager) call `applyModelRouter()` before building `coreexecutor.Request`. - Routing runs before the request interceptor (before auth), so routers see the client’s original context. After a plugin executor is chosen, the existing after-auth interceptor → response/stream interceptor chain still applies. - Internal `ExecuteModel` / `ExecuteModelStream` (host callbacks) support `SkipRouterPluginID` so nested calls do not re-enter the same router. ## Routing API (single slot, mutually exclusive) `ModelRouteResponse` uses one target slot to avoid ambiguity when both `TargetExecutorPluginID` and `TargetProvider` were set and the host ignored one: \| Field \| Meaning \| \|-------\|---------\| \| `Handled` \| `false`: this router declines; try the next router or default path \| \| `TargetKind` \| `self` \\| `executor` \\| `provider` (pick one) \| \| `Target` \| `self`/`executor`: plugin ID; `provider`: built-in provider key \| \| `TargetModel` \| Optional on `provider` only; empty keeps client `RequestedModel` \| \| `Reason` \| Optional diagnostic text \| - self: the router plugin’s own executor (`Target` normalized to the router’s plugin ID). - executor: another plugin’s executor; host pre-checks with `executorPluginReady()` (executor declared and provider identifier resolvable) to avoid handled routes that 500 at execution. - provider: skip registry model resolution; fixed built-in AuthManager path; optional `TargetModel` for execution model only—does not change outward requested-model metadata. Routers run in descending plugin priority (tie-break: ascending plugin ID). Panic, error, invalid target, or unavailable executor/provider → log and fall through to the next router; if none handle, use the original provider+auth flow. ## Context exposed to routers `ModelRouteRequest` includes: - `SourceFormat`, `RequestedModel`, `Stream` - `Headers`, `Query`, `Body` (defensive copies) - `Metadata` (best-effort read-only context snapshot) - `AvailableProviders`: built-in provider keys with at least one non-disabled auth (`AuthManager.AvailableProviders()`). Does not reflect per-model cooldown or transient unavailability—treat as an optimistic snapshot. Adds `AuthManager.HasProviderAuth()` and `AvailableProviders()`, excluding `Disabled` and `StatusDisabled` auths consistently with credential selection. ## Host and RPC - Go plugins: `pluginapi.ModelRouter` + `RouteModel()`. - RPC plugins: `pluginabi.MethodModelRoute` (`model.route`), capability flag `model_router`. - `pluginhost.Host` implements `RouteModel` / `RouteModelExcept`; handlers use `SetModelRouterHost` or a `PluginHost` type assertion; direct executor paths use `ExecutePluginExecutor` / `CountPluginExecutor`. - No bundled example ModelRouter plugin; capability is active only when a third-party plugin declares `model_router` and loads. ## Plugin RPC schema (policy A, upstream-aligned) - `pluginabi.SchemaVersion` stays 1: capability additions (`model_router`, `model.route`) do not bump the number; increment only on breaking RPC JSON changes. - Host sends `schema_version` at register; reject only if the plugin declares a higher* version than the host. - No unpublished “ModelRouter requires schema ≥ 3” gate (v3 single-slot API was never public). - Existing plugins and examples without `model_router` (`schema_version: 1`) need no changes. - RPC ModelRouter: `schema_version: 1` + `model_router: true` + implement `model.route`. ## Path consistency within this commit - Provider routes reuse image-only model checks (e.g. `gpt-image-2`) on the normalized model, same as the default AuthManager path. - `count` aligned with execute/stream: `SkipRouterPluginID`, query/headers injection, interceptor skip semantics. - Handlers: `modelRoutersEnabled` treats hosts without `HasModelRouters` as disabled (same as before ModelRouter existed); `pluginhost.Host` implements the detector. - API docs: `ModelRouter` explicitly includes built-in provider targets (in addition to plugin executors and the router’s own executor). ## Testing go test ./internal/pluginhost ./sdk/api/handlers ./sdk/pluginapi ./sdk/pluginabi ./sdk/cliproxy/auth go build -o test-output ./cmd/server && rm test-output go test ./... * fix(handlers): address ModelRouter review feedback - Use modelExecutionQuery for plugin executor and AuthManager paths so inbound URL query matches router/header behavior - Guard queryFromContext when gin Request.URL is nil - Read plugin executor stream chunks via nextStreamChunk to exit on cancel - Drop redundant clonePluginMetadata on capability record meta Tests cover query propagation, stream cancel, and nil URL safety. * feat(plugin): add Claude web search router example Add a Claude Code web_search ModelRouter example that can route matching Claude requests through Antigravity, Codex, xAI, or Tavily. The plugin includes executor orchestration, backend fallback/penalty handling, Tavily API key support, Claude-compatible response assembly, stream forwarding, and focused unit coverage for detection, fallback routing, model resolution, penalties, stream forwarding, and Tavily behavior. Verification: go test -count=1 ./... in examples/plugin/claude-web-search-router/go; go build -buildmode=c-shared for the plugin; go build ./cmd/server; live local CPA curl coverage for plugin load, four explicit routes, fallback, and Codex spark routing. * fix(pluginhost): validate executor routes before fallback * fix(pluginhost): skip oauth-only executor routes	2026-06-16 19:15:34 +08:00
sususu98	9f940f162f	fix(pluginhost): keep stream callbacks alive until stream close Keep RPC streaming executor callback scopes alive until async streams close, detach nested host.model.execute_stream contexts from request cancellation, and clean up the stream bridge on stream completion.	2026-06-16 17:31:11 +08:00
Luis Pater	907e3493ee	docs: update VisionCoder details in README files - Added information about exclusive retail availability of Claude Max 200 and GPT Pro 200 premium accounts. - Enhanced descriptions of VisionCoder's offerings in README files (EN, JA, CN).	2026-06-16 13:52:33 +08:00
Luis Pater	8fad0d0325	feat(config+executor): add global Claude cloak mode toggle and improve credential fallback logic - Introduced `disable-claude-cloak-mode` configuration to globally disable Claude cloak mode with credential-level overrides. - Enhanced `getCloakConfigFromAuth` to support fallback to metadata for cloak settings. - Updated cloak configuration precedence logic, integrating global, credential, and default modes. - Updated config and watcher diff handling to include `disable-claude-cloak-mode`. Closes: #2789	2026-06-16 13:03:16 +08:00
Luis Pater	2406daf3ef	feat(util): normalize Claude tool_result content and improve Gemini integration - Added `ConvertClaudeToolResultContent` to standardize Claude tool_result content, preserving JSON structure and splitting out base64-encoded images. - Updated Gemini and Gemini-CLI translators to use the new utility for generating deterministic function responses and inline image parts. - Added comprehensive test cases for content types and edge cases, ensuring correct handling of string, JSON, and image blocks. Closes: #2781 v7.2.7	2026-06-16 08:09:30 +08:00
Luis Pater	844b855974	feat(executor): sanitize web search tool domains to meet Anthropic requirements - Added `sanitizeClaudeWebSearchDomains` to remove empty `allowed_domains` and `blocked_domains` fields for built-in web_search tools, addressing ambiguity errors from Anthropic. - Integrated domain sanitization into the Claude message preparation pipeline. - Added test cases to validate correct handling of empty and non-empty domain fields across various tool types. Closes: #2681	2026-06-16 03:29:44 +08:00
Supra4E8C	e40599831a	Merge pull request #3850 from router-for-me/feat/log-feat Feat/log feat v7.2.6	2026-06-15 22:06:38 +08:00
LTbinglingfeng	0b21b07115	fix log cursor rotation gap	2026-06-15 18:37:21 +08:00
LTbinglingfeng	5036513bf9	Fix ambiguous empty log cursor handling	2026-06-15 16:53:50 +08:00
LTbinglingfeng	a47c386319	Avoid counting all logs for tail reads	2026-06-15 15:51:15 +08:00
LTbinglingfeng	db3fdea4a1	Disambiguate zero-offset log cursors	2026-06-15 15:50:03 +08:00
LTbinglingfeng	917cec3bf6	Continue log cursors across rotation	2026-06-15 15:46:58 +08:00
LTbinglingfeng	3b0cc913ec	feat(plugins): implement asynchronous config reload after plugin deletion	2026-06-15 15:35:55 +08:00
Luis Pater	bbef8da454	feat(videos): add video authentication binding and update handler behavior - Introduced `videoAuthBindingStore` for managing mappings of video IDs to credentials with TTL support. - Updated video creation and retrieval handlers to bind and utilize credentials for authentication. - Enhanced response models to include upstream models and adjusted request preparation logic. - Added test coverage for video auth binding, TTL configuration, and expiration handling. v7.2.5	2026-06-15 13:38:40 +08:00
Luis Pater	f85768eef3	feat(auth): add config API key exclusion management with tests - Implemented helper methods `IsConfigAPIKeyAuth` and `toggleConfigAPIKeyExcludedAll` for managing config API key exclusions. - Updated API request handling to support enabling/disabling config API key exclusion patterns. - Added test coverage to validate exclusion toggling logic and persistence behavior. - Refactored duplicate code for identifying config API key auth entries into reusable utilities.	2026-06-15 11:14:05 +08:00
Luis Pater	f33bc56bb9	feat(websockets): add transcript state tracking and compaction trigger support - Added methods for managing and tracking WebSocket transcript state, including recording, prepending, and replacing transcript inputs. - Implemented `executeCompactionTriggerFromWebsocketContext` to support compaction triggers using recorded transcript context. - Enhanced upstream-downstream ID mapping with additional utilities and state synchronization. - Expanded test coverage to validate transcript state management, compaction payload generation, and WebSocket response handling. v7.2.4	2026-06-15 10:41:35 +08:00
Luis Pater	ea90ab6f77	feat(websockets): implement XAIWebsocketsExecutor with enhanced execution and ID mapping - Developed `XAIWebsocketsExecutor` for handling xAI Responses via WebSocket transport. - Introduced session and state management with `codexWebsocketSessionStore` and `xaiWebsocketIDStateStore`. - Added robust ID mapping for upstream and downstream request/response sequences. - Enhanced error propagation and handling of WebSocket terminal events. - Included utility methods for WebSocket request preparation, connection management, and state tracking. - Added foundational support for compact and streamed responses via enhanced session tracking. v7.2.3	2026-06-15 08:22:07 +08:00
Luis Pater	56988aea0f	feat(websockets): add Codex websocket passthrough support with tests - Implemented `websocketDirectCaptureExecutor` for Codex websocket passthrough functionality. - Added logic to bypass incremental state handling for passthrough models. - Updated normalization, compaction, and replay handling to support passthrough mode. - Introduced `responsesWebsocketUsesCodexWebsocketPassthrough` utility for model-specific passthrough determination. - Expanded test coverage for websocket passthrough scenarios, including compaction and response validation.	2026-06-15 02:31:05 +08:00
LTbinglingfeng	d417fa534f	Bound management log cursor reads	2026-06-15 02:18:58 +08:00
LTbinglingfeng	0d82daca6b	Preserve management log line counts	2026-06-15 02:17:57 +08:00
LTbinglingfeng	331daa24ad	Read management logs from cursors	2026-06-15 02:10:25 +08:00
LTbinglingfeng	95a72a47c8	Tail management logs with cursors	2026-06-15 02:08:06 +08:00
LTbinglingfeng	c61453a807	Add log cursor helpers	2026-06-15 02:06:25 +08:00
Luis Pater	7de9757c82	feat: add OpenAI video support with improved error handling and response normalization - Introduced `/openai/v1/videos` endpoint to support OpenAI-specific video generation. - Added error normalization and handling for OpenAI video resources, including detailed error propagation. - Enhanced response structure to include OpenAI-specific fields for status, progress, and model mappings. - Implemented new handlers for video content retrieval and error scenarios. - Expanded test coverage to validate OpenAI video support, error handling, and backend compatibility. v7.2.2	2026-06-15 01:53:52 +08:00
Supra4E8C	ea4e978c60	Merge pull request #3848 from router-for-me/feat/Community_Plugins Feat/community plugins	2026-06-15 01:22:55 +08:00
LTbinglingfeng	6f3bd7641b	feat(pluginstore): improve nil checks in pluginStoreSnapshot function	2026-06-15 01:21:07 +08:00
Luis Pater	3b96119050	feat(websockets): handle terminal events and improve error propagation - Enhanced Codex Websockets Executor to capture `response.done` as a terminal event, alongside `response.completed` and `error`. - Improved error propagation for upstream websocket errors with comprehensive message handling. - Introduced utility functions for recognizing terminal events and extracting error messages. - Expanded tests to validate new websocket event logic, including terminal event handling and upstream error propagation.	2026-06-15 01:03:19 +08:00
LTbinglingfeng	239d7ee0b0	feat(pluginstore): refactor plugin store source handling to use string URLs	2026-06-15 00:54:32 +08:00
LTbinglingfeng	303c0f2f53	feat(pluginstore): add support for third-party plugin store sources and enhance plugin management	2026-06-15 00:50:22 +08:00
Luis Pater	529d9e92c9	feat(executor): add support for compact response handling in XAIExecutor - Introduced `executeCompact` to handle non-streaming compact responses via the `/responses/compact` endpoint. - Added `executeCompactionTriggerStream` for streaming responses triggered by `compaction_trigger`. - Enhanced request preparation with `prepareResponsesRequestTo` for dynamic response formats. - Updated logic to bypass streaming for `/responses/compact` and added fallback behaviors. - Added comprehensive tests for compact response handling and event streaming validations.	2026-06-15 00:29:38 +08:00
Luis Pater	6f923a28f7	feat(pluginhost): implement host authentication callbacks and add tests - Introduced `auth_callbacks` for handling host authentication list, get, runtime, and save operations. - Added extensive unit tests to validate functionality, including disk fallback and runtime-specific cases. - Created example implementation in Go to demonstrate host callback integrations. v7.2.1	2026-06-14 23:51:40 +08:00
Luis Pater	c59eee60b7	Merge pull request #3847 from router-for-me/home Fix credential handling in home models	2026-06-14 23:47:38 +08:00
hkfires	64a8957e60	fix(auth): map credential errors to unauthorized	2026-06-14 23:02:21 +08:00
hkfires	b5da088767	fix(home): forward credentials for home models	2026-06-14 22:09:29 +08:00
Luis Pater	2a050dc95d	feat: enhance fault tolerance for kv-based caching and introduce additional tests - Updated Antigravity Credits fallback to handle KV store unavailability as a service error. - Enhanced signature caching mechanisms with request-time KV access and sliding expiration. - Added and improved tests for KV client interactions, including error handling and expiration behaviors. - Introduced `CacheSignatureBestEffort` for non-critical signature caching and clarified function flows with required context. - Ensured consistent error reporting for missing or unavailable KV stores in various scenarios. - Replaced direct `homekv` calls with injectable KV client interfaces for `antigravity` and `codex_reasoning_replay` modules. - Improved error reporting and handling for KV operations, including `KVGet`, `KVSet`, `KVDel`, and `KVExpire`. - Introduced dedicated fake KV clients for expanded and granular test coverage. - Added new unit tests to validate KV client behaviors and error scenarios, ensuring robustness and sliding expiration functionality. v7.2.0	2026-06-14 21:11:35 +08:00
Luis Pater	f6aa9a067e	Merge pull request #3846 from router-for-me/remove_amp Remove AMP integration support	2026-06-14 20:57:36 +08:00
hkfires	79db0e54be	refactor(api): remove deprecated route module interfaces	2026-06-14 20:45:23 +08:00
hkfires	b4054e185e	refactor(config): remove legacy migration code	2026-06-14 20:36:54 +08:00
hkfires	8122b9fe4b	feat!: remove amp integration support BREAKING CHANGE: ampcode configuration, management endpoints, provider routing, and X-Amp-Thread-Id session affinity are no longer supported	2026-06-14 20:31:00 +08:00
Luis Pater	e0cb43f38d	Merge pull request #3825 from xiaocoss/add-quotio-desktop docs: add Quotio Desktop to the projects list v7.1.76	2026-06-14 05:47:03 +08:00
Luis Pater	964e86f612	Merge pull request #3827 from iamkhalidbashir/add-kimi-k2.7-code feat: add Kimi K2.7 Code model (kimi-k2.7-code)	2026-06-14 05:43:53 +08:00
Luis Pater	eebb873b8a	Merge pull request #3832 from haowang02/feat/disable-image-generation-passthrough feat(config): add "passthrough" mode for disable-image-generation	2026-06-14 05:21:44 +08:00
Luis Pater	6d472d7b4f	feat(models): increase `context_length` for Composer 2.5 Fast to 200,000	2026-06-14 02:27:08 +08:00
Hao Wang	8d4a7f1f2e	feat(config): add "passthrough" mode for disable-image-generation Adds a fourth value for the disable-image-generation setting: - false: inject image_generation (unchanged) - true: strip everywhere + 404 on /v1/images/* (unchanged) - chat: strip on non-images endpoints, keep /v1/images/* (unchanged) - passthrough: never inject and never strip on non-images endpoints (the client payload is forwarded unchanged); behaves like "chat" on /v1/images/* endpoints. image_generation injection (codex executors) is already gated on the Off mode, and the /v1/images/* 404 gate is already gated on the All mode, so passthrough only required a change to the payload strip logic in payload_helpers.go, now expressed via shouldStripImageGeneration(). Closes #3831 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-13 22:41:15 +08:00

1 2 3 4 5 ...

2758 Commits