CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-06-23 02:36:49 +08:00

Author	SHA1	Message	Date
Luis Pater	041a065b2f	Merge branch 'remove-gemini-cli' into dev # Conflicts: # internal/api/handlers/management/auth_files.go # internal/thinking/provider/geminicli/apply.go	2026-06-19 14:40:29 +08:00
Luis Pater	ac8fb9706f	feat(thinking): remove `thinkingConfig` for `ModeNone` with zero budget and no level - Updated Gemini, Gemini CLI, and Antigravity logic to delete `thinkingConfig` when `ModeNone` is set, `Budget=0`, and `Level` is empty. - Adjusted tests to validate this behavior across multiple scenarios and models with zero-allowed configurations. - Extended test cases for additional coverage of mixed-model behavior. Closes: #3138	2026-06-18 22:39:02 +08:00
Luis Pater	78ba8ba731	chore: remove Gemini CLI-related translator packages and logic - Deleted `geminicli` provider and related `Apply` logic. - Removed all translator packages specific to Gemini CLI (Claude, Codex integrations). - Purged associated test files for Gemini CLI translation. - Removed `GeminiAuthenticator` and all associated authentication logic (OAuth flows, token handling, refresh logic). - Deleted internal/executor Gemini OAuth support, including bearer token handling and runtime API logic. - Purged all tests, configs, and command-line flags specific to Gemini OAuth flows. - Updated documentation and aliases to reflect Gemini removal. - Renamed `parseRetryDelay` to `ParseRetryDelay` and `deleteJSONField` to `DeleteJSONField`. - Updated references in `antigravity_executor` and tests to use the new `helps` package. - Adjusted import paths and test cases to ensure compatibility with the new location. - Updated README files to reflect changes in the retry logic references. - Updated `.github/ISSUE_TEMPLATE/bug_report.md` to remove deprecated Gemini CLI mention.	2026-06-18 13:33:10 +08:00
hkfires	8c6f279f0a	refactor(tests): remove obsolete test files and update reasoning effort logic	2026-06-17 08:11:25 +08:00
LTbinglingfeng	e38ba28db5	feat(pluginstore): add plugin store support	2026-06-12 23:15:00 +08:00
Luis Pater	0ed85bb88b	feat(pluginhost): refactor and enhance plugin system with new execution and thinking capabilities - Removed `examples/plugin/main.go` and `internal/pluginhost/loader_plugin.go` after migrating to a more modular system. - Introduced `streamBridge` in `internal/pluginhost/stream_bridge.go` for efficient stream handling and communication. - Added examples of `thinking` plugins written in both Rust and Go under `examples/plugin/thinking`. - Enhanced test coverage for plugin host system changes, including stream chunk translation and thinking logic. - Improved API compatibility and ensured backward-compatible upgrades for plugin execution.	2026-06-07 03:20:04 +08:00
Luis Pater	d625caddd9	feat(pluginhost): add capabilities for command-line flag handling and plugin execution - Implemented command-line flag registration and execution for plugins with priority-based conflict resolution. - Enabled plugin-owned command-line flag execution and persistence of plugin-auth data. - Added new `Host` methods to support command-line capabilities, including flag normalization, validation, and execution state management. - Introduced unit tests to ensure coverage for command-line plugin functionality, including auth data persistence. - Updated configs to normalize plugins during initialization.	2026-06-06 18:35:17 +08:00
Luis Pater	11f0f906bd	feat(logging): add `SetTranslatedReasoningEffort` to track reasoning levels in usage reporting - Introduced `SetTranslatedReasoningEffort` method in `UsageReporter` to capture and log reasoning efforts from translated payloads. - Updated executors to incorporate the new reporting functionality for handling reasoning efforts across various providers. - Enhanced logging for thinking level extraction with new helper function `ExtractTranslatedReasoningEffort`.	2026-05-28 02:19:45 +08:00
yavon007	0de0ad0d36	Add reasoning effort to usage events	2026-05-19 22:10:48 +08:00
Luis Pater	bac006e72b	feat(thinking): add xAI provider support with reasoning.effort implementation - Implemented `xAI` provider for thinking configurations with support for reasoning.effort levels. - Registered `xAI` in available providers and updated relevant APIs for compatibility. - Added unit tests for `xAI` provider functionality, including fallback logic for unsupported levels. - Integrated `xAI` with executor handling and ensured conformance with OpenAI-compatible standards.	2026-05-19 03:09:53 +08:00
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	f5dc6483d5	chore: remove iFlow-related modules and dependencies - Deleted `iflow` provider implementation, including thinking configuration (`apply.go`) and authentication modules. - Removed iFlow-specific tests, executors, and helpers across SDK and internal components. - Updated all references to exclude iFlow functionality.	2026-04-17 01:07:12 +08:00
Luis Pater	8fac29631d	chore: remove Qwen support from SDK and internal components - Deleted `QwenAuthenticator`, internal `qwen_auth`, and `qwen_executor` implementations. - Removed all Qwen-related OAuth flows, token handling, and execution logic. - Cleaned up dependencies and references to Qwen across the codebase.	2026-04-15 12:16:08 +08:00
hkfires	d390b95b76	fix(tests): update test cases	2026-04-08 08:53:50 +08:00
Luis Pater	c1818f197b	Merge pull request #1940 from Blue-B/fix/claude-interleaved-thinking-amp-gzip-budget fix(claude): enable interleaved-thinking beta, decode AMP error gzip, fix budget 400	2026-04-06 09:08:23 +08:00
Blue-B	5f58248016	fix(claude): clamp max_tokens to model limit in normalizeClaudeBudget When adjustedBudget < minBudget, the previous fix blindly set max_tokens = budgetTokens+1 which could exceed MaxCompletionTokens. Now: cap max_tokens at MaxCompletionTokens, recalculate budget, and disable thinking entirely if constraints are unsatisfiable. Add unit tests covering raise, clamp, disable, and no-op scenarios.	2026-03-09 22:10:30 +09:00
Blue-B	07d6689d87	fix(claude): add interleaved-thinking beta header, AMP gzip error decoding, normalizeClaudeBudget max_tokens 1. Always include interleaved-thinking-2025-05-14 beta header so that thinking blocks are returned correctly for all Claude models. 2. Remove status-code guard in AMP reverse proxy ModifyResponse so that error responses (4xx/5xx) with hidden gzip encoding are decoded properly — prevents garbled error messages reaching the client. 3. In normalizeClaudeBudget, when the adjusted budget falls below the model minimum, set max_tokens = budgetTokens+1 instead of leaving the request unchanged (which causes a 400 from the API).	2026-03-07 21:31:10 +09:00
chujian	7c1299922e	fix(openai-compat): improve pool fallback and preserve adaptive thinking	2026-03-07 16:54:28 +08:00
hkfires	835ae178d4	feat(thinking): rename isBudgetBasedProvider to isBudgetCapableProvider and update logic for provider checks	2026-03-03 19:49:51 +08:00
hkfires	c80ab8bf0d	feat(thinking): improve provider family checks and clamp unsupported levels	2026-03-03 19:05:15 +08:00
hkfires	0452b869e8	feat(thinking): add HasLevel and MapToClaudeEffort functions for adaptive thinking support	2026-03-03 14:16:36 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	8599b1560e	Fixed: #1716 feat(kimi): add support for explicit disabled thinking and reasoning effort handling	2026-02-28 05:29:07 +08:00
hkfires	0659ffab75	Revert "Merge pull request #1627 from thebtf/fix/reasoning-effort-clamping"	2026-02-24 19:47:53 +08:00
Kirill Turanskiy	2ea95266e3	fix: clamp reasoning_effort to valid OpenAI-format values CPA-internal thinking levels like 'xhigh' and 'minimal' are not accepted by OpenAI-format providers (MiniMax, etc.). The OpenAI applier now maps non-standard levels to the nearest valid reasoning_effort value before writing to the request body: xhigh → high minimal → low auto → medium	2026-02-18 03:36:42 +03:00
test	f5f26f0cbe	Add Kimi (Moonshot AI) provider support - OAuth2 device authorization grant flow (RFC 8628) for authentication - Streaming and non-streaming chat completions via OpenAI-compatible API - Models: kimi-k2, kimi-k2-thinking, kimi-k2.5 - CLI `--kimi-login` command for device flow auth - Token management with automatic refresh - Thinking/reasoning effort support for thinking-enabled models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 19:24:46 -05:00
hkfires	209d74062a	fix(thinking): ensure includeThoughts is false for ModeNone in budget processing	2026-02-05 10:24:42 +08:00
hkfires	d86b13c9cb	fix(thinking): support user-defined includeThoughts setting with camelCase and snake_case variants Fixes #1378	2026-02-05 10:07:41 +08:00
neavo	6c65fdf54b	fix(gemini): support snake_case thinking config fields from Python SDK Google official Gemini Python SDK sends thinking_level, thinking_budget, and include_thoughts (snake_case) instead of thinkingLevel, thinkingBudget, and includeThoughts (camelCase). This caused thinking configuration to be ignored when using Python SDK. Changes: - Extract layer: extractGeminiConfig now reads snake_case as fallback - Apply layer: Gemini/CLI/Antigravity appliers clean up snake_case fields - Translator layer: Gemini->OpenAI/Claude/Codex translators support fallback - Tests: Added 4 test cases for snake_case field coverage Fixes #1426	2026-02-04 21:12:47 +08:00
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	e641fde25c	feat(registry): support provider-specific model info lookup	2026-01-20 10:01:17 +08:00
hkfires	239a28793c	feat(claude): clamp thinking budget to max_tokens constraints	2026-01-19 16:32:20 +08:00
hkfires	c421d653e7	refactor(claude): move max_tokens constraint enforcement to Apply method	2026-01-19 15:50:35 +08:00
hkfires	cb6caf3f87	fix(thinking): update ValidateConfig to include fromSuffix parameter and adjust budget validation logic	2026-01-18 16:37:14 +08:00
hkfires	03005b5d29	refactor(thinking): add Gemini family provider grouping for strict validation	2026-01-18 11:30:53 +08:00
hkfires	c7e8830a56	refactor(thinking): pass source and target formats to ApplyThinking for cross-format validation Update ApplyThinking signature to accept fromFormat and toFormat parameters instead of a single provider string. This enables: - Proper level-to-budget conversion when source is level-based (openai/codex) and target is budget-based (gemini/claude) - Strict budget range validation when source and target formats match - Level clamping to nearest supported level for cross-format requests - Format alias resolution in SDK translator registry for codex/openai-response Also adds ErrBudgetOutOfRange error code and improves iflow config extraction to fall back to openai format when iflow-specific config is not present.	2026-01-18 10:30:15 +08:00
hkfires	2b387e169b	feat(iflow): add iflow-rome model definition	2026-01-15 20:23:55 +08:00
hkfires	4ad6189487	refactor(thinking): extract antigravity logic into a dedicated provider	2026-01-15 19:08:22 +08:00
hkfires	ff4ff6bc2f	feat(thinking): support zero as a valid thinking budget for capable models	2026-01-15 15:41:10 +08:00
hkfires	5c40a2db21	refactor(thinking): simplify ModeNone and budget validation logic	2026-01-15 14:03:08 +08:00
hkfires	ee2976cca0	refactor(thinking): improve logging for user-defined models	2026-01-15 13:06:41 +08:00
hkfires	bcd4d9595f	fix(thinking): refine ModeNone handling based on provider capabilities	2026-01-15 13:06:41 +08:00
hkfires	5a77b7728e	refactor(thinking): improve budget clamping and logging with provider/model context	2026-01-15 13:06:41 +08:00
hkfires	1fbbba6f59	feat(logging): order log fields for improved readability	2026-01-15 13:06:41 +08:00
hkfires	f6a2d072e6	refactor(thinking): refine configuration logging	2026-01-15 13:06:41 +08:00
hkfires	6e4a602c60	fix(thinking): map reasoning_effort to thinkingConfig	2026-01-15 13:06:40 +08:00
hkfires	33d66959e9	test(thinking): remove legacy unit and integration tests	2026-01-15 13:06:40 +08:00
hkfires	7f1b2b3f6e	fix(thinking): improve model lookup and validation	2026-01-15 13:06:40 +08:00
hkfires	40ee065eff	fix(thinking): use static lookup to avoid alias issues	2026-01-15 13:06:40 +08:00
hkfires	72f2125668	fix(executor): properly handle thinking application errors	2026-01-15 13:06:39 +08:00

1 2

52 Commits