CLIProxyAPI

mirror of https://github.com/router-for-me/CLIProxyAPI.git synced 2026-05-08 14:48:29 +08:00

Author	SHA1	Message	Date
Luis Pater	2ea8f77efb	feat(models): add GPT-5.5 to the registry with support for advanced tasks	2026-04-29 09:49:26 +08:00
Luis Pater	736ae61e4a	Merge pull request #3051 from philipbankier/fix/gpt55-free-tier-test fix(test): remove free tier from GPT-5.5 inclusion test	2026-04-26 22:35:59 +08:00
philipbankier	32ef1588e8	fix(test): remove free tier from GPT-5.5 inclusion test GPT-5.5 was correctly removed from codex-free tier in `7b89583c` (since free accounts cannot access it), but the test was not updated to reflect this. This caused TestCodexStaticModelsIncludeGPT55 to fail on the free subtest. Changes: - Remove free tier from GPT-5.5 inclusion test - Add new TestCodexFreeModelsExcludeGPT55 to explicitly verify that free tier does NOT include GPT-5.5	2026-04-25 22:11:08 -04:00
Luis Pater	ea670ef8c0	feat(models): add Codex Auto Review model entry to registry JSON Closes: #2995	2026-04-26 03:09:06 +08:00
Luis Pater	7b89583cf8	chore(models): remove GPT-5.5 model entry from registry JSON	2026-04-24 05:07:03 +08:00
Ben Vargas	736018a0b0	Add GPT-5.5 Codex model support	2026-04-23 13:43:02 -06:00
Luis Pater	7d5f6d9382	feat(models): add GPT-5.5 model entry to registry JSON	2026-04-24 02:43:12 +08:00
Luis Pater	e935196df4	feat(models): add hardcoded GPT-Image-2 model support in Codex - Added `GPT-Image-2` as a built-in model to avoid dependency on remote updates for Codex. - Updated model tier functions (`CodexFree`, `CodexTeam`, etc.) to include built-in models via `WithCodexBuiltins`. - Introduced new handlers for image generation and edit operations under `OpenAIAPIHandler`. - Extended tests to validate 503 response for unsupported image model requests.	2026-04-22 20:51:13 +08:00
Luis Pater	4fc2c619fb	feat(models): add Kimi K2.6 model entry to registry JSON	2026-04-21 20:53:03 +08:00
hkfires	d9a3b3e5f3	fix(tests): update model lookup references and enhance Claude executor tests	2026-04-17 08:32:07 +08:00
Luis Pater	5dcca69e8c	feat(models): add Claude Opus 4.7 model entry to registry JSON	2026-04-17 01:08:19 +08:00
Luis Pater	f5dc6483d5	chore: remove iFlow-related modules and dependencies - Deleted `iflow` provider implementation, including thinking configuration (`apply.go`) and authentication modules. - Removed iFlow-specific tests, executors, and helpers across SDK and internal components. - Updated all references to exclude iFlow functionality.	2026-04-17 01:07:12 +08:00
Luis Pater	a4c1e32ff6	chore(models): remove outdated GPT-5 and related model entries from registry JSON	2026-04-15 20:37:32 +08:00
Luis Pater	8fac29631d	chore: remove Qwen support from SDK and internal components - Deleted `QwenAuthenticator`, internal `qwen_auth`, and `qwen_executor` implementations. - Removed all Qwen-related OAuth flows, token handling, and execution logic. - Cleaned up dependencies and references to Qwen across the codebase.	2026-04-15 12:16:08 +08:00
Luis Pater	a824e7cd0b	feat(models): add GPT-5.3, GPT-5.4, and GPT-5.4-mini with enhanced "thinking" levels	2026-04-03 23:05:10 +08:00
hkfires	fee736933b	feat(openai-compat): add per-model thinking support	2026-03-24 14:21:12 +08:00
hkfires	c3d5dbe96f	feat(model_registry): enhance model registration and refresh mechanisms	2026-03-13 10:56:39 +08:00
hkfires	dbd42a42b2	fix(model_updater): clarify log message for model refresh failure	2026-03-12 10:32:04 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
hkfires	e333fbea3d	feat(updater): update StartModelsUpdater to block until models refresh completes	2026-03-10 14:41:58 +08:00
hkfires	efbe36d1d4	feat(updater): change models refresh to one-time fetch on startup	2026-03-10 14:18:54 +08:00
hkfires	30d5c95b26	feat(registry): refresh model catalog from network	2026-03-10 14:02:54 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Luis Pater	631e5c8331	Merge pull request #1922 from shenshuoyaoyouguang/pr/model-registry-safety fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 23:01:42 +08:00
Luis Pater	ca90487a8c	Merge branch 'main' into feature/add-gemini-3.1-flash-image-preview	2026-03-07 22:16:09 +08:00
chujian	3a18f6fcca	fix(registry): clone slice fields in model map output	2026-03-07 18:53:56 +08:00
chujian	099e734a02	fix(registry): always clone available model snapshots	2026-03-07 18:40:02 +08:00
chujian	97ef633c57	fix(registry): address review feedback	2026-03-07 17:36:57 +08:00
chujian	dae8463ba1	fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 16:59:23 +08:00
Frad LEE	a8cbc68c3e	feat(registry): add gemini 3.1 flash lite preview - Add model to GetGeminiModels() - Add model to GetGeminiVertexModels() - Add model to GetGeminiCLIModels() - Add model to GetAIStudioModels() - Add to AntigravityModelConfig with thinking levels - Update gemini-3-flash-preview description Registers the new lightweight Gemini model across all provider endpoints for cost-effective high-volume usage scenarios. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 20:52:28 +08:00
zhongnan.rex	242aecd924	feat(registry): add gemini-3.1-flash-image-preview model definition	2026-03-06 10:50:04 +08:00
Luis Pater	9397f7049f	fix(registry): simplify GPT 5.4 model description in static data	2026-03-06 02:32:56 +08:00
Luis Pater	8822f20d17	feat(registry): add GPT 5.4 model definition to static data	2026-03-06 02:23:53 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
maplelove	f3c164d345	feat(antigravity): update to v1.19.5 with new models and Claude 4-6 migration	2026-02-27 10:34:27 +08:00
maplelove	4040b1e766	Merge remote-tracking branch 'upstream/dev' into dev # Conflicts: # internal/runtime/executor/antigravity_executor.go	2026-02-27 10:29:50 +08:00
huang_usaki	3b4f9f43db	feat(registry): add gemini-3.1-flash-image support	2026-02-27 10:20:46 +08:00
Luis Pater	8c6c90da74	fix(registry): clean up outdated model definitions in static data	2026-02-26 23:12:40 +08:00
maplelove	8f97a5f77c	feat(registry): expose input modalities, token limits, and generation methods for Antigravity models	2026-02-23 13:33:51 +08:00
Luis Pater	713388dd7b	Fixed: #1675 fix(gemini): add model definitions for Gemini 3.1 Pro High and Image	2026-02-23 00:12:57 +08:00
Luis Pater	d210be06c2	fix(gemini): update min Thinking value and add Gemini 3.1 Pro Preview model definition	2026-02-22 21:51:32 +08:00
Luis Pater	081cfe806e	fix(gemini): correct `Created` timestamps for Gemini 3.1 Pro Preview model definitions	2026-02-21 20:47:47 +08:00
hkfires	c1c62a6c04	feat(gemini): add Gemini 3.1 Pro Preview model definitions	2026-02-21 20:42:29 +08:00
apparition	1a0ceda0fc	feat: add Gemini 3.1 Pro Preview model definition	2026-02-19 17:43:08 +08:00
Luis Pater	bb86a0c0c4	feat(logging, executor): add request logging tests and WebSocket-based Codex executor - Introduced unit tests for request logging middleware to enhance coverage. - Added WebSocket-based Codex executor to support Responses API upgrade. - Updated middleware logic to selectively capture request bodies for memory efficiency. - Enhanced Codex configuration handling with new WebSocket attributes.	2026-02-19 01:57:02 +08:00
Luis Pater	46a6782065	refactor(all): replace manual pointer assignments with `new` to enhance code readability and maintainability	2026-02-15 14:10:10 +08:00
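
Commit 8de0885b7d above describes mapping `thinking.type="auto"` to a fixed budget of 128000 with `includeThoughts: true`, which is then capped to the model's limit (the message gives `maxOutputTokens - 1` as an example). The following is a minimal sketch of that idea only, not the repository's code: the type `thinkingConfig` and the functions `thinkingConfigFor` and `clampBudget` are hypothetical names, and the 64000 limit in `main` is an illustrative value.

```go
package main

import "fmt"

// thinkingConfig holds the two fields named in the commit message.
// The Go type and its field names are hypothetical.
type thinkingConfig struct {
	ThinkingBudget  int
	IncludeThoughts bool
}

// autoThinkingBudget is the budget the commit message states for "auto" mode.
const autoThinkingBudget = 128000

// thinkingConfigFor maps a thinking type to a config. Only the "auto"
// branch follows the commit message; "enabled" and "adaptive" are not
// described there, so this sketch reports them as unhandled.
func thinkingConfigFor(thinkingType string) (thinkingConfig, bool) {
	switch thinkingType {
	case "auto":
		return thinkingConfig{ThinkingBudget: autoThinkingBudget, IncludeThoughts: true}, true
	default:
		return thinkingConfig{}, false
	}
}

// clampBudget caps the budget at maxOutputTokens - 1, the example limit
// given in the commit message. It assumes maxOutputTokens is greater than 1.
func clampBudget(cfg thinkingConfig, maxOutputTokens int) thinkingConfig {
	if limit := maxOutputTokens - 1; cfg.ThinkingBudget > limit {
		cfg.ThinkingBudget = limit
	}
	return cfg
}

func main() {
	cfg, ok := thinkingConfigFor("auto")
	if !ok {
		fmt.Println("unhandled thinking type")
		return
	}
	// 64000 is an illustrative model limit, not a value tied to any model here.
	fmt.Printf("%+v\n", clampBudget(cfg, 64000)) // prints {ThinkingBudget:63999 IncludeThoughts:true}
}
```

Keeping the clamp in a separate step mirrors the split the commit message describes, where the translator sets a concrete positive budget and a later normalization pass applies model-specific limits.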


167 Commits