-
Notifications
You must be signed in to change notification settings - Fork 641
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Intra-vendor model routing feature enhancements #12948
Closed
Closed
Changes from all commits
Commits
Show all changes
35 commits
Select commit
Hold shift + click to select a range
1898895
Fix endpoint get showing api key in plain text issue
ashera96 e1a90ec
Fix default vendors to have requestModel and remainingTokenCount fiel…
ashera96 0739029
Merge pull request #12947 from ashera96/intra-vendor-model-routing-po…
ashera96 53ae427
Fix EndpointDTO
PasanT9 e59b722
Fix parse errors due to custom parameter
PasanT9 1224219
Merge pull request #12950 from PasanT9/450-post-alpha-v1
PasanT9 860ebcb
Commit Design Assistant related generated files
ashera96 8c6548d
Fix apiKeyValue missing issue in synapse artifact
ashera96 ea03e28
Merge pull request #12964 from ashera96/intra-vendor-model-routing-po…
ashera96 3e6ba00
Handle default endpoint PUT DELETE calls
ashera96 77bd0f0
Handle deploymentStage invalid scenario
ashera96 2a34a96
Merge pull request #12967 from ashera96/intra-vendor-model-routing-po…
ashera96 53468d5
Improve error handling and set primary endpoint IDs on AI API creatio…
ashera96 8b8c158
Merge pull request #12972 from ashera96/intra-vendor-model-routing-po…
ashera96 d6394cd
Add failover policy for AI APIs
PasanT9 7bb0e0d
Refactor AI Gateway implementation
PasanT9 cdbb830
Remove AI API Endpoint sequences during API undeploy
PasanT9 fd95bdc
Refacter AI API failover implementation
PasanT9 fe27197
Add endpoint URL validation
ashera96 2cc9596
Merge pull request #12974 from ashera96/intra-vendor-model-routing-po…
ashera96 acc22ab
Merge pull request #12975 from PasanT9/post-alpha-v5
PasanT9 fc209a6
Add unique constraint on API_UUID, ENDPOINT_NAME, REVISION_UUID, and …
ashera96 b9dc2a9
Merge pull request #12977 from ashera96/intra-vendor-model-routing-po…
ashera96 4a403a0
Add missing java doc comments
PasanT9 e434de1
Merge pull request #12979 from PasanT9/post-alpha-v8
PasanT9 a836c28
Handle fail and exception scenarios
PasanT9 68be6e7
Merge pull request #12981 from PasanT9/post-alpha-v8
PasanT9 5f18fcb
Add API endpoint deletion on API deletion
ashera96 d5073d3
Fix review comments
PasanT9 d27bccb
Merge pull request #12983 from PasanT9/post-alpha-v8
PasanT9 85f58ca
Onboard models for default AI/LLM vendors
ashera96 066a2a3
Fix endpoint deletion failure when either production or sandbox don't…
ashera96 20d1f85
Merge pull request #12984 from ashera96/intra-vendor-model-routing-po…
ashera96 71e0777
Fix policy invoking when only single deployment stage is defined
PasanT9 8b0ed83
Merge pull request #12985 from PasanT9/post-alpha-v6
PasanT9 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
💡 Verification agent
🧩 Analysis chain
Verify that the default models are up-to-date.
Please ensure that these are the latest supported models from Mistral AI.
🌐 Web query:
💡 Result:
As of February 2025, Mistral AI's latest models include several advanced offerings across different categories:
Core Models
Mistral Saba 25.02
Released February 17, this model specializes in languages from the Middle East and South Asia while maintaining strong multilingual capabilities[3][4]. It balances performance with efficiency for regional applications.
Mistral Small v3 (25.01)
A January release featuring:
Specialized Models
Codestral 25.01
January update to code-generation model:
Pixtral Large (24.11)
Multimodal November release combining:
Edge-Optimized Models
October releases for local device deployment:
Earlier November releases like Mistral Large24.11 remain current with improved reasoning/function calling[2], while foundational open-source models like Mathstral7B continue receiving updates[3].
These models reflect Mistral's focus on both frontier capabilities (through Large/Saba) and practical deployment efficiency (Small/Ministerial lines)[6][9].
Citations:
Update Default Model List to Reflect Latest Mistral AI Models
The current default list in
MistralAiLLMProviderService.java
(lines 92-97) contains:"mistral-small-latest"
"mistral-medium"
"open-mistral-7b"
However, as of February 2025, the latest Mistral AI models include offerings such as Mistral Small v3 (25.01) and Mistral Saba 25.02, with no direct reference to a "medium" model or
"open-mistral-7b"
. Please review and update the default model list to align with the current model naming and versions provided by Mistral AI.