Mistral just launched their... | Artificial Analysis OKX Feed

Mistral just launched their new large open weights model, Mistral Large 3 (675B total, 41B active), alongside a set of three Ministral models (3B, 8B, 14B).

Mistral has released Instruct (non-reasoning) variants of all four models, as well as reasoning variants of the three Ministral models. All models support multimodal inputs and are available with an Apache 2.0 license today on @huggingface. We evaluated Mistral Large 3 and the Instruct variants of the three Ministral models prior to launch.

Mistral’s highest scoring model in the Artificial Analysis Intelligence Index remains the proprietary Magistral Medium 1.2, launched a couple of months back in September - this is because reasoning gives models a significant advantage in many of the evals we use. Mistral discloses that a reasoning version of Mistral Large 3 is already in training and we look forward to evaluating it soon!

Key highlights:

➤ Large and small models: at 675B total with 41B active, Mistral Large 3 is Mistral’s first open weights mixture-of-experts model since Mixtral 8x7B and 8x22B in late 2023 to early 2024. The Ministral releases are dense, with 3B, 8B, and 14B parameter variants.
➤ Significant intelligence increase, but not amongst the leading models (including proprietary): Mistral Large 3 is a significant upgrade over the previous Mistral Large 2, with a +11 point increase on the Intelligence Index, up to 38. However, Large 3 still trails leading proprietary reasoning and non-reasoning models.
➤ Versatile small models: the Ministral models are released with Base, Instruct, and Reasoning variant weights - we tested only the Instruct variants ahead of release, which achieved Index scores of 31 (14B), 28 (8B), and 22 (3B). This places Ministral 14B ahead of the previous Mistral Small 3.2 with 40% fewer parameters. We are working on evaluating the reasoning variants and will share their intelligence results soon.
➤ Multimodal capabilities: all models in the release support text and image inputs - this is a significant differentiator for Mistral Large 3, as few open weights models in its size class support image input. Context length also increases to 256k, enabling larger-input tasks.

These new models from Mistral are not a step change from the open weights competition, but they represent a strong performance base with vision capabilities. The Ministral 8B and 14B variants offer particularly compelling performance for their size, and we’re excited to see how the community uses and builds on these models.

At launch, the new models are available for serverless inference on @MistralAI and a range of other providers including @awscloud Bedrock, @Azure AI Foundry, @IBMwatsonx, @FireworksAI_HQ, @togethercompute, and @modal.
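
As a rough sanity check on the figures quoted in the highlights, the short Python sketch below recomputes three derived numbers: the share of Mistral Large 3's parameters that are active per token, the Intelligence Index score implied for Mistral Large 2, and the parameter reduction of Ministral 14B relative to Mistral Small 3.2. The 24B parameter count for Mistral Small 3.2 is not stated in the post and is assumed here; the function names are illustrative only.

```python
# Figures quoted in the post, except where marked as an assumption.
LARGE3_TOTAL_B = 675   # Mistral Large 3 total parameters (billions)
LARGE3_ACTIVE_B = 41   # Mistral Large 3 active parameters (billions)
LARGE3_INDEX = 38      # Mistral Large 3 Intelligence Index score
LARGE3_GAIN = 11       # stated gain over Mistral Large 2 (index points)
MINISTRAL_B = 14       # largest Ministral variant (billions)
SMALL_3_2_B = 24       # ASSUMPTION: Mistral Small 3.2 size, not given in the post


def active_share(active_b: float, total_b: float) -> float:
    """Fraction of a mixture-of-experts model's parameters that are active."""
    return active_b / total_b


def implied_previous_score(new_score: int, gain: int) -> int:
    """Score of the previous model implied by the new score and the stated gain."""
    return new_score - gain


def percent_fewer(smaller_b: float, larger_b: float) -> float:
    """Percentage by which the smaller model has fewer parameters than the larger."""
    return 100 * (larger_b - smaller_b) / larger_b


if __name__ == "__main__":
    share = active_share(LARGE3_ACTIVE_B, LARGE3_TOTAL_B)
    print(f"Active share of Large 3: {share:.1%}")
    print(f"Implied Large 2 score: {implied_previous_score(LARGE3_INDEX, LARGE3_GAIN)}")
    print(f"Ministral 14B vs Small 3.2: {percent_fewer(MINISTRAL_B, SMALL_3_2_B):.1f}% fewer")
```

Under the assumed 24B figure, the reduction works out to roughly 42%, in line with the post's rounded "40% fewer parameters"; only about 6% of Mistral Large 3's parameters are active for any given token.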

