Mistral AI shocks the AI world with the release of Mistral Large 2, a powerful open-source model that takes on Meta’s Llama 3.1 in the rapidly evolving AI landscape.
Key details of Mistral Large 2 release: Mistral’s new flagship model boasts 123 billion parameters and is licensed as open-source for non-commercial research use, while commercial applications require a separate license:
- The model offers advanced multilingual capabilities, supporting dozens of languages including English, French, Spanish, German, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean.
- Mistral Large 2 delivers strong performance in reasoning, code generation, and mathematics, making it ideal for tasks that require large reasoning capabilities or are highly specialized.
- The model is designed for single-node inference with long-context applications in mind, allowing it to run at large throughput on a single node.
Impressive performance benchmarks: Mistral Large 2 demonstrates competitive performance against leading models like GPT-4o, Llama 3.1-405, and Anthropic’s Claude 3.5 Sonnet across various benchmarks:
- On the Multilingual MMLU benchmark, it performed on par with Meta’s Llama 3.1-405B while delivering significant cost benefits due to its smaller size.
- The model excels in code generation tasks, outperforming Claude 3.5 Sonnet and Claude 3 Opus on HumanEval and HumanEval Plus benchmarks.
- It also grabs the second spot across Mathematics-focused benchmarks like GSM8K and Math Instruct.
Enterprise-focused enhancements: Mistral has made key improvements to cater to enterprise AI adoption:
- The model has been fine-tuned to minimize hallucinations and be more cautious and selective when responding, ensuring transparency when it lacks sufficient information to answer.
- Instruction-following capabilities have been enhanced, making the model better at following user guidelines and handling long multi-turn conversations.
- Mistral Large 2 has been tuned to provide succinct and to-the-point answers wherever possible, which can be beneficial in enterprise settings.
Broader context and implications: Mistral’s release of Mistral Large 2 is not an isolated move but part of the company’s aggressive strategy in the AI domain:
- The startup has been raising large funding rounds, launching task-specific models, and partnering with industry giants to expand its reach.
- The release of Mistral Large 2 closely follows Meta’s launch of its open-source Llama 3.1 model, highlighting the intensifying competition in the AI race.
- With its impressive performance and enterprise-focused enhancements, Mistral Large 2 has the potential to shake up the AI landscape and provide a compelling alternative to leading closed-source models.
As the AI race continues to accelerate, Mistral’s release of Mistral Large 2 demonstrates the growing importance of open-source models in driving innovation and competition. The model’s advanced capabilities, strong performance benchmarks, and enterprise-focused improvements position it as a serious contender in the evolving AI landscape. However, it remains to be seen how it will fare against other leading models in real-world applications and how the licensing model will impact its adoption. Nonetheless, Mistral’s aggressive moves in the AI domain suggest that the company is poised to play a significant role in shaping the future of AI.
Mistral shocks with new open model Mistral Large 2, taking on Llama 3.1