back
Get SIGNAL/NOISE in your inbox daily

Mistral, a well-funded French AI startup, has launched two new large language models (LLMs) based on the Mamba architecture, which aims to improve upon the efficiency of transformer-based models.

Codestral Mamba for developers: Mistral’s code-generating model, Codestral Mamba 7B, offers faster response times and longer input handling compared to other models:

  • The model can handle inputs of up to 256,000 tokens, double that of OpenAI’s GPT-4.
  • In benchmarking tests, Codestral Mamba outperformed rival open-source models like CodeLlama 7B, CodeGemma-1.17B, and DeepSeek in HumanEval tests.
  • Developers can modify and deploy the model from its GitHub repository and through HuggingFace, with an open-source Apache 2.0 license.

Mathstral for math and science: Mistral’s second model, Mathstral 7B, is designed for math-related reasoning and scientific discovery:

  • Developed in collaboration with Project Numina, Mathstral has a 32K context window and will be available under an Apache 2.0 open-source license.
  • The model outperformed other models designed for math reasoning and can achieve better results on benchmarks with more inference-time computations.
  • Users can access Mathstral through Mistral’s la Plataforme API and HuggingFace, using it as-is or fine-tuning the model for specific needs.

Mistral’s growth and competition: The launch of these new models comes as Mistral continues to compete against other AI developers like OpenAI and Anthropic:

  • Mistral recently raised $640 million in Series B funding, bringing its valuation close to $6 billion.
  • The company has also received investments from tech giants like Microsoft and IBM, solidifying its position in the AI market.
  • By offering its models on an open-source basis, Mistral is carving out a unique niche in the competitive landscape of AI development.

Broader implications: The introduction of Codestral Mamba and Mathstral showcases Mistral’s commitment to developing specialized AI models that cater to specific use cases, such as programming and mathematical reasoning. As the company continues to release powerful open-source models, it has the potential to democratize access to advanced AI technologies and drive innovation across various industries. However, the long-term impact of these models and their open-source nature on the competitive dynamics of the AI market remains to be seen.

Recent Stories

Oct 17, 2025

DOE fusion roadmap targets 2030s commercial deployment as AI drives $9B investment

The Department of Energy has released a new roadmap targeting commercial-scale fusion power deployment by the mid-2030s, though the plan lacks specific funding commitments and relies on scientific breakthroughs that have eluded researchers for decades. The strategy emphasizes public-private partnerships and positions AI as both a research tool and motivation for developing fusion energy to meet data centers' growing electricity demands. The big picture: The DOE's roadmap aims to "deliver the public infrastructure that supports the fusion private sector scale up in the 2030s," but acknowledges it cannot commit to specific funding levels and remains subject to Congressional appropriations. Why...

Oct 17, 2025

Tying it all together: Credo’s purple cables power the $4B AI data center boom

Credo, a Silicon Valley semiconductor company specializing in data center cables and chips, has seen its stock price more than double this year to $143.61, following a 245% surge in 2024. The company's signature purple cables, which cost between $300-$500 each, have become essential infrastructure for AI data centers, positioning Credo to capitalize on the trillion-dollar AI infrastructure expansion as hyperscalers like Amazon, Microsoft, and Elon Musk's xAI rapidly build out massive computing facilities. What you should know: Credo's active electrical cables (AECs) are becoming indispensable for connecting the massive GPU clusters required for AI training and inference. The company...

Oct 17, 2025

Vatican launches Latin American AI network for human development

The Vatican hosted a two-day conference bringing together 50 global experts to explore how artificial intelligence can advance peace, social justice, and human development. The event launched the Latin American AI Network for Integral Human Development and established principles for ethical AI governance that prioritize human dignity over technological advancement. What you should know: The Pontifical Academy of Social Sciences, the Vatican's research body for social issues, organized the "Digital Rerum Novarum" conference on October 16-17, combining academic research with practical AI applications. Participants included leading experts from MIT, Microsoft, Columbia University, the UN, and major European institutions. The conference...