Alibaba's AI coding assistant Qwen2.5-Coder-32B also runs locally on Macs

Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage

Join Now

The rise of locally-run AI coding assistants marks a significant shift in how developers can access powerful language models for programming tasks, with Alibaba’s new Qwen2.5-Coder series emerging as a notable player in this space.

Key capabilities and specifications: Qwen2.5-Coder-32B-Instruct represents a breakthrough in open-source code models, claiming performance comparable to GPT-4o while maintaining a relatively modest size of 32B parameters.

The model is Apache 2.0 licensed, making it freely available for both personal and commercial use
With a 32B parameter size, it can run on high-end consumer hardware like a 64GB MacBook Pro M2
The quantized version requires approximately 20GB of storage space

Performance benchmarks: Independent testing validates Qwen’s claims of competitive performance against industry leaders.

Paul Gauthier’s Aider benchmarks place Qwen2.5-Coder-32B at 74% accuracy, positioning it between GPT-4o (71%) and Claude 3.5 Haiku (75%)
The model matches GPT-4o in “diff” benchmark scores, though slightly trailing Claude 3.5 Haiku
The smaller 14B and 7B variants achieved respectable scores of 69% and 58% respectively

Technical implementation: The model offers multiple deployment options for MacOS users.

Ollama integration provides a straightforward installation process using a simple pull command
MLX implementation leverages Apple Silicon’s capabilities for improved performance
The model can be accessed through various interfaces, including command-line tools and programming libraries

Practical applications: Real-world testing demonstrates the model’s capability to handle diverse programming tasks.

Successfully generates functional code for database operations and CSV handling
Creates complex visualizations, including terminal-based fractals
Maintains competitive response quality compared to cloud-based alternatives

Looking ahead: Qwen2.5-Coder-32B represents a significant milestone in locally-run AI coding assistants, potentially reducing dependency on cloud-based services while maintaining professional-grade capabilities. Its ability to run on high-end consumer hardware while matching the performance of larger models suggests a promising direction for accessible AI development tools.

Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac

simonwillison

Menu

Alibaba’s AI coding assistant Qwen2.5-Coder-32B also runs locally on Macs

Recent News

Adnoc partners with US robotics startup to deploy AI across oil operations

6 places where Google’s Gemini AI should be but isn’t

How to protect your portfolio from a potential AI bubble burst

Join the revolution

CO/AI

Resources

Join the revolution

Menu

Welcome

Alibaba’s AI coding assistant Qwen2.5-Coder-32B also runs locally on Macs

Recent News

Adnoc partners with US robotics startup to deploy AI across oil operations

6 places where Google’s Gemini AI should be but isn’t

How to protect your portfolio from a potential AI bubble burst

Join the revolution

CO/AI

Resources

Join the revolution