Alibaba’s AI coding assistant Qwen2.5-Coder-32B also runs locally on Macs

The rise of locally-run AI coding assistants marks a significant shift in how developers can access powerful language models for programming tasks, with Alibaba’s new Qwen2.5-Coder series emerging as a notable player in this space.

Key capabilities and specifications: Qwen2.5-Coder-32B-Instruct is positioned as a breakthrough among open-source code models, with Alibaba claiming performance comparable to GPT-4o from a comparatively modest 32 billion parameters.

  • The model is Apache 2.0 licensed, making it freely available for both personal and commercial use
  • At 32 billion parameters, it can run on high-end consumer hardware such as an M2 MacBook Pro with 64GB of RAM
  • The quantized version requires approximately 20GB of storage space (a rough sizing sketch follows this list)
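
As a rough sanity check on that figure (a back-of-the-envelope sketch; the effective bits per weight and exact parameter count are assumptions, since the article does not specify the quantization format):

    # Back-of-the-envelope: why a ~32B-parameter model quantized to roughly 4-5
    # bits per weight lands near 20GB. The bits-per-weight value is an assumption
    # (typical of common 4-bit quantization formats), not stated in the article.
    params = 32.5e9           # roughly 32-33 billion parameters
    bits_per_weight = 4.7     # assumed effective bits per weight after quantization
    size_gb = params * bits_per_weight / 8 / 1e9
    print(f"approximate quantized size: {size_gb:.1f} GB")  # ~19 GB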

Performance benchmarks: Independent testing validates Qwen’s claims of competitive performance against industry leaders.

  • Paul Gauthier’s Aider benchmarks place Qwen2.5-Coder-32B at 74% accuracy, positioning it between GPT-4o (71%) and Claude 3.5 Haiku (75%)
  • The model matches GPT-4o on the “diff” editing benchmark while trailing Claude 3.5 Haiku slightly
  • The smaller 14B and 7B variants achieved respectable scores of 69% and 58% respectively

Technical implementation: macOS users have several options for running the model locally.

  • Ollama integration provides a straightforward installation process using a single pull command (a sketch of this workflow follows the list)
  • MLX implementation leverages Apple Silicon’s capabilities for improved performance
  • The model can be accessed through various interfaces, including command-line tools and programming libraries
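
A minimal sketch of the Ollama route, assuming the qwen2.5-coder:32b tag in the Ollama model registry and the ollama Python client (pip install ollama); both names should be verified against the registry and the client documentation:

    # One-time download from the command line (assumed tag name):
    #   ollama pull qwen2.5-coder:32b
    #
    # Then query the locally running model from Python via the ollama client.
    import ollama

    response = ollama.chat(
        model="qwen2.5-coder:32b",
        messages=[{"role": "user",
                   "content": "Write a Python function that performs binary search on a sorted list."}],
    )
    print(response["message"]["content"])

Because Ollama serves the model over a local HTTP API, the same setup can also be wired into editors and other command-line tools rather than called from Python directly.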

Practical applications: Real-world testing demonstrates the model’s capability to handle diverse programming tasks.

  • Successfully generates functional code for database operations and CSV handling (see the example after this list)
  • Creates complex visualizations, including terminal-based fractals
  • Maintains competitive response quality compared to cloud-based alternatives
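
The MLX route mentioned above can be exercised with exactly this kind of task. A rough sketch assuming the mlx-lm package and a quantized mlx-community checkpoint (the repository name and API details are assumptions; check the mlx-lm documentation):

    # Sketch of the MLX route on Apple Silicon: ask the model for the sort of
    # CSV-to-SQLite helper described above. Repo name and API are assumptions.
    # pip install mlx-lm
    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Qwen2.5-Coder-32B-Instruct-4bit")

    messages = [{"role": "user",
                 "content": "Write a Python function that loads a CSV file into a SQLite table."}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)

    print(generate(model, tokenizer, prompt=prompt, max_tokens=512))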

Looking ahead: Qwen2.5-Coder-32B represents a significant milestone in locally-run AI coding assistants, potentially reducing dependency on cloud-based services while maintaining professional-grade capabilities. Its ability to run on high-end consumer hardware while matching the performance of larger models suggests a promising direction for accessible AI development tools.

Source: “Qwen2.5-Coder-32B is an LLM that can code well that runs on my Mac” (Simon Willison)
