×
DeepSeek’s new image generator is another win for cost-effective AI
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

DeepSeek, a Chinese AI startup, has released Janus-Pro, a new open-source text-to-image AI model that claims to outperform established competitors like Stable Diffusion and DALL-E.

Key Features and Capabilities: The Janus-Pro model family ranges from 1 billion to 7 billion parameters and operates using an autoregressive framework for image generation and analysis.

  • The model is available under an MIT license, making it suitable for commercial use
  • Users can download Janus-Pro through HuggingFace and GitHub platforms
  • Smaller versions of the model are limited to analyzing images at 384 x 384 resolution

Performance and Benchmarks: DeepSeek’s internal testing shows promising results for their new image generation model.

  • Janus-Pro-7B reportedly outperforms Stable Diffusion and DALL-E on GenEval and DPG-Bench benchmarks
  • Nvidia has publicly praised the model as “an excellent AI advancement
  • Early user impressions are mixed but generally positive, though more widespread testing is needed

Cost and Efficiency Advantages: The model represents a potential shift in AI development economics.

  • DeepSeek’s training costs are reportedly lower than those of US-based AI companies
  • Initial reports suggest more energy-efficient operation compared to Western counterparts
  • This efficiency could challenge the necessity of large-scale initiatives like the $500 billion Stargate project

Market Impact: DeepSeek continues to gain momentum in the AI space.

  • The company recently topped ChatGPT in App Store downloads
  • The release builds upon their previous Janus model
  • The open-source nature of the model could accelerate AI development and adoption

Strategic Implications: The success of DeepSeek’s more efficient, cost-effective approach to AI development raises questions about the future direction of AI infrastructure investments and could reshape industry assumptions about the resources required for competitive AI development.

DeepSeek's new image model looks like another win for cheaper AI

Recent News

SoftBank teams with Quantinuum to advance quantum computing applications

Early quantum-classical hybrid systems will be tested in Japan to tackle network optimization and materials science challenges that exceed classical computing limits.

Ex-Google, Apple engineers unveil Oumi AI, a truly open-source AI development platform

Former Google engineers release fully transparent AI development tools that allow unrestricted access to model code and training data.

Google thwarts hacker group using Gemini to breach accounts

State-sponsored hacking groups from Iran, China, and North Korea attempted to use Gemini for cyberattacks but were blocked by Google's security controls.