Flux by Black Forest Labs sets a new standard for open-source text-to-image models with its impressive 12B parameters and Midjourney-like aesthetics.
Introducing Flux: Black Forest Labs, the team behind Stable Diffusion, has released Flux, the largest state-of-the-art open-source text-to-image model to date:
- Flux pushes the boundaries of creativity and performance, delivering high-quality visuals reminiscent of Midjourney.
- The model boasts an impressive 12B parameters, setting a new standard for open-source text-to-image models.
Three Variations of Flux: BFL has released three versions of the model, each catering to different user needs and licensing requirements:
- FLUX.1 [dev]: The base model, open-sourced with a non-commercial license for the community to build upon.
- FLUX.1 [schnell]: A distilled version of the base model that operates up to 10 times faster, released under the Apache 2 License.
- FLUX.1 [pro]: A closed-source version only available through API, designed for commercial use.
Integration with fal’s Inference Engine: Flux’s partnership with fal takes performance to the next level:
- By leveraging fal’s cutting-edge inference engine, Flux models can run up to 2x faster than with eager torch.
- This integration results in faster processing times without compromising the exceptional quality and detail of the generated images.
Key Features of Flux: The model offers several enhancements that set it apart from its predecessors:
- Enhanced Image Quality: Flux generates stunning visuals at higher resolutions, elevating the overall visual appeal of the generated images.
- Advanced Human Anatomy and Photorealism: The model achieves highly realistic and anatomically accurate images, particularly when it comes to human subjects.
- Improved Prompt Adherence: Flux delivers more accurate and relevant images based on user inputs, ensuring a better alignment between prompts and generated visuals.
- Exceptional Speed: The Flux Schnell variant offers unparalleled speed and efficiency, making it ideal for high-demand applications.
A New Era for Text-to-Image Generation: Flux’s release marks a significant milestone in the development of open-source text-to-image models:
- By combining state-of-the-art performance, enhanced aesthetics, and faster processing times, Flux sets a new standard for the industry.
- The availability of different versions caters to a wide range of users, from community developers to commercial enterprises, ensuring accessibility and flexibility.
- As Black Forest Labs continues to innovate and refine Flux, we can expect even more impressive advancements in the field of text-to-image generation, empowering creators and developers alike.
Flux: Open-source text-to-image model with 12B parameters