Midjourney expands into AI creative writing with new diversity techniques

Midjourney’s AI research is expanding beyond image generation into creative text generation, tackling a known weakness of LLM outputs: their sameness. The company has partnered with NYU researchers to develop new techniques that increase the diversity of AI-generated text while maintaining quality, a step toward making AI writing more imaginative and less predictable.

The big picture: Midjourney, with its massive user base of nearly 20 million on Discord alone, is venturing beyond its core image generation business by tackling limitations in AI-generated creative writing.

  • The company collaborated with NYU researchers to develop two new techniques called Diversified Direct Preference Optimization (DDPO) and Diversified Odds Ratio Preference Optimization (DORPO).
  • These methods specifically address the tendency of large language models to produce homogeneous, predictable text when asked to generate creative content.

The technical innovation: The research introduces “deviation” as a training metric to guide language models toward producing more varied and creative outputs.

  • Unlike existing diversity-promoting techniques that operate only at inference time, these new methods integrate diversity considerations directly into the model’s training process.
  • The researchers published their findings in a paper on Hugging Face, complete with implementation details available on GitHub.
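The article does not spell out how "deviation" enters training, so the following is only a minimal sketch under stated assumptions: deviation is taken to be the average dissimilarity between one response and the other responses to the same prompt, and it is applied as a per-example weight on a standard DPO loss. The Jaccard word-set distance and the function names (`deviation`, `deviation_weighted_dpo_loss`) are illustrative stand-ins and are not taken from the paper or its GitHub code.

```python
import math


def jaccard_distance(a: str, b: str) -> float:
    """Word-set dissimilarity in [0, 1]; an assumed stand-in for a real text distance."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 0.0
    return 1.0 - len(ta & tb) / len(ta | tb)


def deviation(index: int, responses: list[str]) -> float:
    """Assumed definition: mean distance from one response to all others for the same prompt."""
    others = [r for i, r in enumerate(responses) if i != index]
    if not others:
        return 0.0
    target = responses[index]
    return sum(jaccard_distance(target, r) for r in others) / len(others)


def dpo_loss(logp_w: float, logp_l: float,
             ref_logp_w: float, ref_logp_l: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for one (chosen, rejected) pair: -log(sigmoid(margin))."""
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    # Numerically stable form of log(1 + exp(-margin)).
    if margin < 0:
        return -margin + math.log1p(math.exp(margin))
    return math.log1p(math.exp(-margin))


def deviation_weighted_dpo_loss(logp_w: float, logp_l: float,
                                ref_logp_w: float, ref_logp_l: float,
                                chosen_deviation: float,
                                beta: float = 0.1) -> float:
    """Assumption: pairs whose chosen response is more unusual get a larger weight."""
    return chosen_deviation * dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta)


if __name__ == "__main__":
    responses = ["the cat sat", "the cat sat", "a dragon sang opera"]
    for i, text in enumerate(responses):
        print(f"{text!r}: deviation={deviation(i, responses):.2f}")
```

Under these assumptions, a chosen response that closely resembles its siblings contributes little to the gradient, while a rare but still preferred response contributes more, which is one way a diversity signal could be built into training rather than applied at inference time.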

Key findings: Tests showed that DDPO significantly outperformed standard approaches in balancing quality with diversity.

  • Llama-3.1-8B models trained with DDPO achieved the best balance of output quality and creativity.
  • Remarkably, even with reduced dataset sizes, DDPO-trained models maintained their ability to generate diverse outputs.

Why this matters: Creative writing fundamentally differs from factual or code generation tasks because it benefits from multiple valid approaches rather than a single correct answer.

  • Standard LLM training methods often prioritize user preference over originality, resulting in safe but repetitive content.
  • By addressing this limitation, Midjourney’s research could lead to AI systems that better support human creativity rather than merely producing predictable, templated content.

Behind the numbers: Midjourney’s expansion into LLM research comes after the company revealed plans to develop its own computing and AI hardware in late summer 2024.

  • With “nearly 20 million users on its Discord channel” according to third-party trackers, Midjourney has the scale and user base to potentially apply these creative writing improvements across multiple generative AI domains.
Midjourney’s surprise: new research on making LLMs write more creatively
