Google’s Gemini AI to receive major image generation upgrade

Google Gemini’s image generation set for a major upgrade: Google is working to improve Gemini’s image generation, addressing a key limitation of the current system.

  • Gemini’s image generation, powered by Imagen 3, currently only produces square images with a 1:1 aspect ratio.
  • Users have expressed frustration with this lack of flexibility, since cropping a finished image is currently the only way to change its shape.

Upcoming feature revealed in beta code: Android Authority has uncovered evidence of new image resizing options in a recent Google app beta.

  • Code from the 15.41.34.29.arm64 beta version of the Google app suggests that users will soon be able to select from various image sizes.
  • The new feature is expected to include common aspect ratios such as 16:9, providing greater versatility in image creation.

Advanced resizing capabilities: The planned update goes beyond simple cropping, offering a more sophisticated approach to image manipulation.

  • When resizing an image, Gemini will reportedly generate additional content to fill the new space rather than merely stretching or cropping the existing image.
  • Users will reportedly be able to resize an image multiple times while keeping the expanded environment, with only the main subject changing in size.
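
The difference between cropping and generating extra content is easiest to see as pixel arithmetic. The sketch below is purely illustrative: the function name `expand_canvas` and the 1080-pixel source size are assumptions, not anything taken from the Google app code. It only computes how much new area would have to be filled when a square image is widened to another aspect ratio, and says nothing about how Gemini produces that content.

```python
from fractions import Fraction
from math import ceil


def expand_canvas(width, height, ratio_w, ratio_h):
    """Return (new_width, new_height, extra_width, extra_height).

    Hypothetical helper: finds the smallest canvas with aspect ratio
    ratio_w:ratio_h that still contains the whole original image, so
    nothing is cropped and the extra area is what would need new content.
    """
    target = Fraction(ratio_w, ratio_h)
    current = Fraction(width, height)

    if target >= current:
        # Target is wider than (or equal to) the source: keep the height
        # and grow the width.
        new_width, new_height = ceil(height * target), height
    else:
        # Target is taller than the source: keep the width and grow the height.
        new_width, new_height = width, ceil(width / target)

    return new_width, new_height, new_width - width, new_height - height


# A 1080x1080 square widened to 16:9 needs 840 extra pixel columns.
print(expand_canvas(1080, 1080, 16, 9))
```

Cropping the same square to 16:9 would instead discard rows from the top and bottom, which is why an approach that fills new space preserves more of the original image.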

Current limitations and premium features: Gemini’s image generation capabilities come with some restrictions and premium options.

  • The basic version of Imagen, available for free with Google Gemini, cannot create images of humans, portraits, or faces.
  • Generating images of people requires a Gemini Advanced subscription, which offers a one-month free trial.
  • Gemini saves generated images in JPG format, distinguishing it from competitors like ChatGPT.

Recent developments in Gemini ecosystem: Google has been actively expanding and refining its Gemini AI offerings.

  • Gemini Live, a feature enabling continuous conversations with users, has been rolled out to compatible Android devices.
  • The Gemini app has undergone a visual overhaul, resulting in a more streamlined user experience.

Implications for user experience: The addition of flexible image sizing options is expected to significantly enhance Gemini’s appeal and functionality.

  • This update addresses a major pain point for users, potentially making Gemini a more competitive option in the AI image generation market.
  • The improvement aligns with Google’s efforts to create a more versatile and user-friendly AI assistant, particularly for Android device users.

Looking ahead: The discovered code suggests these features are in development, but their final implementation and release date remain uncertain.

  • The ability to generate images in various aspect ratios could make Gemini more attractive for content creators and marketers who require specific image dimensions for different platforms.
  • This update may also signal Google’s commitment to continually improving its AI offerings to stay competitive in the rapidly evolving field of generative AI.
