×
Google’s new tool Whisk remixes your existing images with AI
Written by
Published on
Join our daily newsletter for breaking news, product launches and deals, research breakdowns, and other industry-leading AI coverage
Join Now

Google’s latest innovation that allows users to create new images by using existing images as prompts rather than relying solely on text descriptions.

Core functionality: Google’s new AI tool Whisk enables users to generate images by providing visual references for subject matter, scene composition, and artistic style.

  • Users can upload multiple reference images for each category or use Google’s auto-generated suggestions via a dice icon feature
  • The system allows for optional text input to further refine the desired output
  • Each generated image comes with an accompanying text prompt that can be modified for iterations

Technical implementation: Whisk is powered by Google’s latest Imagen 3 image generation model, representing a significant advancement in visual AI technology.

  • The tool prioritizes quick visual exploration over precise image editing capabilities
  • Generation times currently take several seconds per image
  • Users can favorite and download successful results or continue refining through prompt adjustments

Key features and limitations: The platform emphasizes flexibility and iterative improvement while acknowledging potential imperfections.

  • The system allows for both pure visual input and hybrid visual-text prompting
  • Google acknowledges that results may sometimes “miss the mark”
  • Users maintain control through the ability to edit underlying prompts and refine outputs

Broader ecosystem developments: Google’s image generation advancement comes alongside significant progress in video AI technology.

  • The company simultaneously announced Veo 2, its enhanced video generation model
  • Veo 2 boasts improved understanding of cinematographic principles
  • The video technology will initially deploy through VideoFX and later expand to YouTube Shorts and other Google products in 2025
  • The new model claims reduced hallucination of common AI artifacts like extra fingers compared to competitors such as OpenAI’s Sora

Future implications: While Whisk represents an innovative approach to image generation by emphasizing visual rather than textual inputs, its success will likely depend on how effectively it can bridge the gap between user intent and AI output, particularly given the acknowledged imperfections in current results.

Google’s Whisk AI generator will ‘remix’ the pictures you plug in

Recent News

North Korea unveils AI-equipped suicide drones amid deepening Russia ties

North Korea's AI-equipped suicide drones reflect growing technological cooperation with Russia, potentially destabilizing security in an already tense Korean peninsula.

Rookie mistake: Police recruit fired for using ChatGPT on academy essay finds second chance

A promising police career was derailed then revived after an officer's use of AI revealed gaps in how law enforcement is adapting to new technology.

Auburn University launches AI-focused cybersecurity center to counter emerging threats

Auburn's new center brings together experts from multiple disciplines to develop defensive strategies against the rising tide of AI-powered cyber threats affecting 78 percent of security officers surveyed.