Google - CO/AI

News/Google

Dec 11, 2024

Google unveils Gemini 2 and AI agents for personal assistance

Google has announced Gemini 2, representing a significant advancement in AI capabilities and human-computer interaction. Core capabilities and improvements: Gemini 2 represents a substantial upgrade to Google's flagship AI model, with enhanced abilities to handle complex tasks across multiple domains. The new model demonstrates advanced "multimodal" capabilities, processing and interpreting video, audio, and speech with greater sophistication Gemini 2 can effectively plan and execute computer-based tasks while engaging in natural conversation The system shows marked improvement in understanding and interacting with the physical world through various sensors and inputs Specialized AI applications: Google is launching purpose-built AI agents to showcase...

read Dec 10, 2024

Perplexity isn’t just competing with Google Search, it’s winning

The competitive landscape: Perplexity has emerged as a formidable challenger to Google's search dominance by offering a fundamentally different approach to information retrieval. Unlike Google's traditional search results, Perplexity provides concise, cited answers that eliminate the need to browse multiple pages The platform reached 10 million active monthly users and experiences 20% monthly growth since its launch in August 2022 Perplexity's new shopping feature delivers targeted product recommendations without requiring users to sift through numerous results and advertisements Google's strategic dilemma: The tech giant faces a challenging predicament as it attempts to compete with AI-powered search while preserving its core...

read Dec 9, 2024

Pixel Studio 1.4 adds AI sticker creator and Gboard integration

The latest update to Google's Pixel Studio app brings expanded creative capabilities with new AI-powered features and improved integration with other Google services. Key Features and Updates; Version 1.4 of Pixel Studio introduces significant enhancements to the app's sticker creation and sharing capabilities. Users can now generate stickers directly within the app using AI technology by either describing their desired sticker or creating one from an existing photo The update integrates Pixel Studio with Gboard, Google's keyboard app, allowing users to access and share stickers across any messaging application A new dedicated "Stickers" tab has been added to the "My...

read Dec 9, 2024

Gemini is getting close to allowing for multilingual voice interactions

AI and technology giant Google is expanding the multilingual capabilities of its Gemini AI assistant with forthcoming support for bilingual conversations. Current status and functionality: Google's latest development enables users to engage with Gemini Live in two different languages during the same conversation, though the feature is not yet widely available. The functionality is specifically designed for speech-based interactions Support documentation indicates compatibility with over 30 languages Despite being documented in Google's support pages, the feature remains inaccessible to most users Technical implementation: Recent analysis of Google's beta app reveals progress toward implementing this multilingual capability. Evidence of the feature...

read Dec 9, 2024

Google’s AI Mode for Search coming soon, APK teardown reveals

A new AI Mode for Google Search is in development, signaling the company's latest effort to integrate conversational AI capabilities directly into its core search experience. Key developments: Google is implementing an AI Mode button within its search interface and widget that will enable users to access new AI-powered search features. The feature was discovered in the Google app v15.49.40 beta through code analysis A dedicated AI Mode button will appear in both the main Google Search interface and the Search widget Currently, attempting to use the feature results in app crashes as it remains under development Technical implementation: The...

read Dec 9, 2024

Gmail uses AI to summarize your emails, but should it?

Core functionality: Google has integrated Gemini AI into Gmail's web and mobile interfaces, providing email summarization capabilities and advanced search features for subscribers. The feature is currently available only to Google One AI Premium ($20/month) subscribers and Google Workspace users Users can access Gemini through a distinctive black star-shaped logo in the Gmail interface The tool can summarize individual emails, threads, or specific time periods of correspondence Key features and operations: Gemini offers multiple ways to interact with and analyze email content, providing flexible summarization options across different timeframes. Users can request summaries through the Gemini button or specific commands like...

read Dec 9, 2024

DeepMind’s GenCast can accurately predict weather 2 weeks in advance

The development of Google DeepMind's GenCast marks a significant advancement in weather forecasting technology, potentially offering unprecedented accuracy for predictions up to 15 days ahead. The breakthrough technology: GenCast represents a new approach to weather forecasting that combines machine learning with traditional meteorological data to generate highly accurate predictions. The AI model has been trained on an extensive dataset spanning 39 years (1979-2018) of global weather information GenCast achieves remarkable accuracy rates, surpassing current forecasting standards in 97.2% of cases For predictions beyond 36 hours, the model's accuracy increases to 99.8% The system can process and deliver a 15-day forecast...

read Dec 7, 2024

Google’s new AI model creates interactive worlds, but not everyone’s impressed

The advancement of AI-generated interactive environments has reached a new milestone with Google's Genie 2, which transforms static images and text descriptions into explorable 3D worlds, marking a significant evolution from its 2D predecessor launched just nine months ago. Core capabilities and advancements: Google's DeepMind team has expanded their AI model to generate interactive 3D environments from simple inputs, complete with controllable avatars and basic physics interactions. The model can create diverse interactive scenes featuring wooden puppets, robots, and vessels, capable of performing basic actions like popping balloons and climbing ladders Genie 2's most notable feature is its "long horizon...

read Dec 6, 2024

Google Search may see radical transformation in 2025

Artificial Intelligence is set to dramatically transform Google Search in 2025, according to recent statements from Google CEO Sundar Pichai that signal major changes ahead for the world's dominant search engine. Major announcement details: During The New York Times' Dealbook Summit, Pichai revealed that Google Search will undergo significant changes by early 2025. The CEO promised surprising new search capabilities that will differentiate it from current functionality AI integration will be central to these transformative changes Google has already begun incorporating AI features through AI Overviews, despite some initial accuracy issues Competitive landscape: Google maintains a confident stance about its...

read Dec 6, 2024

Google Home tests AI-powered custom automation creation

Google Home's latest feature integrates AI technology to simplify smart home automation through natural language commands. New Feature Overview: Google Home is testing a Gemini-powered "Help me create" feature in its Android app that enables users to generate custom automations through conversational inputs. The feature allows users to describe desired automations in natural language, such as "Turn on all lights at sunset" or "Help me sleep better" This functionality builds upon the existing script editor's "Help me script" capability, making it more accessible to everyday users The feature is currently being tested in US English and requires users to opt...

read Dec 5, 2024

Pixel phones get new AI features in big December update

The latest Pixel feature drop brings significant AI-powered updates to Google's smartphone lineup, focusing on enhanced accessibility, smarter interactions, and improved everyday functionality. Core AI enhancements: Gemini, Google's advanced AI model, takes center stage in this December update with expanded capabilities and personalization features. Users can now teach Gemini their preferences and interests through "Gemini Saved Info," allowing for more personalized responses based on stored information The AI assistant extends its reach across more Google apps, including Spotify integration for music control and smart home device management Gemini Nano powers new contextual responses in call screening, making it easier to...

read Dec 5, 2024

Google just brought Gemini AI to Chrome’s address bar

The integration of Google's Gemini AI into everyday tools marks a significant expansion of the tech giant's AI strategy, focusing on accessibility and user convenience across its ecosystem. Key developments: Google has introduced two notable updates to its Gemini AI assistant, making it more accessible through Chrome's address bar and expanding mobile functionality. Users can now access Gemini directly from Chrome's address bar by typing "@gemini" followed by their query The feature redirects users to the Gemini homepage while maintaining the same browser window Mobile users with Gemini Advanced subscriptions can now upload up to 10 files (maximum 100MB) directly...

read Dec 5, 2024

Google launches PaliGemma 2 vision language models

Google's latest contribution to the field of artificial intelligence combines advanced vision and language capabilities in a powerful new model called PaliGemma 2, representing a significant step forward in multimodal AI technology. Core architecture and capabilities; PaliGemma 2 integrates SigLIP for visual processing with Gemma 2 for text generation, creating a versatile vision-language model that can handle multiple image resolutions and text-based tasks. The model comes in three sizes: 3B, 8B, and 28B parameters, offering flexibility for different computational needs and use cases Supported image resolutions range from 224x224 to 896x896, enabling analysis of both standard and high-resolution images The...

read Dec 5, 2024

Google’s new Expressive Captions feature now detects emotional context

Live Captions, a Google Android feature introduced in 2019 that generates real-time captions for any device audio, is receiving a significant upgrade to better capture the emotional context of speech and sounds. Major upgrade details: Google is rolling out Expressive Captions, an AI-powered enhancement to Live Caption that recognizes and visually represents tone, volume, and ambient sounds in captioned text. The new feature is exclusively available in the United States for English language content on devices running Android 14 and above Expressive Captions processes all data locally on the device, allowing it to function even in airplane mode The feature...

read Dec 5, 2024

DeepMind’s Genie 2 AI creates self-building video games

The ability to transform static images into interactive 3D environments represents a significant advancement in AI technology, with implications extending far beyond gaming into AI training and virtual world creation. Core innovation: DeepMind's Genie 2 system can generate playable 3D worlds from single images, marking a significant leap forward in AI-generated content and virtual environment creation. The system uses an autoregressive latent diffusion model to create interactive environments that respond to user actions in real-time Generated worlds maintain consistency in physics, lighting, and object permanence for up to one minute The technology allows for instant transformation of conceptual images into...

read Dec 5, 2024

Pixel 9 AI could automatically create to-do lists from phone calls

Google is reportedly developing new AI-powered features for its upcoming Pixel 9 series that could automatically generate actionable to-do lists from phone conversations. New feature overview: Google is developing a "Call To-do List" feature for the Google Phone app that builds upon the existing Call Notes functionality in the Pixel 9 series. The feature would use Gemini Nano AI to analyze phone conversations and automatically generate to-do lists based on the discussion content Users will be able to access, copy, and share these AI-generated lists directly from their call logs Like the current Call Notes feature, the to-do list generation...

read Dec 5, 2024

Google is reportedly testing a new conversational AI search interface

A new AI-powered conversational search experience appears to be in development at Google, with code discoveries suggesting significant updates to how users interact with Google Search on Android devices. Key features in development: Google is working on an "AI Mode" (also called "AIM") for its Android Search app that will introduce new voice and visual interaction capabilities. The feature will include voice input via a microphone button for spoken queries Users will be able to take new photos or upload existing images as part of their searches The system will support follow-up questions and allow users to interrupt responses mid-stream...

read Dec 4, 2024

Google just lost the leader of its hit NotebookLM product

AI industry leaders continue to shift between companies and launch new ventures, with the latest movement coming from Google's NotebookLM team. Key departures: Three key members of Google's NotebookLM team, including senior product manager Raiza Martin, designer Jason Spielman, and engineer Stephen Hughes, have announced their departure from the company. The team members are planning to collaborate on a new project, though specifics about their venture have not been disclosed Their exit represents another significant talent movement in the competitive AI technology sector Project background: NotebookLM, Google's AI-powered note-taking application, gained recognition for its innovative features and integration with major...

read Dec 4, 2024

Gemini’s rumored ‘I’m feeling lucky’ button may make AI interactions more engaging

Advances in conversational AI are driving Google to implement innovative features for increasing user engagement with its Gemini AI assistant. Latest developments: Google plans to introduce a new "I'm Feeling Lucky" button for its Gemini AI assistant, reminiscent of the classic Google Search feature from the 1990s. The button will generate random prompts for users to interact with Gemini, ranging from practical reminders to creative suggestions The feature was discovered in unreleased code by Android Authority Additional random conversation suggestions will appear alongside the permanent lucky button Feature functionality: The new implementation aims to showcase Gemini's capabilities while making AI...

read Dec 4, 2024

Google’s Genie 2 turns text into playable 3D games

The intersection of artificial intelligence and gaming reaches a new milestone with Google DeepMind's introduction of Genie 2, a groundbreaking AI system that can generate playable 3D game environments from text prompts. Core innovation: Google DeepMind has developed a foundation world model that creates interactive gaming environments based on simple text inputs, marking a significant advancement in AI training and game development. The system allows both human and AI control through standard input devices like keyboards and mouse Environments can be generated in real-time, providing unlimited possibilities for training scenarios The technology builds upon DeepMind's legacy of using games for...

read Dec 4, 2024

DeepMind’s GenCast AI weather prediction model is the best yet

Artificial intelligence has achieved a significant breakthrough in weather forecasting with Google DeepMind's new GenCast model, demonstrating superior accuracy compared to existing systems and marking a notable advancement in the application of AI to meteorological prediction. The innovation: Google DeepMind's GenCast represents a departure from traditional weather forecasting methods by relying solely on AI techniques rather than combining them with physics-based approaches. The model functions similarly to language AI, predicting weather conditions instead of words During training, GenCast's parameters gradually align with actual weather patterns through analysis of 40 years of historical data (1979-2018) The system generates probabilistic forecasts, providing...

read Dec 3, 2024

Google may bring AI podcast tool NotebookLM to its Gemini mobile app

The ability to automatically generate AI-powered podcasts from text documents and visual content is poised to enhance mobile learning experiences through Google's latest innovations. Key development: Google appears to be integrating its NotebookLM AI podcast creation capabilities into the Gemini mobile app, based on recently discovered beta code. Beta code snippets reveal commands for "create_podcast" and "Generate audio overview" functionality The integration would allow users to convert PDFs, text files, and videos into AI-generated podcast content The feature builds upon NotebookLM's existing ability to create conversational audio content from various source materials Technical implementation: NotebookLM creates realistic-sounding podcast-style discussions between...

read Dec 3, 2024

Google unveils AI video model ‘Veo’ in private preview

Google has expanded its creative AI offerings to include sophisticated video generation capabilities. Latest developments: Google has made its text-to-video generation model, Veo, available in private preview on the Vertex AI platform. Veo can create realistic videos at 24 or 30 frames per second based on text prompts The model can also generate videos from reference images while maintaining consistent style Initial demonstrations show promising results in addressing motion consistency, a common challenge in video generation Technical capabilities and features: Vertex AI users now have access to both Veo and the enhanced Imagen 3 text-to-image generator. Imagen 3 introduces new...

read Dec 2, 2024

Google’s AI Overviews may get fact-checking feature

Google's search app for Android is preparing to enhance transparency and credibility in its AI-powered search summaries by making source attribution more accessible to users. Key development: New code within Google's beta app indicates upcoming changes to how AI-generated overviews display their source material. The feature appears in the Google app beta v15.47.28 for Android, suggesting imminent implementation of direct source linking within Gemini's AI-generated summaries Currently, users must take extra steps through the Google "G" icon to verify sources, which can be cumbersome and counterproductive The new implementation would mirror the mobile web experience, where sources are more readily...

read