News/Google

Dec 16, 2024

Google’s new AI video tool Veo 2 surpasses Sora in audience tests

Google's new Veo 2, an enhanced AI video generation model, marks a significant advancement in the competitive landscape of AI-powered video creation tools, with the company claiming superior performance compared to OpenAI's Sora. Latest developments: Google has unveiled Veo 2 on its VideoFX platform, offering improved video generation capabilities with enhanced physics understanding and human movement representation. The new model supports video creation at resolutions up to 4K and includes cinematographic features like genre specification and lens selection Access is currently limited to a waitlist system through Google Forms Videos created with Veo 2 will include Google's SynthID watermark for...

read
Dec 16, 2024

Google’s new tool Whisk remixes your existing images with AI

Google's latest innovation that allows users to create new images by using existing images as prompts rather than relying solely on text descriptions. Core functionality: Google's new AI tool Whisk enables users to generate images by providing visual references for subject matter, scene composition, and artistic style. Users can upload multiple reference images for each category or use Google's auto-generated suggestions via a dice icon feature The system allows for optional text input to further refine the desired output Each generated image comes with an accompanying text prompt that can be modified for iterations Technical implementation: Whisk is powered by...

read
Dec 16, 2024

NotebookLM’s new ‘join’ feature lets you participate in your AI podcasts

Google has expanded the capabilities of its hit AI podcast generator and research assistant NotebookLM Core functionality enhancement: NotebookLM now offers an interactive 'join' feature that enables users to actively participate in AI-generated conversations. The platform creates podcast-style Audio Overviews by analyzing various source materials including PDFs, websites, podcasts, and videos Users can now engage directly with AI hosts through a question-and-answer format The system utilizes Google's upgraded Gemini 2.0 AI model to power its conversational capabilities Interface improvements: Google has restructured NotebookLM's interface to provide a more intuitive user experience. The platform now features three main sections: a Sources...

read
Dec 16, 2024

Google’s Project Mariner is an AI agent that navigates the web for you

Project overview: Google has introduced Project Mariner, a research prototype powered by Gemini 2.0, designed to automate web browsing tasks through an experimental Chrome extension. The AI agent can understand and interact with various web elements, including images, code, and forms Project Mariner achieved an 83.5% success rate when tested against the WebVoyager benchmark The tool operates exclusively in active browser tabs and requires human supervision Core functionality: Project Mariner serves as a virtual assistant that can navigate websites and perform tasks based on user instructions. Users communicate with Mariner through a chatbot interface in a side window The AI...

read
Dec 15, 2024

Google CEO outlines AI, Waymo and quantum computing plans

The advancement of artificial intelligence has positioned Google at the forefront of technological innovation, with CEO Sundar Pichai steering the company's ambitious AI initiatives while navigating complex regulatory and ethical considerations. Strategic vision and AI development: Google's journey in artificial intelligence reflects a deliberate, long-term approach that began with its "AI-first" directive in 2015. The recent launch of Gemini 2.0, Google's latest AI model, represents a significant milestone in the company's AI capabilities Breakthrough developments in quantum computing and Waymo's self-driving technology demonstrate Google's commitment to pushing technological boundaries The company's extensive network of data centers and compute clusters provides...

read
Dec 15, 2024

Google CEO plans AI ‘Manhattan Project’ after Trump inauguration

The landscape of artificial intelligence development is taking on new geopolitical significance as tech leaders position themselves ahead of Donald Trump's return to the White House. Strategic vision and leadership: Sundar Pichai, CEO of Alphabet and Google, is proposing an ambitious national AI research initiative comparable to the Manhattan Project of World War II. Pichai emphasized the opportunity for nationwide collaboration, expressing enthusiasm for large-scale infrastructure projects to accelerate AI progress The proposal aligns with recent recommendations from the bipartisan U.S.-China Economic and Security Review Commission (USCC) The initiative would be part of a broader strategy to maintain America's technological...

read
Dec 15, 2024

What’s inside Google NotebookLM’s recent upgrade

The AI-powered productivity tool NotebookLM has undergone a significant transformation with Google introducing new features focused on enhanced interactivity, organization, and collaboration capabilities. Key updates and features: Google has integrated the experimental Gemini 2.0 Flash version into NotebookLM while introducing a complete interface redesign and new interactive audio capabilities. The platform now includes Interactive Audio Overview hosts that allow users to engage in voice conversations, asking questions and requesting clarification on concepts A redesigned three-panel layout improves organization and workflow management The Sources Panel provides centralized information management The Chat Panel facilitates AI-driven discussions with source citations The Studio Panel...

read
Dec 15, 2024

Gemini will now summarize your Google Drive folders with AI

Google's integration of Gemini AI into Google Drive marks a significant expansion of AI-powered document management capabilities, allowing users to obtain summaries of entire folders rather than just individual files. Core functionality: Gemini's new folder summarization feature operates through a simple interface that provides AI-powered insights across multiple document types. Users can access the summary function through a dedicated button or context menu The system can process various file formats including text documents, PDFs, spreadsheets, presentations, and images A sidebar interface displays summaries and enables interactive follow-up questions about folder contents Availability and access: The folder summarization feature is being...

read
Dec 15, 2024

Google investment chief says company’s biggest AI bet is on search

The tech giant Alphabet is doubling down on artificial intelligence for its core search business, marking a strategic pivot as it faces new competition in the AI era. Strategic focus: Alphabet's President and Chief Investment Officer Ruth Porat has identified AI-powered search as the company's most significant strategic investment. Google has begun incorporating AI-generated summaries into search results, particularly for queries without clear-cut answers This initiative comes as a direct response to competition from OpenAI's ChatGPT The company must carefully navigate the challenge of AI "hallucinations" - instances where AI systems generate false information Financial commitment: Alphabet is making substantial...

read
Dec 15, 2024

Google Labs is a playground for very cool new AI tools — here’s how to try them

The recent redesign of Google Labs showcases the tech giant's expanding portfolio of experimental AI tools, making cutting-edge technology more accessible to everyday users. Core platform changes: Google Labs has undergone a significant redesign to streamline user access to experimental AI features and simplify the testing process. The refreshed interface makes discovering and initiating new experiments more intuitive Users can now provide direct feedback that influences the development of these AI tools New features and updates are continuously rolled out across various experiments Featured AI innovations: Google's latest experimental offerings span multiple domains, from content creation to academic research interpretation....

read
Dec 15, 2024

Google’s new Trillium AI chip aims to reshape AI and cloud computing

The race for AI hardware supremacy has intensified with the release of Google's Trillium chip, its sixth-generation Tensor Processing Unit, which promises significant advancements in performance and cost-efficiency for AI workloads. Core innovations and capabilities: Google's Trillium represents a substantial leap forward in AI processing technology, featuring doubled High Bandwidth Memory and a third-generation SparseCore architecture. The new TPU delivers 2.5 times better training performance per dollar and triple the inference throughput compared to previous generations Hardware improvements include a 4.7-fold increase in peak compute performance per chip Early adopter AI21 Labs reports significant gains in cost-efficiency and scalability for...

read
Dec 14, 2024

Why Google’s Astra and Mariner projects are big leaps for the AI agent space

The race to develop AI agents that can autonomously perform tasks and understand their environment is intensifying, with Google unveiling new prototypes that demonstrate both the potential and current limitations of this technology. Key developments: Google is expanding testing for Project Astra, a virtual assistant capable of processing multiple types of media in real-time, while also revealing Project Mariner, a browser-based AI agent. Project Astra can process text, images, video, and audio inputs while maintaining context from previous interactions for up to 10 minutes The system is being integrated with Google Search, Lens, and Maps A prototype version using smart...

read
Dec 14, 2024

Google, TPG and Intersect Power partner on clean energy data centers

Surging AI industry demand for additional data center capacity is prompting major tech companies to seek innovative solutions for sustainable power generation. Strategic partnership unveiled: Google, Intersect Power, and TPG Rise Climate have formed a groundbreaking alliance to develop data centers powered by co-located clean energy facilities across the United States. The collaboration aims to synchronize data center development with renewable power generation, creating industrial parks with gigawatts of capacity The first phase of the initial co-located clean energy project is scheduled to begin operations in 2026, with full completion expected in 2027 A funding round of over $800 million,...

read
Dec 14, 2024

It’s looking more likely that Google will launch new AI-powered smart glasses

The increased sophistication of AI-powered wearable technology is bringing Google's long-anticipated smart glasses closer to reality, with Project Astra representing the company's latest venture into AI-integrated eyewear. Project overview and background: Google's Project Astra represents the company's most ambitious vision for an all-encompassing, multimodal virtual assistant that could revolutionize how we interact with AI in daily life. Initially demonstrated at Google I/O, Project Astra aims to create an always-on AI assistant that can seamlessly integrate into users' daily activities The technology currently exists between conceptual demonstration and early prototype stages Smart glasses have consistently appeared as a key component in...

read
Dec 14, 2024

What to know about Google’s new AI coding agent Jules

Google's introduction of Jules, a new AI-powered code agent built on Gemini 2.0, marks a significant advancement in automated software development tools, specifically targeting bug fixes and code improvements for developers. Key Features and Capabilities: Jules is designed to create multi-step plans for addressing coding issues, with a current focus on Python and JavaScript development within GitHub workflows. The AI agent can modify multiple files simultaneously and prepare pull requests for code changes Developers maintain full control over the process, with the ability to review and adjust Jules' proposed solutions before implementation The system integrates directly with GitHub, streamlining the...

read
Dec 14, 2024

Why Google’s Gemini 2 AI model is such a big deal

The rapid evolution of artificial intelligence continues as Google unveils Gemini 2, marking a significant step forward in autonomous AI capabilities and agent-based computing. Core capabilities and improvements: Gemini 2 Flash, the first version in the Gemini 2 family, demonstrates enhanced performance while maintaining smaller model size and faster processing speeds. The model features native multimodal capabilities, allowing it to generate images, speech, and text without relying on separate specialized models Advanced reasoning abilities will be integrated into Google Search AI Overviews, enhancing the search experience Improved visual understanding, speech translation, and video analysis capabilities set this version apart from...

read
Dec 12, 2024

Google speeds up Gemini AI to enhance digital assistants

The race to advance AI capabilities has entered a new phase as Google introduces significant upgrades to its flagship artificial intelligence model, promising enhanced speed and versatility across multiple applications. Core Developments: Google has launched Gemini 2.0, a major update to its AI model that doubles the processing speed of its predecessor while expanding functionality. The new model demonstrates multimodal capabilities, handling image generation and audio processing across different languages Integration with Google Search and coding applications enhances the model's practical utility Performance improvements focus on speed optimization, with the new version operating twice as fast as the previous iteration...

read
Dec 12, 2024

Google tests Gemini AI agents to assist video game players

The rapid advancement of artificial intelligence is pushing into the gaming world as Google explores new ways to integrate AI assistance directly into gameplay experiences. Core Development: Google's latest Gemini 2.0 AI system is being tested to create intelligent gaming assistants that can analyze gameplay in real-time and provide contextual guidance. The AI agents can interpret on-screen action and engage in real-time conversations to offer strategic suggestions These agents can leverage Google Search to access and apply gaming knowledge from across the web Initial testing is being conducted with popular mobile games like Clash of Clans and Hay Day from...

read
Dec 11, 2024

Google’s new AI coding assistant Jules will fix bugs while you sleep

The emergence of Google's autonomous AI coding assistant "Jules" marks a significant advancement in automated software development, representing a new frontier in the collaboration between artificial intelligence and human developers. Core capabilities: Jules is an AI coding assistant built on Google's Gemini 2.0 platform that can autonomously identify and fix software bugs while developers are offline. The system integrates directly with GitHub's workflow system, enabling it to analyze complex codebases and implement fixes across multiple files Jules can prepare pull requests independently while operating as an autonomous agent within GitHub's ecosystem Before executing changes, the system presents suggested plans and...

read
Dec 11, 2024

Gemini 2.0 is Google’s advanced AI model for the agentic age

Google has officially unveiled Gemini 2.0, a significant upgrade to their flagship AI model that introduces advanced autonomous capabilities and marks a strategic shift toward more independent AI systems. Core capabilities and improvements: Gemini 2.0 represents a substantial leap forward in AI functionality, introducing several key technological advancements. The model now features native image and audio output capabilities, enabling more natural multimodal interactions Enhanced long context understanding allows the system to process and retain more information New tool usage abilities and compositional function-calling expand the model's practical applications Improved planning and instruction-following capabilities enable more complex task completion Development and...

read
Dec 11, 2024

Google’s Project Astra wants to be your all-knowing AI assistant

Project Astra represents a significant advancement in AI assistance technology, combining multimodal interaction capabilities with persistent memory and real-time processing across multiple devices. Core functionality: Project Astra operates as a universal AI assistant that can interact through speech and video while maintaining contextual awareness through conversation memory. The system can process both verbal commands and visual input through phone cameras or specialized prototype glasses It maintains memory of conversations and can recall up to 10 minutes of current session information The assistant leverages multiple tools including Google Search, Maps, and Lens to provide comprehensive responses Technical capabilities and integration: Project...

read
Dec 11, 2024

Google’s new Deep Research AI tool does web scouring for you

Core functionality: Deep Research, a new feature for Gemini Advanced users, automates the web research process by creating and executing multi-step research plans based on user queries. Users can input research questions and review an AI-generated research plan before execution The system spends several minutes browsing the web, mimicking human research patterns Results are compiled into a comprehensive report with source links for further exploration Users can refine results through follow-up questions and export final reports to Google Docs Availability and access: Google is implementing a phased rollout of Deep Research with specific platform and subscription requirements. The feature is...

read
Dec 11, 2024

Google’s new Trillium AI chip delivers improved speed, powers Gemini 2.0 model

The AI hardware industry has taken another significant leap forward with Google's announcement of Trillium, its sixth-generation AI accelerator chip that powered the training of Gemini 2.0 while delivering unprecedented performance improvements. Core technological advancement: Google's new Trillium processor represents a quantum leap in AI chip capabilities, offering four times the training performance of its predecessor while significantly reducing energy consumption. The chip achieves a 4.7x increase in peak compute performance compared to previous generations Memory capacity and interchip interconnect bandwidth have both doubled Energy efficiency has improved by 67%, addressing critical data center power consumption concerns Infrastructure deployment: Google...

read
Dec 11, 2024

How tools like NotebookLM use AI to generate podcasts

Google's NotebookLM transforms written documents into AI-generated audio summaries that mimic podcast-style presentations. Core functionality and purpose: NotebookLM serves as an AI-powered research assistant that helps users process and understand uploaded documents through interactive analysis and summarization. The platform allows users to upload documents or links and engage with AI to extract key information and insights Its Audio Overviews feature converts text content into conversational audio summaries that sound similar to human-hosted podcasts The tool is designed to act as a "thought partner" for researchers and writers working with complex information Technical capabilities: The system employs advanced natural language processing...

read
Load More