News/Cloud
Lambda unveils low-cost inference-as-a-service API
The AI infrastructure landscape is evolving as Lambda, a San Francisco-based GPU services provider, introduces a new inference-as-a-service API aimed at making AI model deployment more accessible and cost-effective for enterprises. The core offering: Lambda's new Inference API enables businesses to deploy AI models into production without managing underlying compute infrastructure. The service supports various leading models including Meta's Llama 3.3, Llama 3.1, Nous's Hermes-3, and Alibaba's Qwen 2.5 Pricing starts at $0.02 per million tokens for smaller models and reaches $0.90 per million tokens for larger models Developers can begin using the service within five minutes by generating an...
read Dec 11, 2024Google warns FTC that Microsoft-OpenAI deal harms competition
Tech giants continue to clash over exclusive partnerships and market access in an AI arms race that seems to be only just beginning. Core allegations: Google has reportedly approached the Federal Trade Commission (FTC) to challenge Microsoft's exclusive cloud hosting arrangement with OpenAI. The complaint emerges amid a broader FTC investigation into Microsoft's cloud computing practices Google argues that forcing OpenAI's customers to use Microsoft's servers creates unfair cost burdens for competitors The exclusive arrangement has generated approximately $1 billion in revenue for Microsoft in 2024 from reselling OpenAI's large language models (LLMs) Financial implications: Microsoft's partnership with OpenAI has...
read Dec 11, 2024Cloud trends 2024: Serverless, sovereign and AI-enabled
Public cloud platforms are experiencing significant transformation through artificial intelligence integration, serverless architecture adoption, and growing emphasis on digital sovereignty across different global markets. Major market shifts: The public cloud platform landscape is being reshaped by three primary forces: artificial intelligence adoption, serverless-first approaches, and regional sovereignty requirements. Hyperscalers are adapting their core infrastructure to support generative AI capabilities while expanding their service offerings beyond traditional enterprise IT boundaries Chinese cloud providers are driving innovation in AI services and foundation model support across multiple domains European cloud providers are capitalizing on sovereignty and sustainability requirements to compete effectively across the...
read Dec 9, 2024Adobe and AWS team up to enhance AI-powered customer experiences
Adobe's expansion of its Experience Platform onto Amazon Web Services (AWS) marks a significant development in enterprise AI and customer data management, reflecting the growing demand for seamless cloud-based personalization solutions. Strategic Partnership Overview: The collaboration between Adobe and AWS, announced at the recent re:Invent conference, represents a major shift in how enterprises can leverage cloud computing for customer experience management. The partnership will enable organizations using AWS infrastructure to activate customer data for personalization without complex cross-cloud data transfers Adobe's suite of applications, including Real-Time CDP, Journey Optimizer, and Customer Journey Analytics, will be available directly within AWS environments...
read Dec 5, 2024AWS’ new prompt caching feature cuts AI costs by 90%
AI infrastructure cost management and optimization are becoming increasingly important as enterprise adoption grows, prompting major cloud providers to introduce new features aimed at reducing expenses. Latest AWS Bedrock features: Amazon Web Services has unveiled two key capabilities - Intelligent Prompt Routing and Prompt Caching - to help customers reduce AI model usage costs. Intelligent Prompt Routing automatically directs queries to appropriately-sized models within a chosen model family, potentially reducing costs by up to 30% without sacrificing accuracy The system ensures simple queries are handled by smaller models while complex questions are routed to more sophisticated ones AWS customer Argo...
read Dec 4, 2024The biggest announcements from Amazon’s re:Invent 2024
The annual Amazon Web Services (AWS) re:Invent 2024 conference in Las Vegas marks a significant shift in cloud computing, with generative AI taking center stage as AWS strengthens its enterprise offerings amid growing competition. Key announcements and platform enhancements: AWS has unveiled several major updates to its AI and cloud computing services, focusing on enterprise-level solutions and improved AI capabilities. Bedrock, AWS's foundational model platform, now features multi-agent orchestration, enabling businesses to create collaborative AI systems that can work together on complex tasks Moody's has already implemented this technology to enhance their analytical capabilities through coordinated specialist AI agents New...
read Dec 2, 2024Red Hat, AWS join forces to expand VM and AI offerings
The growing emphasis on artificial intelligence and virtual machine management is driving new partnerships in cloud computing, as demonstrated by Red Hat's expanded collaboration with Amazon Web Services (AWS). Strategic partnership overview: Red Hat and AWS have signed a collaboration agreement to increase the availability of Red Hat's virtualization and AI solutions through AWS Marketplace. The partnership will focus on scaling Red Hat Enterprise Linux AI, OpenShift AI, and OpenShift Virtualization solutions Red Hat will expand its Cloud Center of Excellence to showcase cross-cloud implementation capabilities The companies will follow a jointly developed go-to-market strategy Key product developments: Red Hat...
read Nov 29, 2024How cloud and AI are changing the future of network architectures
The intersection of cloud computing and artificial intelligence is reshaping telecommunications network architecture, with Nokia leading efforts to create seamless network-cloud integration across multiple infrastructure layers. The big picture: Nokia's vision centers on developing a network-cloud continuum that integrates devices, edge infrastructure, and network core components, incorporating both traditional AI and generative AI technologies throughout the system. Nokia's Cloud and Network Services CTO Jitin Bhandari estimates that Communication Service Providers (CSPs) could access an additional $1.1 trillion market across multiple vertical sectors The integration of cloud and AI technologies presents significant implementation challenges that require careful consideration and planning CSPs...
read Nov 28, 2024Capgemini taps Mistral, Microsoft to accelerate AI adoption in regulated industries
The expansion of enterprise AI solutions gains momentum as major tech players join forces to accelerate adoption of generative AI technologies across industries. Strategic partnership details: Capgemini is expanding its Intelligent App Factory on Microsoft Azure through a new collaboration with Mistral AI and Microsoft, focusing on delivering customized AI solutions for regulated industries. The partnership integrates Mistral AI's language models with Microsoft's cloud infrastructure and leverages Capgemini's industry expertise The collaboration builds upon Capgemini and Microsoft's existing work since 2023 on enterprise-wide generative AI adoption The initiative aims to provide scalable, efficient, and secure solutions for clients globally Key...
read Nov 27, 2024These Android alum are launching an operating system just for AI agents
The rapidly evolving AI landscape is witnessing a new development as former Android executives launch a startup aimed at creating an operating system specifically designed for AI agents. The core mission: /dev/agents, a new startup led by former Android leaders, aims to develop a cloud-based operating system that will make AI agents more accessible and easier to build. Former Google Android VP David Singleton leads the company as CEO, bringing his extensive experience in operating system development The platform will focus on enabling "trusted agents" to work with users across multiple devices The system promises to introduce new UI patterns,...
read Nov 24, 2024Snowflake taps Anthropic’s Claude for enterprise and agentic AI offering
The integration of Anthropic's Claude 3.5 models into Snowflake's Cortex AI platform marks a significant advancement in enterprise AI capabilities, combining advanced language models with robust data analytics infrastructure. Partnership Overview: Snowflake and Anthropic have established a multi-year collaboration to integrate Claude 3.5 models into Snowflake's Cortex AI platform, enhancing enterprise AI capabilities while maintaining security and governance standards. The partnership will enable businesses to develop and deploy AI products and workflows within Snowflake's ecosystem Snowflake's agentic AI products, including Snowflake Intelligence and Cortex Analyst, will utilize Claude as a primary language model The integration will be available in select...
read Nov 23, 2024KPMG pours $100M into Google Cloud to boost AI adoption
The strategic partnership between KPMG and Google Cloud marks a significant development in enterprise AI adoption, with KPMG committing USD 100 million to expand their collaboration in generative AI, data analytics, and cybersecurity solutions. Investment scope and projected returns: KPMG's USD 100 million investment in Google Cloud alliance aims to generate USD 1 billion in incremental growth through AI-powered solutions. KPMG has experienced a tenfold increase in Google Cloud-related bookings over the past two years The firm established a Google Cloud Center of Excellence in April 2024 to align product development and technical resources The partnership focuses on data modernization...
read Nov 22, 2024Google’s new AI Agent Space allows businesses to discover and deploy agents
The cloud computing landscape is rapidly evolving into an AI battleground, with major players like Google Cloud, Microsoft Azure, and Amazon Web Services competing to provide comprehensive AI agent solutions for enterprise customers. Latest Development: Google Cloud has launched AI Agent Space, a new ecosystem program that allows businesses to discover, deploy, and collaborate on AI agents for task automation and operational optimization. The platform provides partners with early access to Google's AI technologies and direct engineering support Google will promote partner-created agents through its Cloud Marketplace to help scale solutions The initiative aims to create a win-win scenario by...
read Nov 19, 2024Cloudera buys Octopai to boost AI and data management
The data management and AI landscape is evolving as Cloudera makes strategic moves to enhance its enterprise offerings through acquisition and new product launches. Strategic acquisition details: Cloudera has signed a definitive agreement to acquire Octopai's automated data lineage and metadata management platform, with the deal expected to close by November 2024. The acquisition will integrate Octopai's specialized data lineage and catalog capabilities into Cloudera's existing infrastructure This strategic move particularly benefits enterprises in highly regulated sectors that need robust data governance across multiple environments The integration aims to provide customers with enhanced visibility of their data regardless of their...
read Nov 19, 2024Cloud spending surges 21% in Q3 amid AI boom
Market performance and key players: The global cloud infrastructure market reached $82 billion in Q3 2023, marking a 21% increase from the previous year. AWS, Google, and Microsoft dominated the market, collectively accounting for 64% of total spending Google led growth with a 36% increase, followed by Microsoft at 33% and AWS at 19% Combined spending among the three major providers grew 26% year-over-year AI-driven growth factors: Enterprise enthusiasm for artificial intelligence capabilities is fueling unprecedented investment in cloud infrastructure and services. Companies are investing heavily in cloud providers' AI solutions, anticipating significant gains in efficiency and productivity This surge...
read Nov 14, 2024Hybrid compute adoption surges as enterprises seek control over AI assets
The growing adoption of artificial intelligence by large enterprises is driving a shift toward hybrid computing models that combine public cloud services with private infrastructure, allowing organizations to maintain greater control over their AI capabilities. The evolving AI landscape: Large enterprises are increasingly adopting a hybrid approach to artificial intelligence deployment, combining public cloud services with private computing resources and locally-controlled models. Organizations spending over $10 million annually on AI are particularly motivated to develop private computing capabilities alongside their use of public cloud services This trend is especially prominent among companies with significant security concerns, regulatory requirements, or specific...
read Nov 11, 2024Northflank raises $22.3M to simplify cloud infrastructure for devs
The cloud infrastructure landscape is experiencing a significant shift as Northflank secures $22.3 million in funding to streamline developer workflows and simplify cloud deployments. Funding breakdown and company mission; London-based Northflank has secured a $16 million Series A round led by Bain Capital Ventures and a $6.3 million seed round led by Vertex Ventures US. The company's total funding now stands at approximately $25 million, with participation from Kindred Ventures, Tapestry VC, Pebblebed, and Uncorrelated Ventures Northflank's platform enables rapid deployment of applications, databases, and automated jobs across major cloud providers including AWS, Google Cloud, Microsoft Azure, and Oracle Cloud...
read Nov 5, 2024The 7 cloud computing trends shaping business success in 2025
The cloud computing revolution: Cloud technology is poised to undergo a fundamental transformation by 2025, with several key trends set to redefine business success and innovation. Emerging technologies like AI, quantum computing, and edge computing are converging with cloud services to create new possibilities for organizations across industries. These advancements promise to deliver unprecedented efficiency, cost savings, and performance improvements for businesses that embrace them. AI as the brain of cloud computing: Artificial intelligence is evolving from a service running in the cloud to the intelligent force optimizing all aspects of cloud operations. AI-driven systems will predict resource needs, automatically...
read Nov 4, 2024Amazon spent $75B in 2024 to meet AI demand — next year it will spend more
Cloud computing expansion fuels Amazon's massive capital expenditure: Amazon plans to spend $75 billion on capital expenditure in 2024, with an even higher amount expected in 2025, primarily driven by the growth of its cloud computing business, Amazon Web Services (AWS). The surge in spending is attributed to rising demand for generative AI services and an increasing number of customers migrating their workloads from on-premises infrastructure to the cloud. AWS reported impressive Q3 2023 results, with net sales of $27.45 billion (up 19% year-on-year) and operating profit jumping almost 50% to $10.45 billion. Amazon CEO Andy Jassy highlighted recent customer...
read Oct 31, 2024DigitalOcean and Hugging Face launch 1-click AI models
AI Accessibility Breakthrough: DigitalOcean and Hugging Face have formed a strategic partnership to democratize artificial intelligence, particularly for startups and small to medium-sized businesses. The collaboration introduces "1-Click Models," a solution designed to simplify the deployment of machine learning models in cloud environments. This initiative aims to provide faster and more affordable access to generative AI capabilities, leveling the playing field for organizations with limited technical resources. Key Features and Benefits: DigitalOcean supports popular models from Google, Meta, Mistral, and NousResearch at launch. Models are deployed in GPU Droplets, DigitalOcean's virtual machines, and placed within inference containers. Developers can access...
read Oct 30, 2024Cloud computing and Office software drive big revenue surge for Microsoft
Microsoft's AI-Powered Cloud Fuels Impressive Q1 Results: Microsoft Corporation has reported stronger-than-expected revenue growth for its first quarter of fiscal year 2024, with cloud computing and Office software leading the charge. Key financial highlights: The tech giant's performance surpassed analyst expectations, demonstrating the growing impact of its artificial intelligence investments. Sales for Q1 (ending September 30) increased by 16% to $65.6 billion, exceeding the average analyst estimate of $64.5 billion. Earnings per share reached $3.30, surpassing the projected $3.11. Overall cloud revenue, encompassing products like Office and Azure, grew by 22% to $38.9 billion. AI's growing influence: Microsoft's strategic focus...
read Oct 29, 2024Box and Amazon partner to simplify AI app creation for enterprises
Box and AWS join forces to streamline generative AI app development: The content management platform Box has expanded its partnership with Amazon Web Services (AWS) to simplify the creation of generative AI applications for enterprise clients. Box customers can now access foundation models through Amazon Bedrock, including Anthropic's Claude and Amazon's Titan, as part of Box AI. This integration allows companies to quickly build generative AI applications by combining advanced AI models with Box's Intelligent Content Management platform. The partnership aims to provide both accuracy and security for businesses looking to accelerate their AI initiatives. Key features of the collaboration:...
read Oct 24, 2024How cybercriminals are using sex bots to exploit their victims
AI-powered sex chat services exploit cloud vulnerabilities: Cybercriminals are increasingly using stolen cloud credentials to operate and resell AI-powered sex chat services, often bypassing content filters to engage in disturbing role-playing scenarios. Researchers at Permiso Security have observed a significant increase in attacks against generative AI infrastructure, particularly Amazon Web Services' (AWS) Bedrock, over the past six months. These attacks often stem from accidentally exposed cloud credentials or keys, such as those left in public code repositories like GitHub. Investigations revealed that many AWS users had not enabled logging, limiting visibility into the attackers' activities. Honeypot experiment reveals alarming trends:...
read Oct 24, 2024Forrester predicts significant disruptions to AI infrastructure sector by 2025
AI's impact on tech infrastructure in 2025: Forrester's Predictions for 2025 anticipate significant disruption in the technology infrastructure space, driven by an accelerated demand for AI-powered solutions and the need to demonstrate concrete value from AI investments. The report suggests that 2025 will be a pivotal year for businesses to show real return on investment and tangible benefits from their AI initiatives, justifying the hype and expense surrounding these technologies. The devastating CrowdStrike outage in 2024 has underscored the importance of addressing risk, resiliency, and modern security practices in tech infrastructure. Key predictions for tech infrastructure and operations: Major tech...
read