<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/"><channel><title>Cost Management</title><link>https://cloud.google.com/blog/topics/cost-management/</link><description>Cost Management</description><atom:link href="https://cloudblog.withgoogle.com/blog/topics/cost-management/rss/" rel="self"></atom:link><language>en</language><lastBuildDate>Wed, 22 Apr 2026 20:10:29 +0000</lastBuildDate><image><url>https://cloud.google.com/blog/topics/cost-management/static/blog/images/google.a51985becaa6.png</url><title>Cost Management</title><link>https://cloud.google.com/blog/topics/cost-management/</link></image><item><title>Next-gen FinOps for the AI era</title><link>https://cloud.google.com/blog/topics/cost-management/introducing-spend-caps-ai-cost-visibility-next26/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Today we’re excited to announce the next generation of our FinOps product suite to help our customers increase operational efficiency, better understand their costs, and control them with Spend Caps. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;What’s new: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;We’re introducing a new FinOps Explainability agent, which is designed to operate autonomously, and investigate the drivers of your AI-related Cloud costs. This is in addition to new FinOps tooling which provides commercial auditability. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We’re also announcing a private preview of &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Spend Caps&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; in Google Cloud, enabling FinOps and DevOps managers to set budgets and enforce cost boundaries at the project level for Google AI Studio (AIS), Gemini Enterprise Agent Platform (the evolution of Vertex AI), Cloud Run, Cloud Run Functions, and Maps. These caps alert you and ultimately pause API traffic once your set budget is reached.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters for your business: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;These new FinOps tools give you clear visibility into AI costs, increase control with Spend Caps to prevent overspending, and offer the commercial flexibility needed to scale your AI innovations efficiently. Customers using our existing FinOps tools are already seeing measurable improvements. Since we launched Gemini Cloud Assist (GCA) for FinOps last year,&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; cost reporting adoption has grown 75%, while the time customers spend on FinOps cost analysis has dropped by 18&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;%.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Where to get started: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;You can access the FinOps Explainability agent in the console &lt;/span&gt;&lt;a href="https://console.cloud.google.com/billing/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, along with the FinOps tooling. Customers can sign up for the private preview of Spend Caps &lt;/span&gt;&lt;a href="https://docs.google.com/forms/d/12TIOZQq4FWb7LMZ_IFLBIM3sQPJipyNAkiw8pRw7d50/viewform?edit_requested=true" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Goodbye static reporting &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Given the number of variables and services that a large enterprise typically runs, cloud cost reporting can be noisy, even though, in theory, AI costs are simply quantity (q) times price (p). Quantity can be driven by a wide mix of variables, such as API request traffic, error logs, fluctuating token counts, or even cloud storage. Price, in turn, varies with the AI model type and with frequent provider price changes. This makes it hard for FinOps and DevOps managers to synthesize the data, identify efficiency opportunities, and take timely action. &lt;/span&gt;&lt;/p&gt;
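&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As a rough illustration of the quantity-times-price relationship, the Python sketch below sums q × p over a few usage line items. The line-item names, quantities, and unit prices are made-up placeholders for illustration, not actual Google Cloud prices.&lt;/span&gt;&lt;/p&gt;

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineItem:
    """One usage line: a quantity (q) billed at a unit price (p)."""
    name: str
    quantity: float    # e.g. tokens, requests, GiB-months
    unit_price: float  # hypothetical price per unit, in USD


def total_cost(items):
    """Total cost is the sum of q * p over all line items."""
    return sum(item.quantity * item.unit_price for item in items)


# Hypothetical usage for one month; every number here is illustrative only.
usage = [
    LineItem("input_tokens", 40_000_000, 0.50 / 1_000_000),
    LineItem("output_tokens", 5_000_000, 2.00 / 1_000_000),
    LineItem("storage_gib_month", 200, 0.02),
]

if __name__ == "__main__":
    for item in usage:
        print(f"{item.name}: ${item.quantity * item.unit_price:.2f}")
    print(f"total: ${total_cost(usage):.2f}")
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Even in this toy case, the total moves whenever any single quantity or any single price changes, which is why attributing a cost change to its driver gets hard at enterprise scale.&lt;/span&gt;&lt;/p&gt;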
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In Google Cloud Billing, we &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;used Gemini to develop our new &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;FinOps Explainability agent&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; to autonomously help users understand the drivers of AI costs. Attributing ROI to AI projects requires a clear understanding of their costs, but because AI often piggybacks on existing infrastructure, its expenses frequently blur into the general cost of doing business.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Now you can use the FinOps Explainability agent to identify your AI cost drivers automatically, and to answer questions like: “How much did I spend on Gemini 1.5 Pro versus Gemini 1.5 Flash?” Or, “&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Break down my total spend by API Key so I can see which integration is expensive.” Or, “Show me the split between Input Token costs and Output Token costs for Gemini 3.0 Pro.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; You can quickly discover &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;what&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; services and &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;which &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;projects are driving your AI costs.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/Figma_prototype_recording_10FPS_100.gif"
        
          alt="image-1"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="g5l82"&gt;FinOps Explainability agent helps you analyze AI costs, drivers &amp;amp; trends&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Hello automated Spend Caps&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The speed of AI adoption and usage is driving cloud spend that behaves differently than traditional cloud spend. AI uses specialized hardware (TPUs and GPUs), and a single runaway training job or unoptimized model running on that hardware can drain a budget in a very short amount of time. Users are also constantly experimenting. Traditional cost control tools typically alert managers, but don’t enforce budget caps. The result: many enterprises have been forced to build their own complex custom spend guardrails, enforced through destructive actions that can be time-consuming to reverse, such as disassociating forms of payment. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We’re excited to announce that &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Spend Caps&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; are coming soon to Google Cloud. Designed to work with Google Cloud Budgets, Spend Caps let FinOps and DevOps managers set budgets that enforce automated cost boundaries (caps) at the project level for AIS, Agent Platform, Cloud Run, Cloud Run Functions, and Maps. These caps alert you and ultimately pause API traffic once your set budget is reached, but leave your resources intact. If you need the traffic to resume, simply suspend the Spend Cap. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We expect customers that want to contain the costs of AI R&amp;amp;D to benefit immensely from this new feature. You can sign up for the &lt;/span&gt;&lt;a href="https://docs.google.com/forms/d/12TIOZQq4FWb7LMZ_IFLBIM3sQPJipyNAkiw8pRw7d50/viewform?edit_requested=true" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;private preview&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; today. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/image_5WZotVw.gif"
        
          alt="image"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="g5l82"&gt;Spend Caps help prevent cost overruns&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Real commercial incentive auditability.  &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud meets you at every stage of growth—offering the commercial flexibility, startup programs, and enterprise incentives needed to help your costs scale efficiently. To help users more clearly understand the connection between commercial agreements and the services being billed, we’ve designed our FinOps tooling to provide end-to-end auditability of our commercial obligations. With the private preview rollout of enhanced billing account hierarchies, customers can view their aggregated spend across multiple billing accounts, including Other Eligible Services (OES) spend. Additionally, we are announcing a private preview for Google Cloud contract commitment reporting, providing visibility into Google Cloud commit contract burndown within your Enterprise Agreement. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;The future of FinOps is here. Built with AI for AI. &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the FinOps Explainability agent for deep visibility, Spend Caps for increased control, and enhanced billing account hierarchies with contract commitment reporting for ultimate commercial flexibility, Google Cloud is empowering you to scale your AI innovations with confidence and precision. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 22 Apr 2026 12:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/introducing-spend-caps-ai-cost-visibility-next26/</guid><category>Google Cloud Next</category><category>Cost Management</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/GCN26_102_BlogHeader_2436x1200_Opt_1_Light.max-600x600.jpg" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Next-gen FinOps for the AI era</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/GCN26_102_BlogHeader_2436x1200_Opt_1_Light.max-600x600.jpg</image><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/introducing-spend-caps-ai-cost-visibility-next26/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sarah McMullin</name><title>Head of Cloud FinOps Product</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pravir Gupta</name><title>VP &amp; GM, Google Cloud Business Platform</title><department></department><company></company></author></item><item><title>How to find the sweet spot between cost and performance</title><link>https://cloud.google.com/blog/products/ai-machine-learning/build-a-robust-and-cost-effective-gen-ai-strategy/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, we often see customers asking themselves: "How can we manage our generative 
AI costs effectively without sacrificing the performance and availability our applications demand?" &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This is the million-dollar question — or, perhaps more accurately, the "tokens-per-minute" question. The key isn't just about choosing the cheapest option, but about finding the right recipe of tools and services that aligns with your  workload patterns.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This guide will walk you through Google Cloud's flexible gen AI  infrastructure options, showing you how to find that sweet spot on the efficient frontier between cost and performance. We'll start with the foundational pay-as-you-go (PayGo) models and then explore how to layer on more specialized options to build a robust and cost-effective gen AI strategy.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style="vertical-align: baseline;"&gt;Understanding your foundation: Pay-as-You-Go (PayGo) options&lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For many workloads, Google Cloud's standard PayGo offerings provide a powerful and flexible starting point. To get the most out of them, it's crucial to understand the mechanisms that govern performance and availability.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;1. Dynamic Shared Quota (DSQ)&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At its core, the standard PayGo environment operates on a principle of fairness and efficiency called Dynamic Shared Quota (DSQ). Instead of enforcing rigid, per-customer limits, DSQ intelligently distributes available GenAI capacity among all customers.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_kWhsBI3.max-1000x1000.jpg"
        
          alt="1"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;How it works:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;High-priority lane: Your organization has a default Tokens Per Second (TPS) threshold. Any requests you send that fall within this threshold are given higher priority. This lane is designed to provide high availability, targeting a 99.5% SLO.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Best-effort lane: If you experience a spike in traffic and exceed your TPS threshold, your excess requests are not immediately dropped. Instead, they are handled with lower priority, receiving throughput when there is spare capacity available.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This system is designed so that sudden traffic spikes from one customer do not negatively impact the baseline performance of others. You get a reliable level of service for your everyday needs, with the potential to burst when the system has capacity to spare.&lt;/span&gt;&lt;/p&gt;
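&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The two lanes described above can be sketched in a few lines of Python. This is a simplified mental model, not the actual scheduler: the function names, the example threshold, and the single "spare capacity" number are assumptions made for illustration, and the real system shares spare capacity among all customers.&lt;/span&gt;&lt;/p&gt;

```python
def split_by_lane(offered_tps, threshold_tps):
    """Split offered traffic into (high_priority, best_effort) tokens per second.

    Traffic up to the threshold rides the high-priority lane; anything above
    it falls into the best-effort lane.
    """
    if not (offered_tps >= 0 and threshold_tps >= 0):
        raise ValueError("rates must be non-negative")
    high_priority = min(offered_tps, threshold_tps)
    best_effort = offered_tps - high_priority
    return high_priority, best_effort


def served_tps(offered_tps, threshold_tps, spare_capacity_tps):
    """Estimate served throughput under this simplified model.

    High-priority traffic is assumed to be served in full; best-effort
    traffic is served only up to the spare capacity available to it.
    """
    high_priority, best_effort = split_by_lane(offered_tps, threshold_tps)
    return high_priority + min(best_effort, max(spare_capacity_tps, 0))


if __name__ == "__main__":
    # Hypothetical numbers: a 1,000 TPS threshold and a spike to 1,500 TPS.
    print(split_by_lane(1500, 1000))    # (1000, 500)
    print(served_tps(1500, 1000, 200))  # 1200
```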
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;2. Usage tiers: Rewarding your investment&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To provide more predictable performance as your gen AI usage grows, Google Cloud automatically places your organization into Usage Tiers based on your rolling 30-day spend on eligible Vertex AI services. &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;The higher your tier, the higher your guaranteed Tokens Per Minute (TPM) limit&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At the time of writing, these are the tiers for our popular model families:&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;div align="left"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;&lt;table style="width: 99.3473%;"&gt;&lt;colgroup&gt;&lt;col style="width: 38.2928%;"/&gt;&lt;col style="width: 13.4542%;"/&gt;&lt;col style="width: 27.5553%;"/&gt;&lt;col style="width: 20.6988%;"/&gt;&lt;/colgroup&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="vertical-align: baseline;"&gt;Model Family&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="vertical-align: baseline;"&gt;Tier&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="vertical-align: baseline;"&gt;Spend (30 days)&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="vertical-align: baseline;"&gt;TPM&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Pro Models&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 1&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;$10 - $250&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;500,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt; &lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 2&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;$250 - $2,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;1,000,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt; &lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 3&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;&amp;gt; $2,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;2,000,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Flash / Flash-Lite Models&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 1&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;$10 - $250&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;2,000,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt; &lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 2&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;$250 - $2,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;4,000,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt; &lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Tier 3&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;&amp;gt; $2,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;10,000,000&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;p&gt;&lt;sup&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; Important: &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;For the latest models and thresholds, always refer to the &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/standard-paygo#tiered"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/sup&gt;&lt;/p&gt;
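&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The tier table above can be read as a simple lookup. The Python sketch below transcribes the table values as published in this article; the function names and family keys are illustrative, and two boundary choices are assumptions because the table does not spell them out: a spend of exactly $250 is treated as Tier 2, and spend below $10 returns no tier.&lt;/span&gt;&lt;/p&gt;

```python
# TPM values transcribed from the table in this article; check the
# documentation for current numbers before relying on them.
TIER_TPM = {
    "pro": {1: 500_000, 2: 1_000_000, 3: 2_000_000},
    "flash": {1: 2_000_000, 2: 4_000_000, 3: 10_000_000},
}


def usage_tier(spend_30d_usd):
    """Map rolling 30-day spend (USD) to a tier number, or None below $10.

    Assumption: exactly $250 counts as Tier 2. Exactly $2,000 stays in
    Tier 2 because the table lists Tier 3 as strictly above $2,000.
    """
    if spend_30d_usd > 2000:
        return 3
    if spend_30d_usd >= 250:
        return 2
    if spend_30d_usd >= 10:
        return 1
    return None


def tpm_limit(model_family, spend_30d_usd):
    """Return the tier's Tokens Per Minute limit, or None if no tier applies."""
    tier = usage_tier(spend_30d_usd)
    if tier is None:
        return None
    return TIER_TPM[model_family][tier]


if __name__ == "__main__":
    print(tpm_limit("pro", 100))     # 500000
    print(tpm_limit("flash", 2500))  # 10000000
```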
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Crucially, you should think of your tier limit as a floor, not a ceiling.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_MJ3MPBA.max-1000x1000.jpg"
        
          alt="2"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Critical traffic: Traffic up to your organization's tier limit is protected. You should experience minimal to no &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;429&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; (resource exhausted) errors as long as you stay within this baseline.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Opportunistic bursting: When you exceed your tier limit, you can still burst to use spare system capacity on a best-effort basis. If the entire system is under heavy load, fair-share throttling will engage for this excess traffic. The key takeaway is that we don't artificially cap your performance if there's idle capacity available.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;3. Priority PayGo: Your insurance policy for spikes&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;What if your workload is prone to unpredictable spikes and you can't risk &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;429&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; errors, but you're not ready to commit to a fixed capacity model? This is where Priority PayGo comes in. It's designed to give you the best of both worlds: the flexibility of PayGo with the high availability needed for important traffic.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For a premium, you can tag specific API requests for higher priority.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Important: Priority PayGo is currently available only on the global endpoint. Support for regional endpoints may follow, but is not guaranteed.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;How to use Priority PayGo: &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;It's as simple as adding a header to your API call. No sign-up or commitment is needed.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-code"&gt;&lt;dl&gt;
    &lt;dt&gt;code_block&lt;/dt&gt;
    &lt;dd&gt;&lt;pre&gt;&lt;code&gt;curl -X POST \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" \
  -H "X-Vertex-AI-LLM-Shared-Request-Type: priority" \
  https://aiplatform.googleapis.com/...&lt;/code&gt;&lt;/pre&gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
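&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you call the API from Python rather than curl, the same header can be attached as shown in this minimal sketch. It uses only the standard library and builds the request without sending it. The header name and value come from the curl example; the function names are illustrative, and the endpoint URL and request body are left to the caller because the example above elides them.&lt;/span&gt;&lt;/p&gt;

```python
import json
import urllib.request

# Header name and value taken from the curl example in this article.
PRIORITY_HEADER = "X-Vertex-AI-LLM-Shared-Request-Type"


def priority_headers(access_token):
    """Return the HTTP headers for a request tagged as Priority PayGo."""
    return {
        "Authorization": f"Bearer {access_token}",
        "Content-Type": "application/json",
        PRIORITY_HEADER: "priority",
    }


def build_priority_request(url, access_token, body):
    """Build (but do not send) a POST request carrying the priority header.

    `url` must be the full endpoint URL for your model, and `access_token`
    can be obtained with `gcloud auth print-access-token`.
    """
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers=priority_headers(access_token),
        method="POST",
    )
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Because the tag is set per request, you can reserve it for the calls that matter most and leave the rest on standard PayGo by omitting the header.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;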
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Be mindful of the ramp limit. As the images below illustrate, ramping up priority requests too quickly can cause some requests to be downgraded to standard priority if capacity is constrained. A slower, more gradual ramp-up ensures the best experience and mitigates downgrading.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For example: &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_GEHhkK1.max-1000x1000.jpg"
        
          alt="3"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="mea1l"&gt;The system tries to serve priority requests even when they exceed the ramp limit; however, they are subject to downgrading (not throttling) when capacity is constrained&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_JvcW6D5.max-1000x1000.jpg"
        
          alt="4"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="mea1l"&gt;Ramping priority requests within the limit mitigates downgrading and ensures a good experience&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
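&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;On the client side, a gradual ramp can be as simple as stepping the target request rate up in bounded increments. The Python sketch below is one way to do that under stated assumptions: the function name is illustrative, and the step size is a placeholder you would choose based on the documented ramp limit, which this article does not quantify.&lt;/span&gt;&lt;/p&gt;

```python
def ramp_schedule(start_rate, target_rate, max_step):
    """Return per-interval request rates that climb from start to target.

    Each interval raises the rate by at most `max_step`, so the ramp is
    gradual instead of a single jump. `max_step` is a hypothetical knob,
    not a value published by the service.
    """
    if not max_step > 0:
        raise ValueError("max_step must be positive")
    if start_rate > target_rate:
        raise ValueError("target_rate must be at least start_rate")
    rates = [start_rate]
    while target_rate > rates[-1]:
        rates.append(min(rates[-1] + max_step, target_rate))
    return rates


if __name__ == "__main__":
    # Illustrative numbers only: ramp from 100 to 500 requests per interval.
    print(ramp_schedule(100, 500, 150))  # [100, 250, 400, 500]
```
&lt;/div&gt;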
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can monitor your Priority PayGo usage by following this &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/priority-paygo#verify-usage"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style="vertical-align: baseline;"&gt;For the uncompromising workload: Provisioned Throughput (PT)&lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;When your gen AI  workload is absolutely business-critical and you need an explicit availability guarantee, it's time to consider PT. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With PT, you reserve a specific amount of model processing capacity for a fixed monthly cost. This is the only way to get an availability SLA. While a standard PayGo model has an &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;uptime&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; SLA (the model is up), PT provides an &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;availability&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; SLA (your requests will be processed).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To see why, look more closely at how the SLA defines “error rate”: &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;the number of Valid Requests that result in a response with HTTP Status 5XX and Code "Internal Error" divided by the total number of Valid Requests during that period, subject to a minimum of 2000 Valid Requests in the measurement period.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Standard PayGo returns a 429 (“Resource exhausted”) when capacity is unavailable, so the call is not counted in the error rate. Standard Provisioned Throughput behaves differently: when you use less than your purchased amount, errors that might otherwise be 429s are returned as 5XX and count toward the SLA error rate. This is what defines the SLA difference between PT and PayGo.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This makes Provisioned Throughput the ideal choice for:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Large, predictable production workloads.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Applications with strict performance requirements where throttling is not an option.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
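&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the error-rate definition above more concrete, here is a minimal Python sketch. It is an illustration only, not the official SLA calculation: the function name, the list of status codes as input, and the choice to treat every 5XX response as an “Internal Error” are simplifying assumptions.&lt;/span&gt;&lt;/p&gt;

```python
from typing import Optional, Sequence

MIN_VALID_REQUESTS = 2000  # minimum from the definition quoted above


def sla_error_rate(status_codes: Sequence[int]) -> Optional[float]:
    """Return the share of valid requests that ended in a 5XX response.

    Returns None when the period has fewer than 2000 valid requests,
    because the definition does not apply below that minimum.
    Simplification (assumption): every 5XX counts as "Internal Error".
    """
    total = len(status_codes)
    if MIN_VALID_REQUESTS > total:
        return None
    server_errors = sum(1 for code in status_codes if code // 100 == 5)
    return server_errors / total


# Hypothetical measurement period. The 429 responses are not 5XX,
# so they are not counted as errors in this sketch.
period = [200] * 1960 + [429] * 30 + [500] * 10
print(sla_error_rate(period))  # 10 / 2000 = 0.005
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In this sketch, an error that surfaces as 5XX instead of 429 moves from the uncounted group into the numerator, which is the difference between the two SLAs described above.&lt;/span&gt;&lt;/p&gt;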
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Fine-grained control over your PT requests &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By default, any usage above your PT order automatically spills over to PayGo. However, you can control this behavior at the request level using HTTP headers:&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="vertical-align: baseline;"&gt;Prevent overages: To ensure you never exceed your PT commitment and deny any excess requests, add the &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;dedicated&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; header. This is useful for strict budget control.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-code"&gt;&lt;pre&gt;&lt;code&gt;{&amp;quot;X-Vertex-AI-LLM-Request-Type&amp;quot;: &amp;quot;dedicated&amp;quot;}&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="padding-left: 40px;"&gt;&lt;span style="vertical-align: baseline;"&gt;Bypass PT on-demand: To intentionally send a lower-priority request to the PayGo pool even though you have a PT order, use the &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;shared&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; header. This is perfect for experimenting or running non-critical jobs without consuming your reserved capacity.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-code"&gt;&lt;pre&gt;&lt;code&gt;{&amp;quot;X-Vertex-AI-LLM-Request-Type&amp;quot;: &amp;quot;shared&amp;quot;}&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;
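&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you set these headers from application code, a small helper can keep the choice in one place. The sketch below is one possible way to do it: the header name and the two values come from this post, while the function name and the “default” mode are hypothetical. It only builds the header dictionary, and you would pass the result to whichever HTTP client or SDK option you already use.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;

```python
HEADER_NAME = "X-Vertex-AI-LLM-Request-Type"


def pt_routing_headers(mode: str = "default") -> dict:
    """Build the extra HTTP header for one request.

    mode is a hypothetical helper argument:
      "default"   -> no header, usage above the PT order spills over to PayGo
      "dedicated" -> PT only, excess requests are denied
      "shared"    -> bypass PT and use the PayGo pool
    """
    if mode == "default":
        return {}
    if mode in ("dedicated", "shared"):
        return {HEADER_NAME: mode}
    raise ValueError(f"unknown mode: {mode!r}")


print(pt_routing_headers("dedicated"))
print(pt_routing_headers("shared"))
```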
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Monitoring your investment&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can closely monitor your Provisioned Throughput usage using Cloud Monitoring metrics on the &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;aiplatform.googleapis.com/PublisherModel&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; resource. Key metrics include:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;/dedicated_gsu_limit&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;: Your dedicated limit in Generative Scale Units (GSUs).&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;/consumed_token_throughput&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;: Your actual throughput usage, accounting for the model's burndown rate.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;/dedicated_token_limit&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;: Your dedicated limit measured in tokens per second.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This allows you to ensure you are getting the value you paid for and helps you right-size your commitment over time. To learn more about PT on Vertex AI, visit our guide &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/ai-machine-learning/provisioned-throughput-on-vertex-ai?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
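&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As a rough illustration of right-sizing, the Python sketch below turns a handful of metric readings into a utilization figure. The numbers are made up, the function name is hypothetical, and the sketch assumes that the consumed throughput and the dedicated token limit are expressed in the same unit. It does not call Cloud Monitoring.&lt;/span&gt;&lt;/p&gt;

```python
from statistics import mean


def utilization(consumed: float, limit: float) -> float:
    """Fraction of the dedicated limit in use for one reading."""
    if not limit > 0:
        raise ValueError("limit must be positive")
    return consumed / limit


# Hypothetical readings of /consumed_token_throughput over time.
samples = [1200.0, 3000.0, 4500.0, 2400.0]
# Hypothetical value of /dedicated_token_limit.
dedicated_limit = 6000.0

ratios = [utilization(sample, dedicated_limit) for sample in samples]
print(f"average utilization: {mean(ratios):.0%}")
print(f"peak utilization: {max(ratios):.0%}")
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;A peak that stays well below 100% over a long period may suggest the order is larger than you need, while a peak pinned at 100% may suggest you are regularly spilling over.&lt;/span&gt;&lt;/p&gt;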
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Building your recipe: Combining options for optimal results&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Consider a workload with a predictable daily baseline, expected peaks, and the occasional unexpected spike. The optimal recipe would be:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Provisioned Throughput: Cover your predictable, mission-critical baseload. This gives you an availability SLA for the core of your application.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Priority PayGo: Use this to handle predictable peaks that rise above your PT commitment or for important traffic that is less frequent. This acts as a cost-effective insurance policy against &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;429&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; errors for your most important variable traffic.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Standard PayGo (within tier limit): This forms your foundation for general, non-critical traffic that fits comfortably within your organization's usage tier.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Standard PayGo (opportunistic bursting): For non-critical, latency-insensitive jobs (like batch processing), you can rely on the best-effort bursting of the standard PayGo model. If some of these requests are throttled, it won't impact your core user experience, and you don't pay a premium for them.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By understanding and combining these powerful tools, you can move beyond simply managing costs and start truly optimizing your GenAI strategy for the perfect balance of performance, availability, and value.&lt;/span&gt;&lt;/p&gt;
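&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;One way to read the recipe above is as a simple decision rule per request. The Python sketch below is a hypothetical helper, not a Google Cloud API: the three boolean inputs and the function name are assumptions, and your own classification of traffic may need more nuance.&lt;/span&gt;&lt;/p&gt;

```python
def choose_option(critical: bool, latency_sensitive: bool, pt_capacity_left: bool) -> str:
    """Map one request to a consumption option, following the recipe above."""
    if critical and pt_capacity_left:
        # Mission-critical baseload is covered by the PT commitment.
        return "Provisioned Throughput"
    if critical:
        # Important traffic that rises above the PT commitment.
        return "Priority PayGo"
    if latency_sensitive:
        # General, non-critical traffic within the usage tier.
        return "Standard PayGo (within tier limit)"
    # Non-critical, latency-insensitive jobs such as batch processing.
    return "Standard PayGo (opportunistic bursting)"


print(choose_option(critical=True, latency_sensitive=True, pt_capacity_left=False))
```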
&lt;h2&gt;&lt;span style="vertical-align: baseline;"&gt;Extra bonus: Batch API and Flex PayGo &lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Let’s start with the Batch API. Not every LLM request needs a sub-second time-to-first-token (TTFT). If a user is chatting with a customer service bot, low latency is critical. But if you are classifying millions of support tickets from last month, running evaluations, or generating daily summary reports, nobody is sitting at a screen waiting for a real-time stream. This is where the Gemini Batch API becomes your best friend. You can bundle a large payload of requests into a single file and submit it asynchronously. The infrastructure processes these workloads during off-peak windows or when idle compute capacity is available. The target turnaround time is 24 hours, though in practice, it is typically much faster. By trading immediate execution for asynchronous processing, &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;you get a 50% discount on standard token costs&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;While Batch handles your offline heavy lifting, your live apps still need real-time computation. But not all requests are latency-sensitive, and some customers are willing to wait a little longer in exchange for a discount on standard token costs. Flex PayGo provides a highly cost-effective way to access Gemini models, offering a &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;50% discount compared to Standard PayGo&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. Optimized for non-critical workloads that can accommodate response times of up to 30 minutes, it allows for seamless transitions between Provisioned Throughput (PT), Standard PayGo, and Flex PayGo with minimal code changes. Ideal use cases include:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Offline analysis of text and multimodal files.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Model quality evaluation and benchmarking.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Data annotation and labeling.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Automated product catalog generation.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
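&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To get a feel for what the 50% discount means for a mixed workload, here is a small Python sketch. The spend figure and the share of deferrable traffic are hypothetical, and the function name is an assumption. Only the 50% discount comes from this post.&lt;/span&gt;&lt;/p&gt;

```python
from decimal import Decimal

DISCOUNT = Decimal("0.50")  # 50% discount stated above for Batch and Flex PayGo


def blended_cost(standard_cost: Decimal, deferrable_share: Decimal) -> Decimal:
    """Cost after moving the deferrable share of traffic to Batch or Flex PayGo."""
    if deferrable_share > 1 or 0 > deferrable_share:
        raise ValueError("deferrable_share must be between 0 and 1")
    real_time = standard_cost * (1 - deferrable_share)
    deferred = standard_cost * deferrable_share * (1 - DISCOUNT)
    return (real_time + deferred).quantize(Decimal("0.01"))


# Hypothetical: 1000.00 of Standard PayGo spend, 40% of it can wait.
print(blended_cost(Decimal("1000.00"), Decimal("0.40")))  # 800.00
```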
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Get started &lt;/span&gt;&lt;/h3&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Explore the Models in Vertex AI:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Discover the full range of Google's first-party models as well as over &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;100 open-source models available&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; in the Model Garden.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Dive deeper into the documentation:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For the most up-to-date technical details, thresholds, and code samples, the official &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is your source of truth.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Review pricing details:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Get a detailed breakdown of token costs, Provisioned Throughput pricing, and the latest discounts for Batch and Flex APIs on the &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai/pricing?e=48754805&amp;amp;hl=en" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI pricing page&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;</description><pubDate>Mon, 13 Apr 2026 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/ai-machine-learning/build-a-robust-and-cost-effective-gen-ai-strategy/</guid><category>Cost Management</category><category>AI &amp; Machine Learning</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How to find the sweet spot between cost and performance</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/ai-machine-learning/build-a-robust-and-cost-effective-gen-ai-strategy/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Federico Vibrati</name><title>Technical Account Manager, Google Cloud</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Federico Preli</name><title>Data and AI Architect, Google Cloud</title><department></department><company></company></author></item><item><title>Simpler billing, clearer savings: A FinOps guide to updated spend-based CUDs</title><link>https://cloud.google.com/blog/topics/cost-management/a-finops-professionals-guide-to-updated-spend-based-cuds/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Optimizing cloud spend is one of the most rewarding aspects of FinOps — and committed use discounts (CUDs) remain one of the most effective levers to pull.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In July 2025, we began rolling out &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-multiprice"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;updates to the spend-based CUD model&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to make it easier to understand your costs and savings, expand coverage to new SKUs (including Cloud Run and H3/M-series VMs), and offer increased flexibility. These changes are now available to all customers. Let’s dive into how this new model simplifies your FinOps practice.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;1. What is the spend-based CUD data change all about? &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The most important shift is the move from a credit-based system to a &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;direct discounted price model using &lt;/strong&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-multiprice#consumption-model-intro"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;consumption models.&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Under the old &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;credits model&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, you committed to an hourly on-demand amount. To find your &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;savings&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; (the actual cost reduction realized), you had to use three different numbers: the full on-demand cost, the commitment fee, and the offsetting credit.&lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;1. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;The old math:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="2" style="list-style-type: lower-alpha; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;$10.00 (On-demand) + $5.50 (Commitment fee) - $10.00 (Credit) = $5.50 (Net Cost)&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="2" style="list-style-type: lower-alpha; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Savings = $10.00 (On-demand) - $5.50 (Net costs) = $4.50&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the new &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-multiprice#consumption-model-intro"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;direct discount model&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, you don’t need to do that math to calculate your net costs. You commit directly to the net, discounted spend amount. Your usage is simply billed at that discounted rate.&lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;2. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;The new math:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;/p&gt;
&lt;ol style="list-style-type: lower-alpha;"&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;$5.50 (Discounted costs)&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Savings = $10.00 (On-demand) - $5.50 (Discounted costs) = $4.50&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;  &lt;/strong&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can now see your net cost at a glance, and calculating the savings only requires comparing the on-demand price ($10.00) to your new discounted cost ($5.50), which equals &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;$4.50/hr.&lt;/strong&gt;&lt;/p&gt;
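&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The same arithmetic can be written as a short Python sketch, using the hourly figures from the example above. The function names are hypothetical and are only there to show that both models arrive at the same savings.&lt;/span&gt;&lt;/p&gt;

```python
from decimal import Decimal


def old_model_net_cost(on_demand: Decimal, commitment_fee: Decimal, credit: Decimal) -> Decimal:
    """Credits model: on-demand cost plus commitment fee, minus the offsetting credit."""
    return on_demand + commitment_fee - credit


def savings(on_demand: Decimal, net_cost: Decimal) -> Decimal:
    """Savings are the on-demand cost minus what you actually pay."""
    return on_demand - net_cost


on_demand = Decimal("10.00")

# Old math: three numbers are needed to reach the net cost.
old_net = old_model_net_cost(on_demand, Decimal("5.50"), Decimal("10.00"))
# New math: usage is billed directly at the discounted rate.
new_net = Decimal("5.50")

print(old_net, savings(on_demand, old_net))  # 5.50 4.50
print(new_net, savings(on_demand, new_net))  # 5.50 4.50
```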
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;2. How do I validate my savings before and after the changes?  &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The unified &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/billing/docs/how-to/analyze-cuds"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;CUD Analysis tool&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is your best resource for auditing the migration or performing deep-dives on your spend. CUD Analysis for the new spend-based CUD model lets you quickly verify the savings you are getting with the new model and confirm that your savings didn’t change between the old and new models. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can validate your savings by following these steps:&lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;1. Identify the date when the migration took place; you can see the migration date in the billing overview page.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_jzjRx1j.max-1000x1000.png" alt="1"&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;2. Go to CUD Analysis to validate the savings before and after the migration. &lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;3. To quantify costs from before the migration:&lt;/span&gt;&lt;/p&gt;
&lt;ol style="list-style-type: lower-alpha;"&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Filter the view for one day before the migration, in this case &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Oct. 26, 2025.&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;Select a CUD Product, for example &lt;strong style="vertical-align: baseline;"&gt;Cloud SQL CUD.&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;In our example, we paid a $50.35 CUD fee to get a $69.12 credit. Subtracting that fee from the credit gives the actual take-home &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;savings of $18.77&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_2jbhCzc.max-1000x1000.png" alt="2"&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;4. To validate costs after the migration:&lt;/span&gt;&lt;/p&gt;
&lt;ol style="list-style-type: lower-alpha;"&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Change the date to &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Oct. 28, 2025.&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;Under the new model, you pay the discounted rates upfront. Your dashboard will reflect a Net Cost of $50.35, compared to the $69.12 on-demand cost, clearly showing your &lt;strong style="vertical-align: baseline;"&gt;$18.77 in savings.&lt;/strong&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_nQjMUwd.max-1000x1000.png" alt="3"&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This release also includes &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-verify-discounts#example_cost_reports"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;an update to &lt;/span&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Reports&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; that adds “Savings Programs,” which reflects your actual net savings ($18.77 in our example above) rather than the gross credit. &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;When comparing pre- and post-migration data in Cost Reports, ensure you include both usage SKUs and commitment fee SKUs to capture the full scope of the commitment.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;3. What other capabilities are in the new CUD Analysis?&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Beyond support for the new model, the new &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/billing/docs/how-to/analyze-cuds"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;CUD Analysis tool&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; offers deeper visibility into your CUD coverage and CUD utilization. You can now analyze your CUDs with &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;hourly data granularity&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; for up to 30 days. This is a major improvement for FinOps teams, as daily averages often hide underutilization spikes that occur during specific hours.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_HLosdOT.max-1000x1000.png" alt="4"&gt;
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="rirdr"&gt;CUD Analysis: Compute Flexible CUD coverage analysis&lt;/p&gt;&lt;/figcaption&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/5_9A7ZjUx.max-1000x1000.png" alt="5"&gt;
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="rirdr"&gt;CUD Analysis: Per CUD purchase utilization visibility&lt;/p&gt;&lt;/figcaption&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
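&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To see why hourly granularity matters, consider the small Python sketch below. All numbers are hypothetical, and the calculation is a simplified illustration rather than how CUD Analysis computes utilization. It shows a day where average usage exceeds the commitment even though the commitment sits partly unused for four hours.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;

```python
from statistics import mean

commitment = 10.0  # hypothetical committed spend per hour, in dollars
# Hypothetical eligible usage per hour: 20 busy hours followed by 4 quiet hours.
hourly_usage = [12.0] * 20 + [2.0] * 4

# Daily view: average usage compared with the commitment.
daily_average = mean(hourly_usage)

# Hourly view: usage above the commitment cannot use more than 100% of it.
hourly_utilization = [min(usage, commitment) / commitment for usage in hourly_usage]
underused_hours = [hour for hour, ratio in enumerate(hourly_utilization) if 1.0 > ratio]

print(f"daily average usage: {daily_average:.2f} (commitment: {commitment:.2f})")
print(f"hourly-based utilization: {mean(hourly_utilization):.1%}")
print(f"underused hours: {underused_hours}")
```

&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Looking only at the daily average, this commitment appears fully used. The hourly view shows the quiet hours where part of it goes to waste.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;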
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you want to use your own data analysis tools, we offer a new &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/billing/docs/how-to/export-data-bigquery-tables/cud-export"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;spend-based CUD metadata export&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;that lets you manage your spend-based CUDs programmatically. You can join this export with the Billing BigQuery Export datasets to run in-depth analysis on all your commitment data. You can also export &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/billing/docs/how-to/analyze-cuds#download_your_report"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;a CSV from the CUD Analysis view&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to see the raw data for every resource and its price without needing the full BigQuery export.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;4. How much commitment should I buy? &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-recommender"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;CUD recommendations&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; are the primary tool for determining how much of a commitment to purchase. We recently enhanced our Compute Flexible CUD commitment recommendations to provide greater accuracy by including data from GKE, Cloud Run, Cloud Run Functions, and Compute Engine. Additionally, CUD scenario modeling allows you to adjust these suggestions in real time. You can adjust coverage thresholds, filter out specific dates with irregular usage, or extend the lookback analysis window to up to 180 days to identify the exact commitment level that aligns with your specific risk profile.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
    &lt;figure class="article-image--large h-c-grid__col h-c-grid__col--6 h-c-grid__col--offset-3"&gt;
        &lt;img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/6_MpUcC4f.max-1000x1000.png" alt="6"&gt;
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="rirdr"&gt;CUD scenario modeling: experiment with multiple options to identify your ideal CUD strategy&lt;/p&gt;&lt;/figcaption&gt;
    &lt;/figure&gt;
      &lt;/div&gt;
    &lt;/div&gt;
&lt;/div&gt;
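&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you want to reason about the trade-off yourself, the Python sketch below shows one simple way to pick a commitment level from historical hourly spend. It is not the algorithm behind CUD recommendations or scenario modeling. The function name, the sample data, and the idea of excluding hours by index are all assumptions made for illustration.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;

```python
import math
from typing import Iterable, Sequence


def commitment_for_target(hourly_spend: Sequence[float],
                          target_share_of_hours: float,
                          excluded_hours: Iterable[int] = ()) -> float:
    """Largest hourly commitment that is fully used in at least the target share of hours.

    excluded_hours lists indexes of hours with irregular usage to ignore.
    """
    skip = set(excluded_hours)
    kept = [spend for hour, spend in enumerate(hourly_spend) if hour not in skip]
    if not kept:
        raise ValueError("no hours left after exclusions")
    ordered = sorted(kept, reverse=True)
    # The k-th largest value is reached or exceeded in at least k hours.
    k = max(1, math.ceil(target_share_of_hours * len(ordered)))
    return ordered[k - 1]


# Hypothetical hourly spend with one irregular spike at index 4.
history = [8.0, 10.0, 12.0, 9.0, 30.0, 11.0, 10.0, 9.0]

print(commitment_for_target(history, 0.9, excluded_hours=[4]))  # conservative: 8.0
print(commitment_for_target(history, 0.5, excluded_hours=[4]))  # more aggressive: 10.0
```

&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;A higher target keeps the commitment almost always fully used but covers less of your spend, while a lower target covers more spend at the risk of paying for unused commitment in quiet hours.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;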
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;5. Is there anything else I should know about Flex CUDs? &lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the release of the new spend-based model, we’ve addressed the &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;reporting limitation&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; affecting customers who use a combination of &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Flex CUDs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and GKE/Cloud Run CUDs. Previously, our analysis tools were unable to accurately identify the source of specific credits, leading to discrepancies in KPI metrics like savings, coverage, and utilization. Under the new spend-based CUD model, this limitation has been corrected, so your CUD analysis now provides an accurate, granular view of your savings per Google Cloud service.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To begin navigating the updated spend-based model, visit the Billing console. You can learn more in our documentation:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;a href="https://cloud.google.com/docs/cuds-multiprice"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Enhancements to the Spend-based CUD program &lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;a href="https://cloud.google.com/docs/cuds-multiprice-datamodel"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Insights into the multi-price data model&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;a href="https://docs.cloud.google.com/docs/cuds-verify-discounts"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Verify your savings post-migration&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-related_article_tout"&gt;





&lt;div class="uni-related-article-tout h-c-page"&gt;
  &lt;section class="h-c-grid"&gt;
    &lt;a href="https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/"
       data-analytics='{
                       "event": "page interaction",
                       "category": "article lead",
                       "action": "related article - inline",
                       "label": "article: {slug}"
                     }'
       class="uni-related-article-tout__wrapper h-c-grid__col h-c-grid__col--8 h-c-grid__col-m--6 h-c-grid__col-l--6
        h-c-grid__col--offset-2 h-c-grid__col-m--offset-3 h-c-grid__col-l--offset-3 uni-click-tracker"&gt;
      &lt;div class="uni-related-article-tout__inner-wrapper"&gt;
        &lt;p class="uni-related-article-tout__eyebrow h-c-eyebrow"&gt;Related Article&lt;/p&gt;

        &lt;div class="uni-related-article-tout__content-wrapper"&gt;
          &lt;div class="uni-related-article-tout__image-wrapper"&gt;
            &lt;div class="uni-related-article-tout__image" style="background-image: url('')"&gt;&lt;/div&gt;
          &lt;/div&gt;
          &lt;div class="uni-related-article-tout__content"&gt;
            &lt;h4 class="uni-related-article-tout__header h-has-bottom-margin"&gt;Save more with expanded coverage for Compute Flex CUDs&lt;/h4&gt;
            &lt;p class="uni-related-article-tout__body"&gt;Compute Flexible Committed Use Discounts (Flex CUDs) now cover memory-optimized and HPC VM families and Cloud Run.&lt;/p&gt;
            &lt;div class="cta module-cta h-c-copy  uni-related-article-tout__cta muted"&gt;
              &lt;span class="nowrap"&gt;Read Article
                &lt;svg class="icon h-c-icon" role="presentation"&gt;
                  &lt;use xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="#mi-arrow-forward"&gt;&lt;/use&gt;
                &lt;/svg&gt;
              &lt;/span&gt;
            &lt;/div&gt;
          &lt;/div&gt;
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/a&gt;
  &lt;/section&gt;
&lt;/div&gt;

&lt;/div&gt;</description><pubDate>Thu, 12 Feb 2026 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/a-finops-professionals-guide-to-updated-spend-based-cuds/</guid><category>Compute</category><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Simpler billing, clearer savings: A FinOps guide to updated spend-based CUDs</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/a-finops-professionals-guide-to-updated-spend-based-cuds/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Alfonso Hernandez</name><title>Sr. Product Manager</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Rahul Sharma</name><title>Sr. Product Manager</title><department></department><company></company></author></item><item><title>Automating FinOps cost management policies using Workload Manager</title><link>https://cloud.google.com/blog/topics/cost-management/automate-financial-governance-policies-using-workload-manager/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Do you find yourself battling surprise cloud bills? Do you spend more time tracking down un-tagged resources and chasing development teams than you do on strategic financial planning? In the fast-paced world of cloud, manual cost management is a losing game. It’s time-consuming, prone to errors, and often, by the time you’ve identified a cost anomaly, it's too late to prevent the impact. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;What if you could codify your financial governance policies and automate their enforcement across your entire Google Cloud organization? Enter Workload Manager (WLM), a powerful tool that lets you automate the validation of your cloud workloads against best practices for security and compliance, including your own custom-defined FinOps rules. Better yet, we recently slashed the cost of using Workload Manager by up to 95% for certain scenarios, letting you run large-scale scans more economically, including a small free tier to help you run small-scale tests. In this blog, we show you how to get started with automated financial governance policies in Workload Manager, so you can stop playing catch-up and start proactively managing your cloud spend.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;The challenge with manual FinOps&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Managing business-critical workloads in the cloud is complex. Staying on top of cost-control best practices is a significant and time-consuming effort. Manual reviews and audits can take weeks or even months to complete, by which time costs can spiral. This manual approach often leads to "configuration drift," where systems deviate from your established cost management policies, making it difficult to detect and control spending.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Workload Manager helps you break free from these manual constraints by providing a framework for automated, continuous validation, helping FinOps teams to:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Improve standardization:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Decouple team dependencies and drive consistent application of cost-control policies across the organization.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enable ownership:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Empower individual teams to build and manage their own detection rules for specific use cases, fostering a culture of financial accountability.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Simplify auditing:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Easily run infrastructure checks across your entire organization and consolidate the findings into a single BigQuery dataset for streamlined reporting and analysis.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By codifying your FinOps policies, you can define them once and run continuous scans to detect violations across your entire cloud environment on a regular schedule.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Workload Manager makes this easy, providing you with out-of-the-box rules across Security, Cost, Reliability etc. Here are some examples of FinOps cost management policies that can be automated with Workload Manager:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Must have required label or tag for a specific google cloud resource (eg: BigQuery dataset)&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Enforce lifecycle management or autoclass configuration for every cloud storage bucket&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Ensure appropriate data retention is set for storage (eg: BigQuery tables)&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Disable simultaneous multi-threading to optimize licensing costs (eg: SQL Server)&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Figure_-_1.max-1000x1000.png"
        
          alt="Figure - 1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure - 1: Default Workload Manager policies as per Google Cloud best practices&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Don't find what you need? You can always build your own custom policies using examples in our Git repo.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Let’s take a closer look. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Automating FinOps policies: A step-by-step guide&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Here’s how you can use Workload Manager to automate your cost management policies.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 1: Define your FinOps rules and create a new evaluation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;First, you need to translate your cost management policies into a format that the Workload Manager can understand. The tool uses Open Policy Agent (OPA) Rego for defining custom rules. In this blog we will take a primary use case for FinOps — that is, to ensure resources are properly labeled for cost allocation and showback.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can choose from hundreds of &lt;/span&gt;&lt;a href="https://cloud.google.com/workload-manager/docs/reference/best-practices-general"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;predefined rules&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; authored by Google Cloud experts that cover FinOps, reliability, security, and operations according to the Google Cloud best practices or create and customize your own rules (checkout examples from the &lt;/span&gt;&lt;a href="https://github.com/GoogleCloudPlatform/workload-manager/tree/main/rules" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud GitHub repository&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;). In our example we will use one of the predefined ‘Google Cloud Best Practices’ rules for bigquery-missing-labels on a dataset. In this case, navigate to the Workload Manager section in your Google Cloud Console and start by creating a new evaluation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Give your evaluation a name and select "Custom" as the workload type. This is where you can point Workload Manager to the Cloud Storage bucket that contains your custom FinOps rules if you’ve built one. The experience allows you to run both pre-defined and custom rule checks in one evaluation.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Figure_-_2.max-1000x1000.png"
        
          alt="Figure - 2"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure 2 - Creating new evaluation rule&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 2: Define the scope of your scan&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Next, define the scope of your evaluation. You have the flexibility to scan your entire Google Cloud organization, specific folders, or individual projects. This allows you to apply broad cost-governance policies organization-wide, or create more targeted rules for specific teams or environments. You can also apply filters based on resource labels or names for more granular control. In this example, region selection lets you select where you want to process your data to meet data residency requirements.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Figure_-_3.max-1000x1000.png"
        
          alt="Figure - 3"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure 3 - Selecting scope and location for your evaluation rule&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 3: Schedule and notify&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With FinOps, automation is key. You can schedule your evaluation to run at a specific cadence, from hourly to monthly. This helps ensure continuous monitoring and provides a historical record of your policy compliance. Optionally, but highly recommended for FinOps, you can configure the evaluation to save all results to a BigQuery dataset for historical analysis and reporting. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can also set up notifications to alert the right teams when an issue is found. Channels include email, Slack, PagerDuty, and more, so that policy violations can be addressed promptly.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/figure_-_4.max-1000x1000.png"
        
          alt="figure - 4"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure 4 - Export, schedule and notify evaluation rules&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 4: Run, review, and report&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Once saved, the evaluation will run on your defined schedule, or you can trigger it on-demand. The results of each scan are stored, providing a historical view of your compliance posture&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;From the Workload Manager dashboard, you can see a summary of scanned resources, issues found, and trends over time. For deeper analysis, you can explore the violation data directly in the BigQuery dataset you configured earlier.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/figure_-_5.max-1000x1000.png"
        
          alt="figure - 5"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure - 5: Checkout evaluations for workload manager&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Visualize findings with Looker Studio&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the data accessible and actionable for all stakeholders, you can easily connect your BigQuery results to Looker Studio. Create interactive dashboards that visualize your FinOps policy violations, such as assets missing required labels or resources that don't comply with cost-saving rules. This provides a clear, at-a-glance view of your cost governance status.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can find Looker Studio template in template gallery and easily connect it with your datasets and modify as needed. Here is how you can use it:&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Go to Looker studio. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Navigate to Templates and under Bigquery, select &lt;/span&gt;&lt;a href="https://lookerstudio.google.com/c/reporting/e146051d-f7fd-406c-a62c-290fa2fee749/preview/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Workload Manager&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Click on “Use your own Data” that asks for connecting the Bigquery table generated in previous steps. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;After you have connected the Bigquery dataset,  lick on Edit to create a customizable copy to incorporate any changes or share it with your team. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/figure_6_rqgAwFk.max-1000x1000.png"
        
          alt="figure 6"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="u9ctq"&gt;Figure - 6: Set up preconfigured Looker Studio dashboard for reporting&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Take control of your cloud costs today&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Stop the endless cycle of manual cloud cost management. With Workload Manager, you can embed your FinOps policies directly into your cloud environment, automate enforcement, and provide teams with the feedback they need to stay on budget. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Ready to get started? Explore the &lt;/span&gt;&lt;a href="https://github.com/GoogleCloudPlatform/workload-manager" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;sample policies on GitHub&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and check out the &lt;/span&gt;&lt;a href="https://cloud.google.com/workload-manager/docs/evaluate/custom-rules/about-custom-rules"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;official documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to begin automating your FinOps framework today, and take advantage of Workload Manager’s new pricing.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Check out a quick overview video on how Workload Manager Evaluations helps you do a lot more across Security, Reliability and FinOps.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-video"&gt;



&lt;div class="article-module article-video "&gt;
  &lt;figure&gt;
    &lt;a class="h-c-video h-c-video--marquee"
      href="https://youtube.com/watch?v=sWwvdkLyA6A"
      data-glue-modal-trigger="uni-modal-sWwvdkLyA6A-"
      data-glue-modal-disabled-on-mobile="true"&gt;

      
        

        &lt;div class="article-video__aspect-image"
          style="background-image: url(https://storage.googleapis.com/gweb-cloudblog-publish/images/maxresdefault_73jse8f.max-1000x1000.jpg);"&gt;
          &lt;span class="h-u-visually-hidden"&gt;Google Cloud Configuration Management with Workload Manager&lt;/span&gt;
        &lt;/div&gt;
      
      &lt;svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white"&gt;
        &lt;use xlink:href="#mi-youtube-icon"&gt;&lt;/use&gt;
      &lt;/svg&gt;
    &lt;/a&gt;

    
  &lt;/figure&gt;
&lt;/div&gt;

&lt;div class="h-c-modal--video"
     data-glue-modal="uni-modal-sWwvdkLyA6A-"
     data-glue-modal-close-label="Close Dialog"&gt;
   &lt;a class="glue-yt-video"
      data-glue-yt-video-autoplay="true"
      data-glue-yt-video-height="99%"
      data-glue-yt-video-vid="sWwvdkLyA6A"
      data-glue-yt-video-width="100%"
      href="https://youtube.com/watch?v=sWwvdkLyA6A"
      ng-cloak&gt;
   &lt;/a&gt;
&lt;/div&gt;

&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Then, review the updated &lt;/span&gt;&lt;a href="https://cloud.google.com/workload-manager/pricing"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;pricing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn more.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Tue, 04 Nov 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/automate-financial-governance-policies-using-workload-manager/</guid><category>Management Tools</category><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Automating FinOps cost management policies using Workload Manager</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/automate-financial-governance-policies-using-workload-manager/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pathik Sharma</name><title>Cloud FinOps Lead, delta, Google Cloud Consulting</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Omkar Suram</name><title>Product Manager</title><department></department><company></company></author></item><item><title>Announcing the General Availability of Smarter, AI-powered Cost Anomaly Detection</title><link>https://cloud.google.com/blog/topics/cost-management/announcing-ga-of-cost-anomaly-detection/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Last year, we announced the public preview of &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection?e=48754805"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Anomaly 
Detection&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, an AI-powered product designed to eliminate one of the biggest anxieties of using the Cloud: unexpected costs. The goal was to provide a safety net that automatically identifies unusual spikes in spending, helping you catch issues before they become financial problems.&lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Today, we are excited to announce that &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Cost Anomaly Detection is now generally available (GA)&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, and it is more proactive, intelligent, and flexible. Best of all, anomaly alerts are now &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;on by default&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; for every customer across all projects, including the new ones, offering complete protection from day one.&lt;/span&gt;&lt;/p&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;What’s new in general availability?&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;For the GA release, we focused on making the service smarter, more automatic, more proactive, and more customizable to suit your specific needs. Here’s what’s new:&lt;/span&gt;&lt;/p&gt;
&lt;h4 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;1. Auto-alerts by default&lt;/strong&gt;&lt;/h4&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Insights into any deviations in your cloud costs should be the default. Protection from cost overruns should be constant and not require any configuration from your end. That's why we’ve &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;automatically enabled anomaly alerts for all customers &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;on all their projects. Default alerts will be sent to Billing Administrators; you can, of course, easily visit the billing console to manage and customize your alert preferences at any time. The alerts will take you to the Anomaly dashboard on the billing console, where you can easily see all the details related to the cost spike including the root causes.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_OGkA4fI.max-1000x1000.png"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="jhzb7"&gt;Anomaly Dashboard with Root Cause Analysis&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_yz7YJIQ.max-1000x1000.png"
        
          alt="2"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="jhzb7"&gt;Default alert configuration&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h4 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;2. Intelligent, AI-generated thresholds&lt;/strong&gt;&lt;/h4&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Will auto-alerts mean more noise and email spam? No. Our improved algorithm now provides &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;automated, AI-generated anomaly thresholds&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; based on your historical spending patterns. This intelligent baseline ensures you are only alerted to spikes that seem significant and unexpected, relative to your spend behavior. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_IsnAYqV.max-1000x1000.png"
        
          alt="3"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="jhzb7"&gt;Default threshold configuration&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;And while the AI-generated thresholds work out of the box, you still have the flexibility to override them with your own custom values, if needed. Customers who have already configured their own custom values but would like to leverage our AI-generated thresholds, can easily do so from the billing console at any time. &lt;/span&gt;&lt;/p&gt;
&lt;h4 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;3. More flexible filtering with percentage deviation&lt;/strong&gt;&lt;/h4&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We heard your feedback that every project has a different sensitivity to cost spikes. A $100 deviation might be critical for a small project but expected noise for a large one. To address this, we’ve introduced an &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;additional threshold for percentage deviation&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; that filters your anomaly dashboard and alerts not only on an absolute dollar value but also on a percentage change. This allows your alerts to stay relevant to your budget and scale. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_3mzgbJr.max-1000x1000.png"
        
          alt="4"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="jhzb7"&gt;Custom threshold configuration&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Don't worry — all anomalies are still captured and can be viewed at any time by simply removing the filters from your dashboard.&lt;/span&gt;&lt;/p&gt;
&lt;h4 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;4.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Immediate protection from day one&lt;/strong&gt;&lt;/h4&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;During the public preview, we offered anomaly detection only on projects that were at least 6 months old, because younger projects lacked sufficient spend history. Our improved algorithm now solves this "cold start" problem, making it possible to alert on anomalies even for &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;new accounts and projects with no prior spend history. &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;This helps ensure that you are protected on Google Cloud from the get-go. &lt;/span&gt;&lt;/p&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Get started today&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost Anomaly Detection is a core part of our FinOps capabilities that provides you with complete and predictable control over your cloud costs. When layered with Cloud Budgets, it creates a robust cost control strategy that works to prevent, detect, and contain runaway spend. And it remains free, offered as part of our comprehensive set of cost management tools.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Head over to your &lt;/span&gt;&lt;a href="https://console.cloud.google.com/billing" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;billing console&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to access this product and refer to our &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/manage-anomalies"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for more details.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 03 Nov 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/announcing-ga-of-cost-anomaly-detection/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Announcing the General Availability of Smarter, AI-powered Cost Anomaly Detection</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/announcing-ga-of-cost-anomaly-detection/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Shruthi Nambi</name><title>Product Manager</title><department></department><company></company></author></item><item><title>Three-part framework to measure the impact of your AI use case</title><link>https://cloud.google.com/blog/topics/cost-management/measure-the-value-and-impact-of-your-ai/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Generative AI is no longer just an experiment. The real challenge now is quantifying its value. For leaders, the path is clear: make AI projects drive business growth, not just incur costs. Today, we'll share a simple three-part plan to help you measure the effect and see the true worth of your AI initiatives.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This methodology connects your technology solution to a concrete business outcome. It creates a logical narrative that justifies investment and measures success.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;1. Define what success looks like (the value)&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The first step is to define the project's desired outcome by identifying its "value drivers." For any AI initiative, these drivers typically fall into four universal business categories:&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Operational efficiency &amp;amp; cost savings:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; This involves quantifying improvements to core business processes. Value is measured by reducing manual effort, optimizing resource allocation, lowering error rates in production or operations, or streamlining complex supply chains.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Revenue &amp;amp; growth acceleration:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; While many organizations initially focus on efficiency, true market leadership is achieved through growth. This category of value drivers is the critical differentiator, as it focuses on top-line impact. Value can come from accelerating time-to-market for new products, identifying new revenue streams through data analysis, or improving sales effectiveness and customer lifetime value.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Experience &amp;amp; engagement:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; This captures the enhancement of human interaction with technology. It applies broadly to improving customer satisfaction (CX), boosting employee productivity and morale with intelligent tools (EX), or creating more seamless partner experiences.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Strategic advancement &amp;amp; risk mitigation:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; This covers long-term competitive advantages and downside protection. Value drivers include accelerating R&amp;amp;D cycles, gaining market-differentiating insights from proprietary data, strengthening operational resiliency, or ensuring regulatory compliance and reducing fraud.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;2. Specify what it costs to succeed (your investment)&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The second part of the framework demands transparency regarding the investment. This requires a complete view of the Total Cost of Ownership (TCO), which extends beyond service fees to include model training, infrastructure, and the operational support needed to maintain the system. For a detailed guide, we encourage a review of our post, &lt;a href="https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud"&gt;How to calculate your AI costs on Google Cloud&lt;/a&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;3. State the ROI &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This is the synthesis of the first two steps. The ROI calculation makes the business case explicit by stating the time required to pay back the initial investment and the ongoing financial return the project will generate.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;The framework in action: An AI chatbot for customer service&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Now, let's apply the universal framework to a specific use case. Consider an e-commerce company implementing an AI chatbot. Here, the four general value drivers become tailored to the world of customer service.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 1: Define success (the value)&lt;br/&gt;&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;The team uses the customer-service-specific quadrants to build a comprehensive value estimate.&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;span style="vertical-align: baseline;"&gt;Quadrant 1: Operational efficiency&lt;/span&gt;&lt;/strong&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Reduced agent handling time: By automating 60% of routine inquiries, the company frees up thousands of agent hours. This enables agents to serve more customers or perhaps provide better quality service to premium customers. &lt;/span&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated hours saved: ~725 hrs (let's say this equates to $15,660 in monthly value)&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Lower onboarding &amp;amp; training costs: New agents become productive faster as the AI handles the most common questions, reducing the burden of repetitive training.&lt;/span&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated monthly value: $1,000&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong&gt;&lt;span style="vertical-align: baseline;"&gt;Quadrant 2: Revenue growth&lt;/span&gt;&lt;/strong&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;24/7 Sales &amp;amp; support: The chatbot assists customers and captures sales leads around the clock, converting shoppers who would otherwise leave.&lt;/span&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated monthly value: $5,000&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Improved customer retention: Faster resolution and a better experience lead to a small, measurable increase in customer loyalty and repeat purchases.&lt;/span&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated monthly value: $1,000&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong&gt;&lt;span style="vertical-align: baseline;"&gt;Quadrant 3: Customer and employee experience&lt;/span&gt;&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;Enhanced agent experience &amp;amp; retention: Human agents are freed from monotonous tasks to focus on complex, rewarding problems. This improves morale and reduces costly agent turnover.&lt;/span&gt;&lt;/span&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated monthly value: $500&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong&gt;&lt;span style="vertical-align: baseline;"&gt;Quadrant 4: Strategic enablement&lt;/span&gt;&lt;/strong&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;Expanding business to more languages: Enabling human agents to provide support in 15+ additional languages, thanks to the translation service built into the system.&lt;/span&gt;&lt;/span&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Estimated revenue increase: $1,750&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;&lt;span style="vertical-align: baseline;"&gt;Total &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;estimated monthly value&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; = $15,660 + $1,000 + $5,000 + $1,000 + $500 + $1,750 = $24,910&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 2: Define the cost (the investment)&lt;br/&gt;&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Following a TCO analysis from our earlier &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;blog post&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, we calculated the total ongoing monthly cost for the fully managed AI solution on Google Cloud would be approximately $2,700.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Step 3: State the ROI &lt;br/&gt;&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;The final story was simple and powerful. With a monthly value of around $25,000 and a cost of only $2,700, the project generated significant positive cash flow. The initial setup cost was paid back in less than two weeks, securing an instant "yes" from leadership.&lt;/span&gt;&lt;/p&gt;
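&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The arithmetic behind this example can be sketched in a few lines of Python. The monthly values and the $2,700 monthly cost come from the steps above. The one-time setup cost is not stated in this post, so the $10,000 figure below is purely a hypothetical placeholder, as are the function names and the 30-day month.&lt;/span&gt;&lt;/p&gt;

```python
# Monthly value drivers from Step 1 (dollars per month).
MONTHLY_VALUE = {
    "reduced agent handling time": 15660,
    "lower onboarding and training costs": 1000,
    "24/7 sales and support": 5000,
    "improved customer retention": 1000,
    "agent experience and retention": 500,
    "support in additional languages": 1750,
}

# Ongoing monthly cost from Step 2 (dollars per month).
MONTHLY_COST = 2700

# HYPOTHETICAL: the post does not state the one-time setup cost.
ASSUMED_SETUP_COST = 10000


def net_monthly_benefit(values, monthly_cost):
    """Total monthly value minus the ongoing monthly cost."""
    return sum(values.values()) - monthly_cost


def payback_days(setup_cost, net_benefit, days_per_month=30):
    """Days needed for the net monthly benefit to cover the setup cost."""
    return setup_cost / net_benefit * days_per_month


total_value = sum(MONTHLY_VALUE.values())
net = net_monthly_benefit(MONTHLY_VALUE, MONTHLY_COST)
value_to_cost = total_value / MONTHLY_COST
days = payback_days(ASSUMED_SETUP_COST, net)

print("Total monthly value:", total_value)  # 24910
print("Net monthly benefit:", net)          # 22210
print("Value-to-cost ratio:", round(value_to_cost, 1))
print("Payback in days (assumed setup cost):", round(days, 1))
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Replace the assumed setup cost with your own figure. Under this placeholder, the payback period lands inside the two-week window described above.&lt;/span&gt;&lt;/p&gt;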
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Get started&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Contact us to consult with an expert &lt;/span&gt;&lt;a href="https://cloud.google.com/consulting/portfolio/value-realization-for-ai"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-related_article_tout"&gt;





&lt;div class="uni-related-article-tout h-c-page"&gt;
  &lt;section class="h-c-grid"&gt;
    &lt;a href="https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud/"
       data-analytics='{
                       "event": "page interaction",
                       "category": "article lead",
                       "action": "related article - inline",
                       "label": "article: {slug}"
                     }'
       class="uni-related-article-tout__wrapper h-c-grid__col h-c-grid__col--8 h-c-grid__col-m--6 h-c-grid__col-l--6
        h-c-grid__col--offset-2 h-c-grid__col-m--offset-3 h-c-grid__col-l--offset-3 uni-click-tracker"&gt;
      &lt;div class="uni-related-article-tout__inner-wrapper"&gt;
        &lt;p class="uni-related-article-tout__eyebrow h-c-eyebrow"&gt;Related Article&lt;/p&gt;

        &lt;div class="uni-related-article-tout__content-wrapper"&gt;
          &lt;div class="uni-related-article-tout__image-wrapper"&gt;
            &lt;div class="uni-related-article-tout__image" style="background-image: url('')"&gt;&lt;/div&gt;
          &lt;/div&gt;
          &lt;div class="uni-related-article-tout__content"&gt;
            &lt;h4 class="uni-related-article-tout__header h-has-bottom-margin"&gt;How to calculate your AI costs on Google Cloud&lt;/h4&gt;
            &lt;p class="uni-related-article-tout__body"&gt;Learn a comprehensive approach to manage expenses and maximize value from your AI investments on Google Cloud.&lt;/p&gt;
            &lt;div class="cta module-cta h-c-copy  uni-related-article-tout__cta muted"&gt;
              &lt;span class="nowrap"&gt;Read Article
                &lt;svg class="icon h-c-icon" role="presentation"&gt;
                  &lt;use xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="#mi-arrow-forward"&gt;&lt;/use&gt;
                &lt;/svg&gt;
              &lt;/span&gt;
            &lt;/div&gt;
          &lt;/div&gt;
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/a&gt;
  &lt;/section&gt;
&lt;/div&gt;

&lt;/div&gt;</description><pubDate>Thu, 11 Sep 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/measure-the-value-and-impact-of-your-ai/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Three-part framework to measure the impact of your AI use case</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/measure-the-value-and-impact-of-your-ai/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Eva Dong</name><title>AI Value Realization Lead, delta, Google Cloud Consulting</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pathik Sharma</name><title>Cloud FinOps Lead, delta, Google Cloud Consulting</title><department></department><company></company></author></item><item><title>Introducing no-cost, multicloud Data Transfer Essentials for EU and U.K. customers</title><link>https://cloud.google.com/blog/products/networking/new-for-the-uk-and-eu-no-cost-multicloud-data-transfer-essentials/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, our services are built with interoperability and openness in mind to enable customer choice and multicloud strategies. W&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;e pioneered a&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/data-analytics/introducing-bigquery-omni"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;multicloud data warehouse&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, enabling workloads to run across clouds. 
We were the first company to provide &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;digital sovereignty solutions&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for European governments and to&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/networking/eliminating-data-transfer-fees-when-migrating-off-google-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;waive exit fees&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;for customers who stop using Google Cloud.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We continue this open approach with the launch today of our new &lt;/span&gt;&lt;a href="https://cloud.google.com/data-transfer-essentials/docs/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Data Transfer Essentials&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; service for customers in the European Union and the United Kingdom. Built in response to the principles of cloud interoperability and choice outlined in the &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/navigating-the-eu-ai-act-google-clouds-proactive-approach"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;EU Data Act&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, Data Transfer Essentials is a new, simple solution for data transfers between Google Cloud and other cloud service providers. Although the Act allows cloud providers to pass through costs to customers, Data Transfer Essentials is available today at no cost to customers. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Designed for “in-parallel” processing of workloads belonging to the same organization that are distributed across two or more cloud providers, Data Transfer Essentials enables you to build flexible, multicloud strategies and use the best-of-breed solutions across different cloud providers. This can foster greater digital operational resilience – without incurring outbound data transfer costs from Google Cloud.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To get started, please read our &lt;/span&gt;&lt;a href="https://cloud.google.com/data-transfer-essentials/docs/create-resources"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;configuration guide&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn how to opt in and specify your multicloud traffic. Qualifying multicloud traffic will be metered separately, and will appear on your bill at a zero charge, while all other traffic will continue to be billed at existing &lt;/span&gt;&lt;a href="https://cloud.google.com/network-tiers/docs"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Network Service Tier&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; rates.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The original promise of the cloud is one that is open, elastic, and free from artificial lock-ins. Google Cloud continues to embrace this openness and the ability for customers to choose the cloud service provider that works best for their workload needs. Read more about Data Transfer Essentials &lt;/span&gt;&lt;a href="https://cloud.google.com/data-transfer-essentials/docs"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 10 Sep 2025 05:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/networking/new-for-the-uk-and-eu-no-cost-multicloud-data-transfer-essentials/</guid><category>Cost Management</category><category>Security &amp; Identity</category><category>Networking</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Introducing no-cost, multicloud Data Transfer Essentials for EU and U.K. 
customers</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/networking/new-for-the-uk-and-eu-no-cost-multicloud-data-transfer-essentials/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Jeanette Manfra </name><title>VP, Head of Risk and Compliance, Google Cloud</title><department></department><company></company></author></item><item><title>Save more with expanded coverage for Compute Flex CUDs</title><link>https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We’re excited to announce an expansion to our &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Compute Flexible Committed Use Discounts (Flex CUDs)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, providing you with greater flexibility across your cloud environment. Your spend commitments now stretch further and cover a wider array of Google Cloud services and VM families, translating into greater savings for your workloads.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Flex CUDs are spend-based commitments that provide deep discounts on Google Cloud compute resources in exchange for a one- or three-year term. This model offers maximum flexibility, automatically applying savings across a broad pool of eligible VM families and regions without being tied to a single resource.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;More power, more savings with expanded coverage&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We understand that modern applications are built on a diverse mix of services, from massive databases to nimble serverless functions. To better support the way you build, we’re expanding Flex CUDs to cover more of the specialized and serverless solutions you use every day:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Memory-optimized VM Families:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; We’re bringing enhanced discounts to our memory-optimized M1, M2, M3 and the new M4 VM families. Now you can get more value from critical workloads like SAP HANA, in-memory analytics platforms and high-performing databases. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;High-performance computing (HPC) VM families:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For compute-intensive workloads, Flex CUDs now apply to our HPC-optimized H3 and the new &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/compute/new-h4d-vms-optimized-for-hpc?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;H4D&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; VM families, perfect for complex simulations and scientific research.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud Run and Cloud Run functions:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For developers and organizations that use Cloud Run's fully managed platform, we are extending Flex CUDs’ coverage to Cloud Run request-based billing and Cloud Run functions.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Why this matters&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This expansion of Compute Flex CUDs is designed with your growth and efficiency in mind:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Maximize your spend commitments:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Instead of being tied to a specific resource type or region, your committed spend can now be applied across a larger portion of your Google Cloud usage. This means less "wasted" commitment and more active savings.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhanced financial predictability and control:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; With greater coverage, you gain a clearer picture of your anticipated cloud spend, making budgeting and financial planning more predictable. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Simplified cost management:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; A single, flexible commitment can now cover a more diverse set of services, streamlining your financial operations and reducing the complexity of managing multiple, granular commitments.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Fuel innovation:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; By reducing the cost of core compute and serverless services, you free up budget that can be reinvested into innovation.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;An updated billing model&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Compute Flex CUDs’ expanded coverage is made possible by the new and improved spend-based CUDs model, which streamlines how discounts are applied and provides greater flexibility. Enabling this feature triggers&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; some &lt;/span&gt;&lt;a href="https://cloud.google.com/docs/cuds-multiprice#billing-ui"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;experience changes&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to the Billing user interface, the Cloud Billing export to BigQuery schema, and the Cloud Commerce Consumer Procurement API. This new billing model is simpler: we directly charge the discounted rate for CUD-eligible usage, reflecting the applicable discount, instead of using credits to offset usage and reflect savings. It’s also more flexible: we apply discounts to a wider range of products within spend-based CUDs. For more detail, this&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://cloud.google.com/docs/cuds-multiprice-datamodel"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;follow-up resource&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;covers the updates, including a sample export to preview your monthly bill in the new format, key CUD KPIs, new SKUs added to CUDs, and CUD product information.&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; You can learn more about these changes in the &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/resources/multiprice-cuds"&gt;&lt;span style="vertical-align: baseline;"&gt;documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
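&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the difference in presentation concrete, here is a deliberately simplified sketch that contrasts the two ways of showing the same discount. It is not the Cloud Billing export schema: the line-item labels, the function names, the $1,000 usage figure, and the 28% discount rate are all illustrative assumptions, and the sketch ignores commitment fees and any usage not covered by a commitment.&lt;/span&gt;&lt;/p&gt;

```python
def credit_model_lines(list_cost, discount_rate):
    """Credit-based presentation: list-price usage plus an offsetting credit."""
    credit = -list_cost * discount_rate
    return [("usage at list price", list_cost), ("CUD credit", credit)]


def direct_rate_lines(list_cost, discount_rate):
    """Direct presentation: usage charged at the discounted rate, no credit."""
    return [("usage at discounted rate", list_cost * (1 - discount_rate))]


def total(lines):
    """Sum the dollar amounts on a list of (label, amount) line items."""
    return round(sum(amount for _, amount in lines), 2)


# HYPOTHETICAL inputs chosen only to illustrate the two presentations.
LIST_COST = 1000.0
DISCOUNT_RATE = 0.28

old_lines = credit_model_lines(LIST_COST, DISCOUNT_RATE)
new_lines = direct_rate_lines(LIST_COST, DISCOUNT_RATE)

print(len(old_lines), "line items, total", total(old_lines))
print(len(new_lines), "line item, total", total(new_lines))
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Under these assumptions both presentations reach the same total. The direct presentation does so with one line item, with no separate credit to reconcile.&lt;/span&gt;&lt;/p&gt;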
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Availability and next steps&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, we’re committed to providing you with the most flexible and cost-effective solutions for your evolving cloud needs. This expansion of Compute Flex CUDs is a testament to that commitment, enabling you to build, deploy, and scale your applications with even greater financial efficiency. Starting today, you can &lt;/span&gt;&lt;a href="https://cloud.google.com/docs/cuds-multiprice#how-to-opt-in"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;opt-in&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and begin enjoying Compute Flex CUDs’ expanded scope and improved billing model. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Starting January 21, 2026, all customers will be automatically transitioned to the new spend-based model to take advantage of these expanded Flex CUDs. If you don’t opt in to multi-price CUDs, these changes will be automatically applied on January 21, 2026. New customers who create a Billing Account on or after July 15, 2025 will automatically be under the new billing model for Flex CUDs. Stay tuned for more updates as we continue to enhance our offerings to support your success on Google Cloud.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 05 Sep 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/</guid><category>Cost Management</category><category>Compute</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Save more with expanded coverage for Compute Flex CUDs</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/compute/expanded-coverage-for-compute-flex-cuds/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Yasmin Mowafy</name><title>Sr. Product Manager</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Rahul Sharma</name><title>Sr. Product Manager</title><department></department><company></company></author></item><item><title>Google is a Leader in the 2025 IDC MarketScape: FinOps Cloud Costs Optimization</title><link>https://cloud.google.com/blog/topics/cost-management/google-leader-in-idc-marketscape-finops-cloud-costs/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our customers come first, and we’ve focused on building FinOps tools that help them understand their cloud spend, optimize for efficiency, and prevent cost surprises. 
We’re excited to be recognized for this work, and named a leader in the 2025 IDC MarketScape for FinOps cloud cost optimization. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--medium
      
      
        h-c-grid__col
        
        h-c-grid__col--4 h-c-grid__col--offset-4
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1-IDC_MarketScape.max-1000x1000.png"
        
          alt="1-IDC MarketScape"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;"This study evaluated the five global hyperscalers and their FinOps cloud cost optimization capabilities, assessing several dimensions within strategy and product capabilities. A strength of Google Cloud FinOps is its integration with Gemini, helping customers mature, automate, and accelerate cost optimization. This is in addition to the thought leadership Google has been demonstrating with its product strategy and driving industry support for open standards." - Jevin Jensen, IDC Research Vice President and triple-certified FinOps practitioner and engineer&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Here are our &lt;strong&gt;top 10 FinOps innovations that are helping Google Cloud customers&lt;/strong&gt;:&lt;/p&gt;
&lt;ol&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;We stream net-cost data in real time, so your information stays current with actual cloud costs. 99% of that data arrives within 24 hours, and many services update several times a day.  &lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;We provide granular, sub-resource cost data out-of-the-box – without additional hoops to jump through, like agent installs – which means you can understand your cost drivers faster. For example, for more than two years, we have broken up Kubernetes costs into clusters, namespaces, and pods.  &lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;The FinOps Hub centralizes all cost optimization activities in one place, highlighting inefficiencies so business professionals can collaborate with development teams to drive meaningful change.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/2-FinOps_Hub.gif"
        
          alt="2-FinOps Hub"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="b829n"&gt;FinOps Hub in action&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;4. We integrated generative AI into FinOps workflows early, creating specialized business use cases for &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;all&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; users. This saves time when finding cost insights and optimization opportunities, with grounded answers to ensure accuracy and relevance.  &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/3-GCA_for_FinOps.gif"
        
          alt="3-GCA for FinOps"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="b829n"&gt;Gemini Cloud Assist for FinOps in action.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;5. We focus on the FinOps user, and the rest follows. Over the years, we have built up an amazing group of FinOps practitioners we work closely with to evolve our FinOps products. We also have a FinOps executive advisory board that allows us to look forward and understand where the industry is evolving. &lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;6. We believe we can make billing enjoyable. The microinteractions, zero states, guided tours, and elegant material design, all work together to create experiences that feel intuitive and Googley.&lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;7. We provide customers a FinOps score to help you make data-informed decisions when building business cases for committed use discounts or identifying spend that needs better organization through tagging or budget coverage. Using this score you can see how you benchmark against peers.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--medium
      
      
        h-c-grid__col
        
        h-c-grid__col--4 h-c-grid__col--offset-4
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4-FinOps_Score.max-1000x1000.png"
        
          alt="4-FinOps Score"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="f01au"&gt;Google Cloud customers get their own FinOps score and can see how they compare with their peers.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;8. We have fast cost-anomaly detection that runs hourly, with high precision. And we also offer root cause analysis information for our users to take action quickly. &lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;9. We provide real-time scenario modelling for rate optimizations, managing terabytes of data in our UI quickly and easily. Customer controls let you shape and model the data as needed.  &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/5-FinOps_Hub_Scenario_Modeling.gif"
        
          alt="5-FinOps Hub Scenario Modeling"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="f01au"&gt;FinOps Hub scenario modelling in action.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;10. We provide these FinOps tools at no additional charge to Google Cloud customers. We don’t charge extra for extended data lookback windows, UI views and analysis, or FinOps Hub cost optimizations. This helps customers spend less time on understanding their bills and more time driving business innovation. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/resources/content/idc-marketscape-worldwide-finops-cloud-costs-optimization-hyperscalers-2025-vendor-assessment?e=48754805&amp;amp;hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Read the full IDC MarketScape&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; excerpt to learn more about our capabilities. &lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;p&gt;&lt;sup&gt;&lt;em&gt;&lt;span style="vertical-align: baseline;"&gt;Source:&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;“IDC MarketScape: Worldwide FinOps Cloud Costs Optimization Hyperscalers 2025 Vendor Assessment” by Jevin Jensen, July 2025, IDC #US53679825&lt;/span&gt;&lt;/em&gt;&lt;/sup&gt;&lt;/p&gt;
&lt;p&gt;&lt;sup&gt;&lt;em&gt;&lt;span style="vertical-align: baseline;"&gt;IDC MarketScape vendor analysis model is designed to provide an overview of the competitive fitness of ICT suppliers in a given market.  The research methodology utilizes a rigorous scoring methodology based on both qualitative and quantitative criteria that results in a single graphical illustration of each vendor’s position within a given market. The Capabilities score measures vendor product, go-to-market and business execution in the short-term. The Strategy score measures alignment of vendor strategies with customer requirements in a 3-5-year timeframe. Vendor market share is represented by the size of the circles. Vendor year-over-year growth rate relative to the given market is indicated by a plus, neutral or minus next to the vendor name.&lt;/span&gt;&lt;/em&gt;&lt;/sup&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Tue, 05 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/google-leader-in-idc-marketscape-finops-cloud-costs/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Google is a Leader in the 2025 IDC MarketScape: FinOps Cloud Costs Optimization</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/google-leader-in-idc-marketscape-finops-cloud-costs/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sarah McMullin</name><title>Head of Cloud FinOps Product</title><department></department><company></company></author></item><item><title>Optimize your cloud costs using Cloud Hub Optimization and Cost Explorer</title><link>https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Application owners are looking for three 
things when they think about optimizing cloud costs:&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;What are the most expensive resources?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Which resources are costing me more this week or month?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Which resources are poorly utilized?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To help you answer these questions quickly and easily, we &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/application-development/an-application-centric-ai-powered-cloud?e=13802955"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;announced&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Cloud Hub Optimization and Cost Explorer, in private preview, at Google Cloud Next 2025. And today, we are excited to announce that both Cloud Hub Optimization and Cost Explorer are now in public preview.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;span style="vertical-align: baseline;"&gt;Application cost and utilization&lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As an app owner, your primary objective is keeping your application healthy at all times. Yet, monitoring all the individual components of your application, which may straddle dozens of Projects, can be quite overwhelming. &lt;/span&gt;&lt;a href="https://cloud.google.com/products/app-hub"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AppHub Applications&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; allow you to reorganize cloud around your application, giving you the information and controls you need at your fingertips.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In addition to supporting Google Cloud Projects, Cloud Hub Optimization and Cost Explorer leverage &lt;/span&gt;&lt;a href="https://cloud.google.com/products/app-hub"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;App Hub&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; applications to show you the cost-efficiency of your application’s workloads and services instantly. This is great for instance when you are trying to pinpoint deployments running on GKE clusters that might be wasting valuable resources, such as GPUs.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_CHO_utilization_summary_app.max-1000x1000.jpg"
        
          alt="1_CHO_utilization summary app"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;&lt;span style="vertical-align: baseline;"&gt;Not just another cost dashboard&lt;/span&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;When you bring up Cloud Hub Optimization, you can immediately see the resources that are costing you the most, along with the percentage change in their cost. With this highly granular cost information, you can now attribute your costs to specific resources and resource owners to reason about any changes in costs.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_CHO_cost_summary.max-1000x1000.jpg"
        
          alt="2_CHO_cost summary"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We have additionally integrated granular cost data from Cloud Billing and resource utilization data from Cloud Monitoring to give you a comprehensive picture of your cost efficiency. This includes average vCPU utilization for your Project, which helps you find the most promising optimization candidates across hundreds of Google Cloud Projects.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_CHO_utilization_summary_project.max-1000x1000.jpg"
        
          alt="3_CHO_utilization summary project"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The Cost Explorer dashboard also shows you your costs logically organized at the product level, for even more cost explainability. Instead of seeing a lump sum cost for Compute Engine, you can now see your exact spend on individual products including Google Kubernetes Engine (GKE) clusters, Persistent Disks, Cloud Load Balancing, and more.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_CHO_cost_explorer.max-1000x1000.jpg"
        
          alt="4_CHO_cost explorer"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Simple is powerful&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Customers who have tried these new tools love the information that is surfaced as well as the simplicity of the interfaces.&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“My team has to keep an eye on cloud costs across tens of business units and hundreds of developers. The Cloud Hub Optimization and Cost Explorer dashboards are a force multiplier for my team as they tell us where to look for cost savings and potential optimization opportunities.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; - Frank Dice, Principal Cloud Architect, Major League Baseball&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Customers especially appreciate the &lt;/span&gt;&lt;a href="https://cloud.google.com/stackdriver/docs/costs/optimize-costs#supported_products"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;breadth of product coverage&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; available out of the box without any additional setup, and the fact that there is no additional charge to using these features.&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;What’s next&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As your organization “shifts left” on cloud cost management, we are working to help application owners and developers understand and optimize their cloud costs. You can try Cloud Hub Optimize and Cost Explorer &lt;/span&gt;&lt;a href="https://console.cloud.google.com/cloud-hub/optimization"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can also see a live demo of how Cloud Hub Optimization and Cost Explorer can be used to identify underutilized GKE clusters within seconds in the Google Cloud Next 2025 talk Maximize Your Cloud ROI.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-video"&gt;



&lt;div class="article-module article-video "&gt;
  &lt;figure&gt;
    &lt;a class="h-c-video h-c-video--marquee"
      href="https://youtube.com/watch?v=7csgD3iIc2Q"
      data-glue-modal-trigger="uni-modal-7csgD3iIc2Q-"
      data-glue-modal-disabled-on-mobile="true"&gt;

      
        

        &lt;div class="article-video__aspect-image"
          style="background-image: url(https://storage.googleapis.com/gweb-cloudblog-publish/images/maxresdefault_LGJSUja.max-1000x1000.jpg);"&gt;
          &lt;span class="h-u-visually-hidden"&gt;Maximize your cloud ROI: A practical approach to efficiency and optimization&lt;/span&gt;
        &lt;/div&gt;
      
      &lt;svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white"&gt;
        &lt;use xlink:href="#mi-youtube-icon"&gt;&lt;/use&gt;
      &lt;/svg&gt;
    &lt;/a&gt;

    
  &lt;/figure&gt;
&lt;/div&gt;

&lt;div class="h-c-modal--video"
     data-glue-modal="uni-modal-7csgD3iIc2Q-"
     data-glue-modal-close-label="Close Dialog"&gt;
   &lt;a class="glue-yt-video"
      data-glue-yt-video-autoplay="true"
      data-glue-yt-video-height="99%"
      data-glue-yt-video-vid="7csgD3iIc2Q"
      data-glue-yt-video-width="100%"
      href="https://youtube.com/watch?v=7csgD3iIc2Q"
      ng-cloak&gt;
   &lt;/a&gt;
&lt;/div&gt;

&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;hr/&gt;
&lt;p&gt;&lt;sup&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Major League Baseball trademarks and copyrights are used with permission of Major League Baseball. Visit MLB.com.&lt;/span&gt;&lt;/sup&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 04 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers/</guid><category>AI &amp; Machine Learning</category><category>DevOps &amp; SRE</category><category>Cost Management</category><category>Management Tools</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Optimize your cloud costs using Cloud Hub Optimization and Cost Explorer</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/management-tools/announcing-cloud-hub-optimization-and-cost-explorer-for-developers/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Gobind Johar</name><title>Product Manager</title><department></department><company></company></author></item><item><title>Spring cleaning with FinOps Hub 2.0</title><link>https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Spring is a great reminder to spring clean – an annual tradition that should extend not only to your household, but also to your virtual cloud infrastructure. Why not start with Google Cloud’s FinOps Hub? &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As Google Cloud customers have adopted the FinOps hub to guide their optimization initiatives, we started getting additional feedback from our business community. For example, while DevOps users have access to tools and utilization metrics to identify waste, business teams often lack clear insights into resource consumption, leading to a significant blind spot. The most recent &lt;/span&gt;&lt;a href="https://data.finops.org/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;State of FinOps 2025 Report&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; reinforces this need, underscoring the importance of workload optimization and waste reduction as the #1 Top FinOps concern. It’s extremely difficult to optimize workloads or applications if customers cannot fully understand how much is even being used. Why purchase a committed use discount for compute cores that you might not even be fully using? &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Sometimes the easiest optimizations our customers can make are really just using more efficiently the resources they are actually paying for. That’s why, in 2025, we are focused on the deep clean of your optimization opportunities and have upgraded FinOps Hub to help you find, highlight, and eliminate wasted spend.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f20693df640&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;1. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Find waste: FinOps Hub 2.0 now comes with new utilization insights to zero in on optimization opportunities. &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud Next 2025, we introduced FinOps Hub 2.0,&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;focused exclusively on bringing utilization insights on your resources to the forefront so you can see what potential waste may exist and take action immediately. Waste can come in many forms: from a VM that is barely getting used at 5% (overprovisioned), to a GKE cluster that is actually running hot at 110% utilization and might fail (underprovisioned), to managed resources like Cloud Run instances that may not be optimally configured (suboptimal configuration) or, worse yet, a VM that might not ever have been used (idle). FinOps users can now quickly view the most expensive waste category in one, easy-to-understand heatmap by service or AppHub application. But FinOps Hub doesn’t just show you where there may be waste; it also includes more cost optimizations for Kubernetes Engine (GKE), Compute Engine (GCE), Cloud Run, and Cloud SQL to remedy the waste too.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_GCA-Wastemap.max-1000x1000.png"
        
          alt="1 GCA-Wastemap"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="yc3wp"&gt;Waste map showing identified resources with their corresponding utilization metrics&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;2. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Highlight waste: Gemini Cloud Assist supercharges FinOps Hub to summarize optimization insights and send opportunities to engineering.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;But perhaps what really makes this a 2.0 release is that we supercharged the most time-consuming tasks on FinOps Hub with Gemini Cloud Assist. Our first launch of Gemini Cloud Assist, which helps create personalized cost reports and synthesize insights, has resulted in &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;&amp;gt;100k FinOps hours saved by our customers&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; annually (from January 2024 to January 2025). The power of Gemini Cloud Assist to supercharge and automate workflows is a huge benefit, so we applied that to FinOps Hub in two ways. First, FinOps can now see embedded optimization insights on the hub itself –similar to cost reports – so you don’t need to solve the “needle in the haystack” problem of optimization. Second, you can now use Gemini Cloud Assist to summarize and send top waste insights to your engineering teams to take action and remediate fast&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_GCA-Email.max-1000x1000.png"
        
          alt="2 GCA-Email"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="yc3wp"&gt;Gemini summary and draft emails with top optimization opportunities&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;3. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Eliminate waste: introducing a NEW IAM role permission for your tech solution owners to see &amp;amp; directly take action on these optimization opportunities.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Finally, perhaps our most exciting feature – and long overdue for FinOps – is that we are unlocking access to the Billing console for tech solution owners, so that these owners can get FinOps insights and Gemini Cloud Assist insights across all their projects, in a single pane. For example, if you want to give access to FinOps Hub or cost reports to an entire department that only uses a subset of projects for their infrastructure – without providing them with broader billing data access, but still allowing them to see all of their data in a single view – now you can, with multi-project views in the billing console. Multi-project views are enabled using the new &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Project Billing Costs Manager&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; IAM role (or related granular permissions). These new permissions are currently in private preview so &lt;/span&gt;&lt;a href="https://forms.gle/kvSQivkDZ6RiyD7P9" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;sign-up&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to get access. Now you can truly extend the power of FinOps tools across your organization with these new access controls.  &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;So take this Spring to try FinOps Hub 2.0 with Gemini Cloud Assist, and do some spring cleaning on your cloud infrastructure, because as the saying goes, “With clouds overgrown, like winter’s old grime, Spring clean your servers, save dollars and time.” – well at least that’s what they say according to Gemini.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 16 Apr 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Spring cleaning with FinOps Hub 2.0</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/spring-cleaning-with-finops-hub/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sarah McMullin</name><title>Head of Cloud FinOps Product</title><department></department><company></company></author></item><item><title>How to calculate your AI costs on Google Cloud</title><link>https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;What is the true cost of enterprise AI?&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As a technology leader and a steward of company resources, understanding these costs isn't just prudent – it's essential for sustainable AI adoption. To help, we’ll unveil a comprehensive approach to understanding and managing your AI costs on Google Cloud, ensuring your organization captures maximum value from its AI investments.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Whether you're just beginning your AI journey or scaling existing solutions, this approach will equip you with the insights needed to make informed decisions about your AI strategy.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Why understanding AI costs matters &lt;/strong&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;now&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud offers a vast and ever-expanding array of AI services, each with its own pricing structure. Without a clear understanding of these costs, you risk budget overruns, stalled projects, and ultimately, a failure to realize the full potential of your AI investments. This isn't just about saving money; it's about &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;responsible AI development&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; – building solutions that are both innovative and financially sustainable.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Breaking down the Total Cost of Ownership (TCO) for AI on Google Cloud&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Let's dissect the major cost components of running AI workloads on Google Cloud:&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;div align="center"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;&lt;table&gt;&lt;colgroup&gt;&lt;col/&gt;&lt;col/&gt;&lt;col/&gt;&lt;/colgroup&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Cost category&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Description&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud services (Examples)&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Model serving cost&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The cost of running your trained AI model to make predictions (inference). This is often a per-request or per-unit-of-time cost.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;OOTB models available in Vertex AI, Vertex AI Prediction, GKE (if self-managing), Cloud Run Functions (for serverless inference)&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Training and tuning costs&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The expense of training your AI model on your data and fine-tuning it for optimal performance. This includes compute resources (GPUs/TPUs) and potentially the cost of the training data itself.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Vertex AI Training, Compute Engine (with GPUs/TPUs), GKE or Cloud Run (with GPUs/TPUs)&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud hosting costs&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The fundamental infrastructure costs for running your AI application, including compute, networking, and storage.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Compute Engine, GKE or Cloud Run, Cloud Storage, Cloud SQL (if your application uses a database)&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Training data storage and adapter layers costs&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The cost of storing your training data and any "adapter layers" (intermediate representations or fine-tuned model components) created during the training process.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Cloud Storage, BigQuery&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Application layer and setup costs&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The expenses associated with any additional cloud services needed to support your AI application, such as API gateways, load balancers, monitoring tools, etc.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Cloud Load Balancing, Cloud Monitoring, Cloud Logging, API Gateway, Cloud Functions (for supporting logic)&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Operational support cost&lt;/strong&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The ongoing costs of maintaining and supporting your AI model, including monitoring performance, troubleshooting issues, and potentially retraining the model over time.&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud Support, internal staff time, potential third-party monitoring tools&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f20437e0820&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Let’s estimate costs with an example&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Let's illustrate this with a hypothetical, yet realistic, generative AI use case: Imagine you’re a retail customer with an automated customer support chatbot.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Scenario:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; A medium-sized e-commerce company wants to deploy a chatbot on their website to handle common customer inquiries (order status, returns, product information and more). They plan to use a pre-trained language model (like one available through &lt;/span&gt;&lt;a href="https://cloud.google.com/model-garden?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI Model Garden&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;) and fine-tune it on their own customer support data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Assumptions:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Model:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Fine-tuning a low latency language model (in this case we will use Gemini 1.5 Flash).&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Training data:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; 1 million customer support conversations (text data).&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Traffic:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; 100K chatbot interactions per day.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Hosting:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Vertex AI Prediction for serving the model.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Fine-tuning frequency:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Monthly.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Cost estimation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As the retail customer in this example, here’s how you might approach this. &lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;1. First, discover your &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;model serving cost:&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Vertex AI Prediction (Gemini 1.5 Flash for Chat) pricing is modality-based pricing so in this case since our input and output is text, the usage unit will be characters. Let's assume an average of 1000 input characters and 500 output characters per interaction.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost per 1M characters input: $0.0375.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost per 1M characters output: $0.15&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Input cost per day: 100,000 interactions * 1000 characters * $0.0375 / 1000000 = $3.75&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Output cost per day: 100,000 interactions * 500 characters * $0.15 / 1000000 characters = $7.5&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total model serving cost per day: $11.25&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total model serving cost per month (~30 days): ~$337&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
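&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The serving arithmetic above can be sketched in a few lines of Python. This is a minimal illustration rather than an official calculator: the function name and parameters are hypothetical, and the prices are simply the figures quoted in this example, so check the current Vertex AI pricing page before reusing them.&lt;/span&gt;&lt;/p&gt;

```python
def daily_serving_cost(interactions_per_day, input_chars, output_chars,
                       input_price_per_million, output_price_per_million):
    """Return (input_cost, output_cost) in USD for one day of traffic.

    Hypothetical helper: prices are expressed per 1M characters.
    """
    input_cost = (interactions_per_day * input_chars
                  * input_price_per_million / 1_000_000)
    output_cost = (interactions_per_day * output_chars
                   * output_price_per_million / 1_000_000)
    return input_cost, output_cost


# Assumptions copied from the example in this post.
input_cost, output_cost = daily_serving_cost(
    interactions_per_day=100_000,
    input_chars=1_000,
    output_chars=500,
    input_price_per_million=0.0375,
    output_price_per_million=0.15,
)
daily_total = input_cost + output_cost
monthly_total = daily_total * 30  # a month approximated as 30 days

print(f"Input cost per day:   ${input_cost:.2f}")
print(f"Output cost per day:  ${output_cost:.2f}")
print(f"Serving cost per day: ${daily_total:.2f}")
print(f"Serving cost per month (30 days): ${monthly_total:.2f}")
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Keeping traffic, message length, and unit prices as separate parameters makes it easy to rerun the estimate when any one of them changes.&lt;/span&gt;&lt;/p&gt;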
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/fig1.jpg"
        
          alt="Figure-1"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="9v32r"&gt;Servicing cost of Gemini Flash 1.5 LLM model&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;2. Second, identify your &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;training and tuning costs:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In this scenario, we aim to enhance the model's accuracy and relevance to our specific use case through fine-tuning. This involves inputting a million past chat interactions, enabling the model to deliver more precise and customized interactions.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost per training tokens: $8 / M tokens&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost per training characters: $2 / M characters (where each token approximately equates to 4 characters)&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Tuning cost (first month): 1,000,000 conversation (training data) * 1500 characters (input + output) * 2 /1,000,000 = $3,000&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Tuning cost (subsequent month): 100,000 conversation (new training data) * 1500 characters (input + output) * 2 /1,000,000 = $300&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;
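&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As a rough sketch, the tuning estimate can be reproduced as follows. The helper name is hypothetical, and the 4-characters-per-token ratio is the same approximation used in this example, not an exact tokenizer count.&lt;/span&gt;&lt;/p&gt;

```python
CHARS_PER_TOKEN = 4  # approximation used in this example, not a tokenizer result


def tuning_cost(conversations, chars_per_conversation, price_per_million_tokens):
    """Estimate the USD cost of one tuning run (hypothetical helper)."""
    # Convert the per-token price into a per-character price.
    price_per_million_chars = price_per_million_tokens / CHARS_PER_TOKEN
    total_chars = conversations * chars_per_conversation
    return total_chars * price_per_million_chars / 1_000_000


# First month: the full set of 1M historical conversations.
first_month = tuning_cost(1_000_000, 1_500, price_per_million_tokens=8.0)
# Subsequent months: only the 100K new conversations assumed above.
subsequent_month = tuning_cost(100_000, 1_500, price_per_million_tokens=8.0)

print(f"Tuning cost, first month:       ${first_month:,.2f}")
print(f"Tuning cost, subsequent months: ${subsequent_month:,.2f}")
```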
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;3. Third, understand the &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;cloud hosting costs:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Since we're using Vertex AI Prediction, the underlying infrastructure is managed by Google Cloud. The cost is included in the per-request pricing. However, if we are self-managing the model on GKE or Compute Engine, we'd need to factor in VM costs, GPU/TPU costs (if applicable), and networking costs. For this example, we assume this is $0, as it is part of Vertex AI cost.&lt;/span&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;4. Fourth, define the&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; training data storage and adapter layers costs:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The infrastructure costs for deploying machine learning models often raise concerns, but the data storage components can be economical at moderate scales. When implementing a conversational AI system, storing both the training data and the specialized model adapters represents a minor fraction of the overall costs. Let's break down these storage requirements and their associated expenses.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;1M conversations, assuming an average size of 5KB per conversation, would be roughly 5GB of data.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Cloud Storage cost for 5GB is negligible: $0.1 per month.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Adapter layers (fine-tuned model weights) might add another 1GB of storage. This would still be very inexpensive: $0.02 per month.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total storage cost per month: &amp;lt; $1/month&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;
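&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The storage figures can be checked with a short sketch. The $0.02 per GB per month rate is an assumption inferred from the numbers in this example (5GB for about $0.10), and the helper name is hypothetical; actual Cloud Storage rates vary by storage class and region.&lt;/span&gt;&lt;/p&gt;

```python
ASSUMED_PRICE_PER_GB_MONTH = 0.02  # inferred from this example, not a quoted rate


def monthly_storage_cost(size_gb, price_per_gb_month=ASSUMED_PRICE_PER_GB_MONTH):
    """Return the monthly USD cost of storing size_gb (hypothetical helper)."""
    return size_gb * price_per_gb_month


# 1M conversations at about 5 KB each, using decimal units (1 GB = 1,000,000 KB).
training_data_gb = 1_000_000 * 5 / 1_000_000
adapter_gb = 1  # assumed size of the fine-tuned adapter weights

training_cost = monthly_storage_cost(training_data_gb)
adapter_cost = monthly_storage_cost(adapter_gb)
total_storage_cost = training_cost + adapter_cost

print(f"Training data: {training_data_gb:.0f} GB, ${training_cost:.2f} per month")
print(f"Adapter layers: {adapter_gb} GB, ${adapter_cost:.2f} per month")
print(f"Total storage: ${total_storage_cost:.2f} per month")
```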
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;5. Fifth, consider the&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; application layer and setup costs: &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This depends heavily on the specific application. In this case we are using Cloud Run Functions and Logging. Cloud Run to handle pre- and post-processing of chatbot requests (e.g., formatting, database lookups). In this case let's assume we use request-based billing so we are only charged when it processes the request. In this example we are processing 3M requests per month (100K * 30) and assuming 1 sec for average execution time: $14.30&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/fig2.jpg"
        
          alt="Figure-2"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="9v32r"&gt;Cloud Run function cost for request-based billing&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;Cloud Logging and Monitoring for tracking chatbot performance and debugging issues. Let's estimate 100GB of logging volume (which is on higher end) and retaining the logs for 3 months: $28&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/fig3.max-1000x1000.jpg"
        
          alt="Figure-3"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="9v32r"&gt;Cloud Logging costs for storage and retention&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Total application layer cost per month:~ $40&lt;/strong&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;6. Finally, incorporate the&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; Operational support cost:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This is the hardest to estimate, as it depends on the internal team's size and responsibilities. Let's assume a conservative estimate of 5 hours per week of an engineer's time dedicated to monitoring and maintaining the chatbot, at an hourly rate of $100.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total operational support cost per month: 5 hours/week * 4 weeks/month * $100/hour = $2000&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total estimated monthly cost (First month):&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;$ 340 (Serving) + $3000 (Training) + $1 (Storage) + $40 (Application) + $2000 (Operational) = $5,381&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Total estimated monthly cost (Subsequent months):&lt;/strong&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;$340 (Serving) + $300 (Training) + $1 (Storage) + $40 (Application) + $2000 (Operational) = $2,681&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
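&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the roll-up explicit, here is a small sketch that adds the rounded component estimates from this example. The variable names are illustrative only, and the inputs are the rounded figures used in the totals above, not exact billing data.&lt;/span&gt;&lt;/p&gt;

```python
# Operational support: 5 hours/week * 4 weeks/month * $100/hour.
operational = 5 * 4 * 100

# Recurring monthly components, using the rounded estimates from this example.
recurring = {
    "serving": 340,
    "storage": 1,
    "application": 40,
    "operational": operational,
}

# Tuning is the only component that differs between the first month
# (full 1M-conversation data set) and later months (100K new conversations).
tuning_first_month = 3_000
tuning_subsequent_month = 300

first_month_total = sum(recurring.values()) + tuning_first_month
subsequent_month_total = sum(recurring.values()) + tuning_subsequent_month

print(f"Operational support per month: ${operational:,}")
print(f"Estimated total, first month: ${first_month_total:,}")
print(f"Estimated total, subsequent months: ${subsequent_month_total:,}")
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In this example, operational support and tuning together account for most of the estimate, which is why they are worth tracking even though they sit outside the pricing calculator.&lt;/span&gt;&lt;/p&gt;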
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can find the full estimate of cost &lt;/span&gt;&lt;a href="https://cloud.google.com/products/calculator/estimate-preview/CiQzM2ExNTMxZi0xZjY3LTQwOGUtOTVmYi1hYjIzNjNkNTdlN2YQAQ==?e=48754805&amp;amp;hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. Note that this does not include tuning and operational cost as it is not available in pricing export yet. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Once you have a good understanding of your AI costs, it is important to develop an optimization strategy that encompasses infrastructure choices, resource utilization, and monitoring practices to maintain performance while controlling expenses. By understanding the various cost components and leveraging Google Cloud's tools and resources, you can confidently embark on your AI journey. Cost management isn't a barrier; it's an enabler. It allows you to experiment, innovate, and build transformative AI solutions in a financially responsible way. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Get started&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Start understanding your AI costs today:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Explore the&lt;/span&gt;&lt;a href="https://cloud.google.com/products/calculator"&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Pricing Calculator&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and the&lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai/pricing"&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI Pricing Page&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn more at Google Cloud Next:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Register for the Google Next session on &lt;/span&gt;&lt;a href="https://cloud.withgoogle.com/next/25/session-library?session=BRK1-002&amp;amp;utm_source=copylink&amp;amp;utm_medium=unpaidsoc&amp;amp;utm_campaign=FY25-Q2-global-EXP106-physicalevent-er-next25-mc&amp;amp;utm_content=reg-is-live-next-homepage-social-share&amp;amp;utm_term=-" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI Investment to Impact: Unlocking Sustainable ROI with Google Cloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Engage Google Cloud for expert guidance&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Get expert help to design cost effect AI architectures, contact &lt;/span&gt;&lt;a href="https://cloud.google.com/consulting/innovation-and-transformation" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Consulting or PSO&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</description><pubDate>Mon, 03 Mar 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud/</guid><category>AI &amp; Machine Learning</category><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How to calculate your AI costs on Google Cloud</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/unlock-the-true-cost-of-enterprise-ai-on-google-cloud/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pathik Sharma</name><title>Cloud FinOps Lead, delta, Google Cloud Consulting</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Eric Lam</name><title>Head of Cloud FinOps, delta, Google Cloud Consulting</title><department></department><company></company></author></item><item><title>Accelerate your cloud journey using a well-architected, principles-based framework</title><link>https://cloud.google.com/blog/products/application-modernization/well-architected-framework-to-accelerate-your-cloud-journey/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In today's dynamic digital landscape, building and operating secure, reliable, cost-efficient and high-performing cloud solutions is no easy feat. Enterprises grapple with the complexities of cloud adoption, and often struggle to bridge the gap between business needs, technical implementation, and operational readiness. This is where the &lt;/span&gt;&lt;a href="https://cloud.google.com/architecture/framework"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Well-Architected Framework&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; comes in. 
The framework provides comprehensive guidance to help you design, develop, deploy, and operate efficient, secure, resilient, high-performing, and cost-effective Google Cloud topologies that support your security and compliance requirements.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Who should use the Well-Architected Framework?&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The Well-Architected Framework caters to a broad spectrum of cloud professionals. Cloud architects, developers, IT administrators, decision makers and other practitioners can benefit from years of subject-matter expertise and knowledge both from within Google and from the industry. The framework distills this vast expertise and presents it as an easy-to-consume set of recommendations. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The recommendations in the Well-Architected Framework are organized under five, business-focused pillars.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/af-infographic.max-1000x1000.jpg"
        
          alt="af-infographic"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We recently completed a revamp of the guidance in all the pillars and perspectives of the Well-Architected Framework to center the recommendations around a core set of design principles.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;div align="left"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;
&lt;div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"&gt;&lt;table&gt;&lt;colgroup&gt;&lt;col/&gt;&lt;col/&gt;&lt;col/&gt;&lt;col/&gt;&lt;col/&gt;&lt;/colgroup&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/architecture/framework/operational-excellence"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Operational excellence&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/architecture/framework/security"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Security, privacy, and compliance&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/architecture/framework/reliability"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Reliability&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/architecture/framework/cost-optimization"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Cost optimization&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/architecture/framework/performance-optimization"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Performance optimization&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Operational readiness&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Incident management&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Resource optimization&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Change management&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Continuous improvement&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Security by design&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Zero trust&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Shift-left security&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Preemptive cyber-defense&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Secure and responsible AI&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;AI for security&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Regulatory, privacy, and compliance needs&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;User-focused goals&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Realistic targets&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;HA through redundancy&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Horizontal scaling&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Observability&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Graceful degradation&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Recovery testing&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Thorough postmortems&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Spending aligned with business value&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Culture of cost awareness&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Resource optimization&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Continuous optimization&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Resource allocation planning&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Elasticity&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Modular design&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Continuous  improvement&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;p&gt;&lt;span&gt;&lt;span style="vertical-align: baseline;"&gt;In addition to the above pillars, the Well-Architected Framework provides cross-pillar perspectives that present recommendations for selected domains, industries, and technologies like &lt;/span&gt;&lt;a href="https://cloud.google.com/architecture/framework/perspectives/ai-ml"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI and machine learning (ML)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f206845ef10&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Benefits of adopting the Well-Architected Framework&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The Well-Architected Framework is much more than a collection of design and operational recommendations. The framework empowers you with a structured principles-oriented design methodology that unlocks many advantages:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhanced security, privacy, and compliance:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Security is paramount in the cloud. The Well-Architected Framework incorporates industry-leading security practices, helping ensure that your cloud architecture meets your security, privacy, and compliance requirements.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Optimized cost:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The Well-Architected Framework lets you build and operate cost-efficient cloud solutions by promoting a cost-aware culture, focusing on resource optimization, and leveraging built-in cost-saving features in Google Cloud.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Resilience, scalability, and flexibility:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; As your business needs evolve, the Well-Architected Framework helps you design cloud deployments that can scale to accommodate changing demands, remain highly available, and be resilient to disasters and failures.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Operational excellence:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The Well-Architected Framework promotes operationally sound architectures that are easy to operate, monitor, and maintain.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Predictable and workload-specific performance:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The Well-Architected Framework offers guidance to help you build, deploy, and operate workloads that provide predictable performance based on your workloads’ needs.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;The Well-Architected Framework also includes cross-pillar perspectives for selected domains, industries, and technologies like &lt;/span&gt;&lt;a href="https://cloud.google.com/architecture/framework/perspectives/ai-ml"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI and machine learning (ML)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The principles and recommendations in the Google Cloud Well-Architected Framework are aligned with Google and industry best practices like Google’s &lt;/span&gt;&lt;a href="https://sre.google/sre-book/introduction/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Site Reliability Engineering (SRE) practices&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://dora.dev/capabilities/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;DORA capabilities&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, the Google &lt;/span&gt;&lt;a href="https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/36299.pdf" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;HEART framework for user-centered metrics&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, the &lt;/span&gt;&lt;a href="https://www.finops.org/framework/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps framework&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://slsa.dev/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Supply-chain Levels for Software Artifacts (SLSA)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and Google's &lt;/span&gt;&lt;a href="https://safety.google/cybersecurity-advancements/saif/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Secure AI Framework (SAIF)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Embrace the Well-Architected Framework to transform your Google Cloud journey, and get comprehensive guidance on security, reliability, cost, performance, and operations — as well as targeted recommendations for specific industries and domains like AI and ML. To learn more, visit &lt;/span&gt;&lt;a href="https://cloud.google.com/architecture/framework"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Well-Architected Framework&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 14 Feb 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/application-modernization/well-architected-framework-to-accelerate-your-cloud-journey/</guid><category>Application Development</category><category>Cost Management</category><category>DevOps &amp; SRE</category><category>Application Modernization</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Accelerate your cloud journey using a well-architected, principles-based framework</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/application-modernization/well-architected-framework-to-accelerate-your-cloud-journey/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Kumar Dhanagopal</name><title>Cross-Product Solution Developer</title><department></department><company></company></author></item><item><title>To avoid “bill shocks,” Palo Alto Networks deploys custom AI-powered cost anomaly detection</title><link>https://cloud.google.com/blog/topics/cost-management/palo-alto-networks-custom-cost-anomaly-detection-with-ai-bill-shocks/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In today's fast-paced digital world, businesses are constantly seeking innovative ways to leverage 
cutting-edge technologies to gain a competitive edge. AI has emerged as a transformative force, empowering organizations to automate complex processes, gain valuable insights from data, and deliver exceptional customer experiences. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;However, with the rapid adoption of AI comes a significant challenge: managing the associated cloud costs. As AI — and really cloud workloads in general — grow and become increasingly sophisticated, so do their associated costs and potential for overruns if organizations don’t plan their spend carefully.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;These unexpected charges can arise from a variety of factors: &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Human error and mismanagement:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Misconfigurations in cloud services (e.g., accidentally enabling a higher-tiered service or changing scaling settings) can inadvertently drive up costs.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Unexpected workload changes:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Spikes in traffic or usage, or changes in application behavior (e.g., marketing campaign or sudden change in user activity) can lead to unforeseen service charges.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Lack of proactive governance and cost transparency: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Without a robust cloud FinOps framework, it's easy for cloud spending to spiral out of control, leading to significant financial overruns. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Organizations have an opportunity to proactively manage their cloud costs and avoid budget surprises. By implementing real-time cost monitoring and analysis, they can identify and address potential anomalies before they result in unexpected expenses. This approach empowers businesses to maintain financial control and support their growth objectives.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2069b6c250&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As one of the world’s leading cybersecurity organizations — serving more than 70,000 organizations in 150 countries — Palo Alto Networks must bring a level of vigilance and awareness to its digital business. Since it experiments often with new technologies and tools and deals with spikes in activity when threat actors mount an attack, the chances for anomalous spending run higher than most.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Recognizing the need of all its customers to effectively manage its cloud spend, Google Cloud launched the &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Anomaly Detection&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; as part of the &lt;/span&gt;&lt;a href="https://cloud.google.com/cost-management"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Management toolkit&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. It does not require any setup and automatically detects anomalies for your Google Cloud projects and empowers teams with details to alert and provide root-cause analysis. While Palo Alto Networks used  this feature for a while and found it useful, it eventually realized the need for a customized solution. Due to stringent custom requirements, it wanted a service that could identify anomalies based on labels, such as applications or products that span across Google Cloud projects, and provide more control over anomaly variables that are detected and alerted to its teams. Creating a consistent experience across its multicloud environments was also a priority.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Palo Alto Networks’ purpose-built solution tackles cloud management and AI costs head-on, helping the organization to be proactive at scale. It is designed to enhance cost transparency by providing real-time alerts to product owners, so they can make informed decisions and act quickly. The solution also delivers automated insights at scale, freeing up valuable time for the team to focus on innovation. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By removing the worry of unexpected costs, Palo Alto Networks can now confidently embrace new cloud and AI workloads, accelerating its digital transformation journey.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Lifecycle of an anomaly &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For Palo Alto Networks, anomalies are unexpected events or patterns that deviate from the norm. In a cloud environment, anomalies can indicate anything from a simple misconfiguration to a full-blown security breach. That's why it's critical to have a system in place to detect, analyze, and mitigate anomalies before they can cause significant damage.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This flowchart illustrates the typical lifecycle of an anomaly, broken down into three key stages:&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_-_Lifecycle_of_an_Anomaly.max-1000x1000.png"
        
          alt="1 - Lifecycle of an Anomaly"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tf1hg"&gt;Figure 1 - Lifecycle of an Anomaly&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The following sections will take a deeper dive into how Palo Alto Networks used Google Cloud to build its custom AI-powered anomaly solution to address each of these stages.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;1. Detection&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The first step is to identify potential anomalies.Palo Alto Networks partnered with Google Cloud Consulting to train the &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai/docs/tabular-data/forecasting-arima/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;ARIMA+ model&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with billing data from its applications using &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery/docs/bqml-introduction"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery ML&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (BQML). The team chose this model for its great results for time-series billing data, its ability to customize hyper parameters, and its overall effective cost of operation at scale. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The ARIMA+ model allowed Palo Alto Networks to generate a baseline spend with upper and lower bounds for its cost anomaly solution.  The team also tuned the model using Palo Alto Networks’ historic billing data, enabling it to inherently understand factors like seasonality, common spikes and dips, migration patterns, and more. If the spend exceeds the upper bound created by the model, the team can then quantify the business cost impact (both percentage and dollar amount) to determine the severity of the alert to be investigated further.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_-_AI-Powered_Cost_Anomaly_Solution_Archi.max-1000x1000.png"
        
          alt="2 - AI-Powered Cost Anomaly Solution Architecture on Google Cloud"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tf1hg"&gt;Figure 2 - AI-Powered Cost Anomaly Solution Architecture on Google Cloud&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Looker, Google Cloud’s business intelligence platform, serves as the foundation for custom data modeling and visualization, seamlessly integrating with Palo Alto Networks’ existing billing data infrastructure, which continuously streams into BigQuery multiple times a day. This eliminates the need for additional data pipelines, ensuring the team has the most up-to-date information for analysis.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;BigQuery MLempowers Palo Alto Networks with robust capabilities for machine learning model training and inference. By leveraging BQML, the team can build and deploy sophisticated models directly within BigQuery, eliminating the complexities of managing separate machine learning environments. This streamlined approach accelerates the ability to detect and analyze cost anomalies in real time. In this case, Palo Alto Networks trained the ARIMA+ model on the last 13 months of billing data for specific applications on the &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Net Spend field&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; to capture seasonality, spikes and dips, along with migration patterns and known spikes based on a custom calendar. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To enhance alerting and anomaly management processes, the team also utilizes Google Cloud Pub/Sub and Cloud Run functions. Pub/Sub facilitates the reliable and scalable delivery of anomaly notifications to relevant stakeholders. Cloud Run functions enable custom logic for processing these notifications, including intelligent grouping of similar anomalies to minimize alert fatigue and streamline investigations. This powerful combination allows Palo Alto Networks to respond swiftly and effectively to potential cost issues.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;2. Notification and analysis&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Once the anomaly is captured, the solution computes the business cost impact and routes alerts to the appropriate application teams through Slack for further investigation. To accelerate root-cause analysis, it synthesizes critical information through text and images to provide all the details about anomaly, pinpointing exactly when it occurred and which SKUs or resources are involved. Application teams can then further analyze this information and, with their application context, quickly arrive at a decision. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Here is an example of snapshot that captured an increased cost in BigQuery that started on July 30th:&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_-_Example_of_Anomaly_Detected_with_resou.max-1000x1000.png"
        
          alt="3 - Example of Anomaly Detected with resource details"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tf1hg"&gt;Figure 3 - Example of Anomaly Detected with Resource details&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The cost anomaly solution automatically gathered all the information related to the flagged anomalies, such as Google Cloud project ID, data, environment, service names and SKUs, along with the cost impact. This data provided much of the necessary context for the application team to act quickly. Here is an example of the Slack alert: &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_-_Example_of_anomaly_alert_on_Slack.max-1000x1000.png"
        
          alt="4 - Example of anomaly alert on Slack"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tf1hg"&gt;Figure 4 - Example of anomaly alert on Slack&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;3. Mitigation&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Once the root cause is identified, it's time to take action to mitigate the anomaly. This may involve anything from making a simple configuration change to deploying a hotfix. In some cases, it may be necessary to escalate the issue and involve cross-functional teams.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In the provided example, a cloud-hosted tenant encountered a substantial increase in data volume due to a configuration error. This misconfiguration led to unusually high BigQuery usage. Because no default BigQuery reservation existed in the newly established region, the system defaulted to the on-demand pricing model, incurring higher costs. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To address this, the team procured 100 baseline slots with a 3-year commitment and implemented autoscaling to accommodate any future spikes without impacting performance. To prevent similar incidents, especially in new regions, a long-term cost governance policy was implemented at the organizational level.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Post incident, the cost anomaly solution generates a blameless post mortem document containing the highlights of the actions taken, the impact of collaboration, and the cost savings achieved through timely detection and mitigation. This document focuses on: &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;A detailed timeline of events&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: This list might include when a cost increase was captured, when the team was alerted, and the mitigation plan with short-term and long-term initiatives to prevent this in future.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Actions taken&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: This description includes details about anomaly detection, the analysis conducted by the application team, and mitigative actions taken. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Preventative strategy: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;This describes the short-term and long-term plan to avoid similar future incidents.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Cost impact and cost avoidance: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;These calculations include the overall cost incurred from the anomaly and estimate the additional cost if the issue had not been detected in a timely manner.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
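&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As a rough illustration of the last item, cost impact and cost avoidance could be estimated as in the sketch below. The formulas are assumptions made for this example, not the solution's documented method: cost impact is taken as spend above the expected baseline, and cost avoidance as the latest daily excess projected over the days the issue would otherwise have continued.&lt;/span&gt;&lt;/p&gt;

```python
def cost_impact(actual_daily, expected_daily):
    """Spend above the expected baseline, summed over the days the anomaly ran."""
    return sum(
        max(actual - expected, 0.0)
        for actual, expected in zip(actual_daily, expected_daily)
    )


def cost_avoidance(actual_daily, expected_daily, days_avoided):
    """Estimated extra spend had the latest daily excess continued for days_avoided days."""
    if not actual_daily or not expected_daily:
        return 0.0
    latest_excess = max(actual_daily[-1] - expected_daily[-1], 0.0)
    return latest_excess * days_avoided


if __name__ == "__main__":
    actual = [150.0, 200.0, 250.0]      # hypothetical daily spend during the anomaly
    expected = [100.0, 100.0, 100.0]    # hypothetical forecast for the same days
    print(cost_impact(actual, expected))
    print(cost_avoidance(actual, expected, days_avoided=10))
```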
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;A formal communication is then sent out to the Palo Alto Networks application team, including leadership, for further visibility. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;From its experience working at scale, Palo Alto Networks has learned to embrace the fact that anomalies are unavoidable in cloud environments. To manage them effectively, a well-defined lifecycle encompassing detection, analysis, and mitigation is crucial. Automated monitoring tools play a key role in identifying potential anomalies, while collaboration across teams is also essential for successful resolution. In particular, the team places huge emphasis on the importance of continuous improvement for optimizing the anomaly management process. For example, they established the reporting dashboard below for long-term continuous governance.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/5_-_Cost_Anomaly_Reporting_Dashboard_in_Lo.max-1000x1000.png"
        
          alt="5 - Cost Anomaly Reporting Dashboard in Looker"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tf1hg"&gt;Figure 5 - Cost Anomaly Reporting Dashboard in Looker&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By leveraging the power of AI and partnering with Google Cloud, Palo Alto Networks is enabling businesses to unlock the full potential of AI while ensuring responsible and sustainable cloud spending. With a proactive approach to cost anomaly management, organizations can confidently navigate the evolving landscape of AI, drive innovation, and achieve their strategic goals. Check out the public preview of Cost Anomaly Detection or reach out to Google Cloud Consulting for a customized solution. &lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;p&gt;&lt;sub&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;We are extremely grateful to the entire team for partnering together to build this solution: &lt;/span&gt;&lt;a href="https://www.linkedin.com/in/yapinggu/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Yaping Gu&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://www.linkedin.com/in/matthew-orr/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Matt Orr&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://www.linkedin.com/in/andy-crutchfield-409694b/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Andy Crutchfield&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://www.linkedin.com/in/ginajeeyounghuh/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Gina Huh&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;. 
&lt;/span&gt;&lt;/sub&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 09 Dec 2024 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/palo-alto-networks-custom-cost-anomaly-detection-with-ai-bill-shocks/</guid><category>AI &amp; Machine Learning</category><category>Security &amp; Identity</category><category>Customers</category><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>To avoid “bill shocks,” Palo Alto Networks deploys custom AI-powered cost anomaly detection</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/palo-alto-networks-custom-cost-anomaly-detection-with-ai-bill-shocks/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Kuntal Patel</name><title>Manager, Cloud FinOps, Palo Alto Networks</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pathik Sharma</name><title>Cloud FinOps Lead, delta, Google Cloud Consulting</title><department></department><company></company></author></item><item><title>Gain control of your Google Cloud costs: Introducing the Cost Attribution Solution</title><link>https://cloud.google.com/blog/topics/cost-management/introducing-the-google-cloud-cost-attribution-solution/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As your Google Cloud usage expands, managing and understanding your cloud costs can become increasingly complex. As you drive adoption of &lt;/span&gt;&lt;a href="https://cloud.google.com/learn/what-is-finops"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;cloud FinOps&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; in your organization, identifying exactly which teams, projects, or services are driving your expenses is essential.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;That's why we're excited to introduce the &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud &lt;/strong&gt;&lt;a href="https://github.com/google/cost-attribution-solution/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Attribution Solution&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; This comprehensive suite of tools and best practices is designed to improve your cost metadata and labeling governance processes, enabling data-driven decisions so you can ultimately optimize your cloud spending. Whether you are just getting started or have been using Google Cloud for a while, the solution has tools and resources to help you.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Harness the power of labels&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The Cost Attribution Solution leverages a fundamental Google Cloud feature that often goes underutilized: labels. These simple yet incredibly powerful key-value pairs act as metadata tags that you can attach to your Google Cloud resources. Think of them as customizable identifiers for your virtual machines, storage buckets, databases, and more. By strategically applying labels, you can unlock a wealth of cost insights:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Granular cost breakdowns:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; See exactly how much you're spending on specific services, applications, environments (like development, testing, and production), or even individual teams within your organization.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Data-driven decisions:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Make informed choices about where to allocate resources, how to optimize costs, and what future investments are justified.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Customizable reporting:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Generate reports tailored to your organization's specific needs. Need a breakdown of costs by department? Or by project phase? Labels make it possible.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Imagine being able to instantly answer questions like:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;What's the cost difference between our development and production environments?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;How much is the marketing team spending on cloud resources compared to the engineering team?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Are there specific services or applications that are disproportionately driving our monthly bill?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;What's the true infrastructure cost of running our critical shopping cart service?&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the Cost Attribution Solution, these insights are no longer out of reach.&lt;/span&gt;&lt;/p&gt;
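&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the idea concrete, here is a minimal sketch of a label-based cost breakdown. It assumes billing rows have already been loaded as dictionaries with a cost value and a labels mapping; these field names and the sample label keys are hypothetical and do not reflect the solution's own tooling.&lt;/span&gt;&lt;/p&gt;

```python
from collections import defaultdict


def cost_by_label(rows, label_key, unlabeled="(unlabeled)"):
    """Total cost per value of one label key.

    Rows that lack the label are collected under the unlabeled bucket,
    which makes gaps in cost attribution visible.
    """
    totals = defaultdict(float)
    for row in rows:
        value = row.get("labels", {}).get(label_key, unlabeled)
        totals[value] += row["cost"]
    return dict(totals)


if __name__ == "__main__":
    rows = [
        {"cost": 40.0, "labels": {"env": "prod", "team": "marketing"}},
        {"cost": 10.0, "labels": {"env": "dev", "team": "engineering"}},
        {"cost": 5.0, "labels": {}},
    ]
    print(cost_by_label(rows, "env"))
    print(cost_by_label(rows, "team"))
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The size of the unlabeled bucket is a quick indicator of how much spend still cannot be attributed to a team or environment.&lt;/span&gt;&lt;/p&gt;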
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Proactive and reactive strategies for label governance&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We understand that every organization's Google Cloud environment is unique, with different levels of maturity in cloud adoption and resource management. That's why the Cost Attribution Solution offers both proactive and reactive governance approaches for labels:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Proactive governance (enforcement):&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Start on the right foot by enforcing consistent and accurate labeling from the moment you provision new resources. &lt;/span&gt;&lt;a href="https://cloud.google.com/docs/terraform/policy-validation"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Terraform Policy Validation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; integrates into your infrastructure-as-code workflows, helping ensure that every new resource is labeled correctly according to your organization’s labeling policies. This prevents cost tracking gaps and improves data accuracy from day one.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_d31je9Z.max-1000x1000.png"
        
          alt="1"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Reactive governance (reporting, alerting and &lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;reconciliation&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;):&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For existing resources, we offer a dual approach:&lt;/span&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Reporting:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Our tools help you identify unlabeled resources, providing a clear picture of where you may have gaps in cost visibility down to individual projects and resources.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_C4FAb5w.max-1000x1000.jpg"
        
          alt="2"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li style="list-style-type: none;"&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong style="vertical-align: baseline;"&gt;Alerting:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Receive near real-time alerts when resources are created or modified without the proper labels, enabling you to quickly rectify any issues and maintain control over your cloud costs.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_X7TKTJ9.max-1000x1000.png"
        
          alt="3"&gt;
        
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Reconciliation&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Go beyond just reporting by actively enforcing your labeling policies on existing projects. This empowers you to automate the application of correct labels to unlabeled or mislabeled resources, for comprehensive cost visibility and data accuracy across your entire Google Cloud landscape. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Getting started&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Ready to embark on your journey towards cost transparency? Our &lt;/span&gt;&lt;a href="https://github.com/google/cost-attribution-solution" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GitHub repository&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and the documentation on &lt;/span&gt;&lt;a href="https://cloud.google.com/resource-manager/docs/best-practices-labels"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;best practices for labels&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; are your starting points. You'll find a wealth of resources, including:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Best practices:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; A guide to designing and implementing an effective labeling strategy tailored to your organization's structure and goals.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Solution architectures:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Detailed diagrams and explanations of how to deploy the Cost Attribution Solution components in your Google Cloud environment.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Code samples and tutorials:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Hands-on examples to help you get started quickly.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Here is a Looker Studio dashboard for &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/visualize-data"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;interactive cost visualization&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and additional tools to streamline your cost management processes. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Furthermore, our &lt;/span&gt;&lt;a href="https://cloud.google.com/consulting/innovation-and-transformation"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Consulting FinOps experts&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; can assess your needs and chart a course to fully integrate the Cost Attribution Solution across your organization's Google Cloud environment.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Embrace cost transparency&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Gain granular visibility into your cloud spending with the Google Cloud Cost Attribution Solution. Leverage labels to achieve granular cost breakdowns, optimize resource usage, and make data-driven decisions that align with your business goals. The solution will soon incorporate support for &lt;/span&gt;&lt;a href="https://cloud.google.com/resource-manager/docs/tags/tags-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;tags&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, offering a powerful way to organize resources across projects and implement fine-grained access control through IAM conditions. This additional layer of resource management empowers you to not only understand your costs but also streamline operations and enhance security.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Unlock the full potential of your cloud infrastructure and drive greater efficiency and ROI with the &lt;/span&gt;&lt;a href="https://github.com/google/cost-attribution-solution/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cost Attribution Solution&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 11 Oct 2024 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/introducing-the-google-cloud-cost-attribution-solution/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Gain control of your Google Cloud costs: Introducing the Cost Attribution Solution</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/introducing-the-google-cloud-cost-attribution-solution/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Ben Good</name><title>Solutions Architect</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Anuradha Bajpai</name><title>Solutions Architect</title><department></department><company></company></author></item><item><title>Reduce unexpected costs with the new AI-powered Cost Anomaly Detection</title><link>https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Controlling runaway spend and minimizing unexpected costs is a priority for every business. Imagine a scenario where faulty development or rogue code results in a usage spike over the weekend, unbeknownst to you. 
If not caught in time, &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;this kind of usage can result in cost spikes that &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;can exhaust your budgets and put a strain on finances. &lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, we provide customers with a comprehensive set of cost management tools and controls to help prevent surprises. Now, we’re expanding our FinOps capabilities with AI technology that further simplifies cost management and helps ensure spend predictability. At &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud Next ’24, we announced Cost Anomaly Detection and today, it's available to all customers in public preview. Cost Anomaly Detection helps &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;identify anomalies in real or near-real-time and enables timely alerts so that you can avoid surprises, take swift action and control runaway costs&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Getting to know Cost Anomaly Detection&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud’s Cost Anomaly Detection can help you identify unusual spikes in cloud spending, across all products and services, by automatically monitoring your cloud projects and displaying any spikes in your billing console. This product does not require any setup and is available at no cost for all customers. Important components include:&lt;/span&gt;&lt;/p&gt;
&lt;h3 role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;1. Detection&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Using AI, Cost Anomaly Detection identifies your spend patterns based on historical and seasonal trends and forecasts an expected rate of daily spend specific to your project. It continuously monitors your actual spend &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;every hour&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; and detects any deviation. These deviations are then identified as spikes or anomalies — a.k.a. ‘cost impact’ within the Cost Anomaly Detection dashboard. Since Cost Anomaly Detection monitors your spend on an hourly basis, it can identify any unexpected upward spikes within 24 hours, for most services&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;, detecting anomalies in near real-time.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_zbNsB2b.max-1000x1000.png"
        
          alt="1"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="xri9h"&gt;List of anomalies ordered by date&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;2. Investigation&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Once an anomaly is detected, you want to understand its root cause. For each anomaly it identifies, Cost Anomaly Detection provides a detailed, easy-to-understand root-cause analysis that lists the top contributors to the spend. This allows you to narrow your investigation on the exact project, service, region or SKU that needs corrective action, thereby enabling quicker remediation. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--medium
      
      
        h-c-grid__col
        
        h-c-grid__col--4 h-c-grid__col--offset-4
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_Sfpe1d3.max-1000x1000.png"
        
          alt="2"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="xri9h"&gt;Root cause analysis panel&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;3. Alerts&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Once you know of an anomaly and its root cause, the appropriate owners need to be alerted to the impact on their respective projects, so they can cap or turn off usage. Today, anomaly notifications are sent through email and Pub/Sub, allowing for a wide range of personas to be notified, from the FinOps team to engineering. Cost Anomaly Detection also lets you easily set up customizable alert preferences that notify a set of desired recipients of an anomaly as soon as it is detected, while Pub/Sub alerts help with integration with your internal workflow management tools. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_aQWgrKC.max-1000x1000.png"
        
          alt="3"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="xri9h"&gt;Set customized alerts for anomalies&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Cost Anomaly Detection also lets you tailor your alerting threshold, based on cost impact, so that only significant anomalies are displayed and alerted. We recommend monitoring anomalies for at least one month before defining a threshold that applies across all your projects. &lt;/span&gt;&lt;/p&gt;
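&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;As a rough illustration of picking a threshold after that observation period, the Python sketch below takes a month of anomaly cost impacts and returns the cutoff that would keep roughly the top quarter of them. The sample figures, the function names, and the 25% keep fraction are all hypothetical. This is not how Cost Anomaly Detection computes anything; it is only one way to reason about a starting value before you enter it in the console.&lt;/span&gt;&lt;/p&gt;
&lt;pre&gt;
```python
def suggest_threshold(cost_impacts, keep_fraction=0.25):
    """Return the smallest cost impact among the largest keep_fraction of anomalies."""
    if not cost_impacts:
        raise ValueError("need at least one observed anomaly")
    ranked = sorted(cost_impacts, reverse=True)
    # Always keep at least one anomaly so the cutoff is well defined.
    keep_count = max(1, round(len(ranked) * keep_fraction))
    return ranked[keep_count - 1]


def significant(cost_impacts, threshold):
    """Keep only the anomalies whose cost impact meets the threshold."""
    return [impact for impact in cost_impacts if impact >= threshold]


# Hypothetical cost impacts (in your billing currency) observed over one month.
observed = [12.0, 480.0, 35.5, 1500.0, 8.25, 220.0, 64.0, 950.0]
threshold = suggest_threshold(observed)
alerts = significant(observed, threshold)
print(threshold, alerts)
```
&lt;/pre&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;A lower keep fraction gives a higher cutoff and fewer alerts. Whatever value you start with, revisit it as your spend patterns change.&lt;/span&gt;&lt;/p&gt;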
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Additionally, Cost Anomaly Detection continuously learns about your spend patterns, helping to reduce the possibility of false positives and increase sensitivity to not only monthly and seasonal trends, but also inter-day and inter-week fluctuations. To that end, for every identified anomaly, you can provide feedback on whether it was truly unexpected or a false positive due to, for example, a planned migration. This feedback helps the Cost Anomaly Detection AI models adapt in real time to your usage and take planned usage into account when evaluating future spikes.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhanced cost observability&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;With Cost Anomaly Detection, you have another way of optimizing your spend: controlling unintended cost. This, when coupled with existing tools such as &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/budgets"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Budgets&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, allows for more robust and flexible cost-control governance. The product requires no setup, detects same-day anomalies, and enables focused action through detailed root-cause analysis and near-real-time alerts. If you’re already using your own anomaly detection solution, we encourage you to try Cost Anomaly Detection for free and compare the results and the customizable controls available. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Head over to the Google Cloud &lt;/span&gt;&lt;a href="https://console.cloud.google.com/billing"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;billing console&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to access this experience and start elevating your FinOps game! For more details on this product, read the documentation &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/manage-anomalies"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 07 Oct 2024 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Reduce unexpected costs with the new AI-powered Cost Anomaly Detection</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/introducing-cost-anomaly-detection/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Shruthi Nambi</name><title>Product Manager</title><department></department><company></company></author></item><item><title>BigQuery jobs explorer: Your central hub for monitoring and troubleshooting BigQuery jobs</title><link>https://cloud.google.com/blog/products/data-analytics/bigquery-jobs-explorer-is-now-ga/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Ever feel overwhelmed by the sheer number of SQL queries running in your organization? Identifying expensive queries, tracking who's running them, and spotting spikes in errors are all part of the daily work. 
Efficient monitoring and management of query activity are essential to maintain a healthy and performant system.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We are excited to announce &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery/docs/admin-jobs-explorer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery jobs explorer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, your command center for all things query-related. Now generally available, BigQuery jobs explorer helps you gain deep visibility into your organization's query activity, streamline troubleshooting, and optimize resource utilization.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_eiVqHsq.max-1000x1000.png"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;"BigQuery jobs explorer gives us a comprehensive single-pane view of SQL activity across our entire organization, which helps us pinpoint anomalies and address them proactively. Since its launch, jobs explorer has become an essential asset in boosting platform efficiency and maintaining optimal system performance at PayPal!" &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;- Abhijit Vyas, Senior MTS Database Engineer, PayPal&lt;/span&gt;&lt;/p&gt;
&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Solve multiple challenges with a single tool&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;BigQuery jobs explorer is a versatile tool that empowers you to tackle a wide range of use cases, all from a single platform. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Monitor: Get a bird's-eye view of query activity&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Jobs explorer provides a comprehensive, real-time view of all SQL activity across your organization. No more piecing together information from different sources — you get a single pane of glass to see what's happening, when, and where.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Real-time monitoring:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Track job status, progress, and resource usage as they happen.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Key metrics at a glance:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Sort and analyze traffic based on metrics like TotalSlotMS, bytes processed, and more.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Visualize query execution:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/data-analytics/understanding-the-bigquery-query-execution-graph"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Intuitive graphs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; make it easy to understand query performance patterns.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With this level of visibility, you can proactively identify potential issues, spot trends, and make informed decisions about resource allocation.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Troubleshoot: Quickly identify and resolve problems&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;When something goes wrong, jobs explorer helps you get to the root of the problem fast.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;No more complex queries:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Access critical job information without writing any INFORMATION_SCHEMA queries.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Powerful filtering and sorting:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Quickly narrow down jobs by status, priority, owner, project, and more.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Take action:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Kill runaway queries directly from jobs explorer to save costs and reclaim resources.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Deep dive into query details:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Click on any job to see its &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/data-analytics/understanding-the-bigquery-query-execution-graph" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;execution graph&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and other key execution details.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_RFjQiOw.max-1000x1000.png"
        
          alt="2"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Jobs explorer simplifies the process of troubleshooting, allowing you to focus on keeping your BigQuery environment running smoothly.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Optimize: Improve performance and control costs&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Jobs explorer isn't just about reacting to problems — it's about proactively optimizing your BigQuery usage.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Identify performance bottlenecks:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Pinpoint queries that are consuming excessive resources or taking too long to complete.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Query performance insights:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Find and address queries that have been tagged by BigQuery with &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery/docs/query-insights"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;actionable performance insights&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Control costs:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Avoid overspending by identifying and addressing inefficient queries. Often a small fraction of your queries account for the majority of your optimization gains!&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By giving you a deeper understanding of your query activity, jobs explorer helps you make the most of your BigQuery spend.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;What's next?&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;BigQuery Jobs Explorer is just the beginning. We're committed to continuously improving and expanding its capabilities to meet your evolving needs. Stay tuned for future updates, and feel free to share your feedback at &lt;/span&gt;&lt;a href="mailto:bq-query-inspector-feedback@google.com"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;bq-query-inspector-feedback@google.com&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. To learn about the feature in more detail, please see the &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery/docs/admin-jobs-explorer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;public documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 23 Sep 2024 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/data-analytics/bigquery-jobs-explorer-is-now-ga/</guid><category>Cost Management</category><category>Management Tools</category><category>Data Analytics</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>BigQuery jobs explorer: Your central hub for monitoring and troubleshooting BigQuery jobs</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/data-analytics/bigquery-jobs-explorer-is-now-ga/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Vinay Yerramilli</name><title>Product Manager, BigQuery</title><department></department><company></company></author></item><item><title>Flexible committed-use discounts are now even more flexible</title><link>https://cloud.google.com/blog/products/containers-kubernetes/compute-flexible-cud-expands-to-gke-autopilot-and-cloud-run/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud offers many 
great ways to run your workloads: low-level VMs in Google Compute Engine, container orchestration with Google Kubernetes Engine (GKE) — including via fully-managed &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/autopilot-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Autopilot mode&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; — and Cloud Run. Until now, to optimize your spend, you needed to purchase several Committed-use Discounts (CUDs) to cover each of these different products. For example, you might have purchased a Compute Engine Flexible CUD for VM spend including workloads running on GKE’s standard mode, a Cloud Run CUD for Cloud Run always-on instances, and an Autopilot CUD for workloads running in GKE Autopilot.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Expanding Compute Flexible CUDs&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Today we are excited to announce that the Compute Engine Flexible CUD, now known as the&lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt; Compute Flexible CUD,&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; has been expanded to cover Cloud Run on-demand resources, most GKE Autopilot Pods and the premiums for Autopilot Performance and Accelerator compute classes. The &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and our &lt;/span&gt;&lt;a href="https://cloud.google.com/skus/sku-groups/compute-engine-flexible-cud-eligible-skus"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;SKU list&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; have the precise details on what’s included.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With one CUD purchase, you can cover eligible spend on all three products: Compute Engine, GKE, and Cloud Run. You can save 46% with a three-year commitment and 28% with a one-year commitment. With this single unified CUD, you make one commitment and spend it across all these products, maximizing its flexibility. Furthermore, these commitments are not region-specific, so you can use them on resources in any region across these products.&lt;/span&gt;&lt;/p&gt;
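&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To get a feel for those percentages, here is a deliberately simplified Python sketch. It assumes the commitment is expressed as hourly on-demand spend, that you pay the discounted fee for the full commitment whether or not you use it, and that any eligible spend above the commitment is billed at on-demand rates. Only the 28% and 46% rates come from this post; the class, the function, and the dollar figures are hypothetical, and actual billing follows the documentation and SKU list linked above.&lt;/span&gt;&lt;/p&gt;
&lt;pre&gt;
```python
from dataclasses import dataclass

# Discount rates quoted in this post; everything else is an illustrative assumption.
DISCOUNT_BY_TERM_YEARS = {1: 0.28, 3: 0.46}


@dataclass(frozen=True)
class HourlyEstimate:
    on_demand_cost: float  # what the hour would cost with no commitment
    cost_with_cud: float   # discounted commitment fee plus any on-demand overage

    @property
    def savings(self) -> float:
        return self.on_demand_cost - self.cost_with_cud


def estimate_hour(eligible_spend: float, committed_spend: float, term_years: int) -> HourlyEstimate:
    """Estimate one hour of cost under the simplified model described above."""
    discount = DISCOUNT_BY_TERM_YEARS[term_years]
    # The fee is owed for the whole commitment, even if usage falls short.
    commitment_fee = committed_spend * (1 - discount)
    # Spend beyond the commitment is assumed to be billed on demand.
    overage = max(eligible_spend - committed_spend, 0.0)
    return HourlyEstimate(on_demand_cost=eligible_spend, cost_with_cud=commitment_fee + overage)


# Hypothetical hour: 100 of eligible spend against an 80 three-year commitment.
well_sized = estimate_hour(eligible_spend=100.0, committed_spend=80.0, term_years=3)
# Hypothetical hour: usage drops to 50 against an 80 one-year commitment.
over_committed = estimate_hour(eligible_spend=50.0, committed_spend=80.0, term_years=1)
print(round(well_sized.savings, 2), round(over_committed.savings, 2))
```
&lt;/pre&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Under these assumptions, the second case comes out negative: a commitment larger than your sustained usage can cost more than paying on demand, which is why sizing it to your steady baseline matters.&lt;/span&gt;&lt;/p&gt;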
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Retiring the Autopilot CUD&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Since the new expanded Compute Flexible CUD has a higher discount than the GKE Autopilot CUD and greater overall flexibility, we’re retiring the GKE Autopilot CUD. You can still purchase the legacy GKE Autopilot CUD until October 15, after which it will no longer be available for purchase. Any existing CUDs will continue to apply through their term regardless of when you purchase them. That said, we recommend looking into the newly expanded Compute Flexible CUD for your needs now and in the future, for its greater flexibility and better discounts!&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;How to get started&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you're already using Flexible CUDs for Compute Engine, you'll automatically see the discounts applied to eligible Cloud Run and GKE Autopilot usage (if you have product-specific CUDs like the legacy GKE Autopilot CUD, those will apply first). If you're new to Compute Flexible CUD, it's easy to get started: estimate your hourly spend across eligible SKUs, and purchase a commitment that matches your expected sustained usage over the one- or three-year term, and start enjoying the savings! You can add additional CUDs as your usage grows.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Use-latest.max-1000x1000.png"
        
          alt="Use-latest"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We hope you find this new flexibility useful when it comes to platforming your workloads on Google Cloud!&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Next steps&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Learn about &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Compute Flexible CUDs&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;View &lt;/span&gt;&lt;a href="https://cloud.google.com/run/pricing"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cloud Run pricing&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;View &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/pricing#autopilot_mode"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GKE pricing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/cud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;CUD options&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://console.cloud.google.com/billing/reports/commitments"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Purchase a Compute Flexible CUD in the console&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</description><pubDate>Mon, 15 Jul 2024 18:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/containers-kubernetes/compute-flexible-cud-expands-to-gke-autopilot-and-cloud-run/</guid><category>GKE</category><category>Cost Management</category><category>Serverless</category><category>Containers &amp; Kubernetes</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Flexible committed-use discounts are now even more flexible</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/containers-kubernetes/compute-flexible-cud-expands-to-gke-autopilot-and-cloud-run/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>William Denniss</name><title>Group Product Manager, Google Kubernetes Engine</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Yasmin Mowafy</name><title>Sr. Product Manager</title><department></department><company></company></author></item><item><title>Normalize billing data across clouds with new Looker template and BigQuery views</title><link>https://cloud.google.com/blog/topics/cost-management/cloud-costs-come-into-view-with-focus-v1-0-ga/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, we strongly believe you should have resources to analyze Google Cloud costs alongside other cloud providers, so you can better manage and optimize cloud costs. You should not need to spend time mapping billing terminology across cloud providers. And we believe in doing that through open standards. 
We were a founding member of the FinOps Foundation, &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/working-with-finops-foundation-on-open-cloud-billing-data"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;a founding Steering Committee member&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; of the FOCUS™ project, and a core contributor for the v0.5 and v1.0 Preview and GA open billing specifications. Today, we’re excited to announce a new Looker template view that leverages the recent FOCUS v1.0 GA to help simplify cloud cost management across clouds.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;What is FOCUS?&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The unifying specification for cloud billing data, FOCUS is a &lt;/span&gt;&lt;a href="http://focus.finops.org" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;technical specification&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; that normalizes cost and usage billing data across cloud vendors. FOCUS aims to deliver consistency and standardization across cloud billing data by unifying cloud and usage data into one common data schema. Before FOCUS, there was no industry-standard way to normalize key cloud cost and usage measures across multiple cloud service providers (CSPs), making it challenging to understand how billing costs, credits, usage, and metrics map from one cloud provider to another (see &lt;/span&gt;&lt;a href="https://focus.finops.org/faqs/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps FAQs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for more details). &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The FOCUS initiative is developing an open standard for cloud billing data and is being adopted by all major cloud vendors. With the introduction of Version 1.0, there is a common taxonomy, terminology, and metrics for billing datasets produced by CSPs. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Introducing a new Looker template for FOCUS v1.0 GA&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our new Looker template allows you to visualize your open billing data in Looker, generating a table based on the results of the FOCUS query. The provided LookML code creates and manages these tables automatically, so you won't need to create them manually. This template offers a glimpse of what’s possible when you visualize your cost trends across services, SKUs, zones, regions, and resource types, and it offers many benefits: &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Out-of-the-box template:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; No more waiting for custom dashboards. The template gives you immediate access to pre-built visualizations that reveal cost trends and breakdowns by services, charges, and regions.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Easy filtering:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; You don't need to be a data analyst to use this template. Looker has an intuitive interface that lets you filter to specific time periods or services, and drill down into details with just a few clicks.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Customizability:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; While the template is a great starting point, Looker's flexibility lets you tailor the views to your specific needs. If you need to add custom metrics, change the visualizations, or embed the dashboards into your existing workflows, you can do that easily.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_mj8zNrS.max-1000x1000.png"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="e144y"&gt;View your costs by billed services, publisher, commitments and more&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;An updated BigQuery view for FOCUS v1.0 GA&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We offer three ways to &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/export-data-bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;export&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; cost and usage-related &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/how-to/export-data-bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cloud Billing data to BigQuery&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;: Standard Billing Export, Detailed Billing Export (resource-level data and price fields to join with Price Export table), and Price Export. In January, we introduced a new &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery/docs/views-intro"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery view&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, a virtual table that represents the results of a SQL query, that transforms data towards FOCUS v1.0 Preview format. Today, we’re announcing an update to that BigQuery view to adapt towards the FOCUS v1.0 GA.  If you are already using the Preview and want to update your BigQuery view to the FOCUS GA, please see the existing guide, which is kept up-to-date to reflect any new changes. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;BigQuery views are great because the queryable virtual table only contains data from the tables and fields specified in the base query that defines the view. BigQuery views are virtual tables, so they incur no additional charges for data storage if you are already using Billing Export to BigQuery. With this BigQuery view you can:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;View and query Google Cloud billing data that is adapted towards the FOCUS v1.0 specification&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Use the BigQuery view as a data source for a visualization tools like Looker Studio&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Analyze your Google Cloud costs alongside data from other providers using the common FOCUS format&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;How it works&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;The FOCUS BigQuery view acts as a virtual table that sits on top of your existing &lt;/span&gt;&lt;a href="https://cloud.google.com/billing/docs/concepts"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cloud Billing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; data. To use this feature, you will need Detailed Billing Export and Price Exports enabled. The FOCUS BigQuery view uses a base SQL query to map your Cloud Billing data into the FOCUS schema, presenting it in the specified format. This allows you to query and analyze your data as if it were native to FOCUS, making it easier to analyze costs across different cloud providers.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;The Looker template is supported by Looker and Looker Core, not Looker Studio. To use the template out of the box, ensure you have Detailed Billing Export and Pricing Export Enabled. You will also need permissions to create new Looker Project &amp;amp; Connection. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Unlike BigQuery Views, this Looker template utilizes temporary tables. The provided LookML code will create and manage these tables automatically, so you won't need to create them manually.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
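&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make the mapping idea concrete, here is a minimal Python sketch of what "mapping Cloud Billing data into the FOCUS schema" means for a single row. It is not the SQL behind the actual view: the column subset, the source field paths, the ProviderName value, and the helper names are assumptions chosen for illustration.&lt;/span&gt;&lt;/p&gt;

```python
from decimal import Decimal

# Assumed, partial mapping from FOCUS v1.0 column names to Cloud Billing
# export field paths. The real view covers many more columns and applies
# more logic than a straight rename.
FOCUS_COLUMN_SOURCES = {
    "BilledCost": "cost",
    "BillingCurrency": "currency",
    "ServiceName": "service.description",
    "RegionId": "location.region",
    "ChargePeriodStart": "usage_start_time",
    "ChargePeriodEnd": "usage_end_time",
}


def get_path(row, dotted_path):
    """Follow a dotted path such as 'service.description' through nested dicts."""
    value = row
    for key in dotted_path.split("."):
        if not isinstance(value, dict):
            return None  # the path does not exist in this row
        value = value.get(key)
    return value


def to_focus_row(billing_row):
    """Return a dict keyed by FOCUS column names for one billing export row."""
    focus_row = {
        column: get_path(billing_row, source)
        for column, source in FOCUS_COLUMN_SOURCES.items()
    }
    focus_row["ProviderName"] = "Google Cloud"  # assumed constant for this sketch
    return focus_row


# Hypothetical export row, shaped like the nested billing export records.
example_row = {
    "cost": Decimal("12.34"),
    "currency": "USD",
    "service": {"description": "Compute Engine"},
    "location": {"region": "us-central1"},
    "usage_start_time": "2024-06-01T00:00:00Z",
    "usage_end_time": "2024-06-01T01:00:00Z",
}
focus_example = to_focus_row(example_row)
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Because every provider's data ends up under the same column names, rows produced this way can be combined with FOCUS-formatted data from other sources without per-provider logic.&lt;/span&gt;&lt;/p&gt;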
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We've made it easy to leverage the power of FOCUS in Looker and in BigQuery with a step-by-step guide. To view this Looker template and sample SQL query and follow the step-by-step guide, &lt;/span&gt;&lt;a href="https://cloud.google.com/resources/google-cloud-focus"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;sign up here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_1pie0lA.max-1000x1000.png"
        
          alt="2"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="e144y"&gt;Compare costs by services, regions, availability zones, and commitments&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Looking ahead: Leading in open billing standards &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We look forward to continuing to shape the standards of open billing standards alongside our customers, FinOps practitioners in the industry, the FinOps Foundation, CSPs, SaaS providers, and more. Get a unified view of your cloud costs today with the &lt;/span&gt;&lt;a href="https://cloud.google.com/resources/google-cloud-focus"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FOCUS Looker template and BigQuery view&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. Sign up &lt;/span&gt;&lt;a href="https://cloud.google.com/resources/google-cloud-focus"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn more and get started.&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;p&gt;&lt;sup&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Special thanks to Paige Rutherford, Sidney Stefani, Jingjie Zheng, Jacky Liu, and Gina Huh, who helped develop these features.&lt;/span&gt;&lt;/sup&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 21 Jun 2024 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/cloud-costs-come-into-view-with-focus-v1-0-ga/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Normalize billing data across clouds with new Looker template and BigQuery views</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/cloud-costs-come-into-view-with-focus-v1-0-ga/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Rupa Patel</name><title>Senior Product Manager, Cloud FinOps</title><department></department><company></company></author></item><item><title>Leveling up FinOps: 5 cost management innovations from FinOps X 2024</title><link>https://cloud.google.com/blog/topics/cost-management/cloud-cost-management-enhancements-at-finops-x-2024/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, our FinOps product philosophy is that all cloud costs should be visible and allocated, spend should be efficient with no waste, and there should be no surprise costs. And once again, Google Cloud is at the forefront of FinOps innovation, leading with some exciting new product announcements at &lt;/span&gt;&lt;a href="https://x.finops.org/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps X 2024&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. Because if there is one thing we love, it’s unlocking cloud value for everyone through innovation! &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Here are five ways we’re revolutionizing FinOps this year at FinOps X:&lt;/span&gt;&lt;/p&gt;
&lt;h3 role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;1. Making open cloud billing data a reality &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Next’24 we announced a new &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; view that transforms Google Cloud cost data so that it aligns with the attributes and metrics defined in the latest &lt;/span&gt;&lt;a href="http://focus.finops.org" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps Open Cost &amp;amp; Usage (FOCUS) specification&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. A BigQuery view is a virtual table that represents the results of a SQL query; if you already use Billing Export to BigQuery, it incurs no additional data storage charges. &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/cloud-costs-come-into-view-with-focus-v1-0-ga"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;This week&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, we updated the BigQuery view to match the latest FOCUS v1.0 Specification GA release, and announced a FOCUS Looker view that works with this BigQuery View. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the FOCUS Looker view, you can now: &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Visualize and filter your Google Cloud billing data that is adapted towards the FOCUS specification&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Visualize your costs, changes, services, regions, and availability zones on intuitive graphs &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Limit your manual work; the provided LookML code creates and manages these tables automatically — no need to create them manually&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_mj8zNrS.max-1000x1000.png"
        
          alt="1"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="uiny3"&gt;Visualize your Google Cloud data, normalized according to FOCUS 1.0 standards, sorted by list cost&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;2. Speaking in the language of business, not technology   &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In partnership with Google Cloud’s AI research teams, we have evolved &lt;/span&gt;&lt;a href="https://cloud.google.com/products/gemini/cloud-assist"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini Cloud Assist&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to help augment your FinOps cost management capabilities, &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;embedded&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; within Reports. With Gemini Cloud Assist, our express goal is to put accuracy above everything — because when it comes to cloud costs, you can’t afford to be right only some of the time. Here are  few ways Gemini can help you &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;save time:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Create cost reports on the fly&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Simply tell Gemini Cloud Assist what costs you want to learn about, for example &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“What are my Compute costs for Project Dora last month?”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; Or you can ask a business question you’re grappling with e.g., &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“What caused my costs to increase last quarter?”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; Gemini Cloud Assist helps to provide you with the right Cost Report, so you can be confident about its answer, and dive deeper to answer your questions.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Summarize key insights: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;You no longer need to download and manually analyze data to understand your costs. Gemini Cloud Assist provides key insights directly within your cost reports, offering instant access to the most significant cost drivers and trends without digging through the data. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Go deep into granular cost trends: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Using Billing BigQuery Exports (BQE), you no longer have to write queries to replicate the data you see in your Cost Reports.&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Anytime you view a cost report of your Google Cloud usage, we can provide you with a BigQuery script to dive deeper into the granular costs, turning FinOps professionals into data scientists. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Gemini Cloud Assist for FinOps to augment your efforts in a manner that puts accuracy and privacy above all else. And for extra peace of mind, we’ve also made it easier for you to quickly audit our answers, for extra peace of mind: &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-video"&gt;



&lt;div class="article-module article-video "&gt;
  &lt;figure&gt;
    &lt;a class="h-c-video h-c-video--marquee"
      href="https://youtube.com/watch?v=0KwzSX6l28I"
      data-glue-modal-trigger="uni-modal-0KwzSX6l28I-"
      data-glue-modal-disabled-on-mobile="true"&gt;

      
        &lt;img src="//img.youtube.com/vi/0KwzSX6l28I/maxresdefault.jpg"
             alt="Gemini Cloud Assist for FinOps"/&gt;
      
      &lt;svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white"&gt;
        &lt;use xlink:href="#mi-youtube-icon"&gt;&lt;/use&gt;
      &lt;/svg&gt;
    &lt;/a&gt;

    
  &lt;/figure&gt;
&lt;/div&gt;

&lt;div class="h-c-modal--video"
     data-glue-modal="uni-modal-0KwzSX6l28I-"
     data-glue-modal-close-label="Close Dialog"&gt;
   &lt;a class="glue-yt-video"
      data-glue-yt-video-autoplay="true"
      data-glue-yt-video-height="99%"
      data-glue-yt-video-vid="0KwzSX6l28I"
      data-glue-yt-video-width="100%"
      href="https://youtube.com/watch?v=0KwzSX6l28I"
      ng-cloak&gt;
   &lt;/a&gt;
&lt;/div&gt;

&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;3. Expanding the definition of cost to include carbon&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="http://goo.gle/finops-hub" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;h&lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;ub&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; now integrates carbon footprint reporting to optimize your cloud environments for both financial performance as well as sustainability. Carbon footprint reporting lets you measure, report, and reduce carbon emissions while achieving your business goals. Through location-based carbon emission data and Google's unattended project recommendations, FinOps Hub provides actionable insights to drive impactful decisions that benefit both the bottom line and the planet. Google’s unattended project recommendation uses historical usage to provide recommendations about idle resources that can save you both money and carbon emissions. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By using carbon reporting directly in FinOps hub, you can gain a better understanding of your cloud environment's environmental impact, for example: &lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Identify emission hotspots&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;: easily pinpoint the regions, projects, and products that contribute to most of your carbon footprint. Use this valuable information to help you identify changes you can make to improve your sustainability posture.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Set, track, and achieve sustainability goals&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;: the carbon footprint report can be used as the baseline for setting and tracking your sustainability goals.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;Identify carbon efficient regions: To reduce your carbon footprint, you can use carbon reporting footprint to identify and deploy your resources on the most carbon-efficient regions. FinOps hub recommendations now include "Low CO2" indicators to identify the most efficient regions.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/2_c4qpwuh.gif"
        
          alt="2"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="q5m3u"&gt;View your carbon footprint across regions, projects, or individual Google Cloud services.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;4. Modeling what an efficient cloud looks like, in near real-time &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We've heard your feedback loud and clear. You love our CUD recommendations, but you need more power to model "what-if" scenarios that reflect your unique business reality. That's why we're thrilled to introduce FinOps hub’s &lt;/span&gt;&lt;a href="https://cloud.google.com/docs/cuds-recommender"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Scenario Modeling for CUDs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Now, you can build scenarios that reflect your business reality and quickly identify the right level of commitments to match your commitment strategy. Then, unlock more savings by:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Understanding usage patterns: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Dive deep into historical data with customizable lookback periods of 30, 60, 90 or 180 days (lookback of 180 days will be available in July 2024)&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Eliminating data noise:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Easily filter out anomalies and outliers that could skew your projections. &lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Seeing instant results:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Adjust your model parameters and watch the recommended commitment amount, estimated monthly savings, and usage pattern graphs update in near real-time.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Collaborating with confidence:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; share your model with colleagues and decision-makers to foster alignment and drive informed decision making.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
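&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For a rough feel of what a "what-if" commitment scenario involves, here is a small Python sketch. It is not how FinOps hub computes its recommendations: the outlier rule, the cost model (the committed amount is billed every hour at a discounted rate, and usage above it is billed on demand), and all function names are simplifying assumptions.&lt;/span&gt;&lt;/p&gt;

```python
from statistics import median


def filter_outliers(hourly_usage, max_ratio=3.0):
    """Drop samples above max_ratio times the median (an assumed, simple rule)."""
    typical = median(hourly_usage)
    return [used for used in hourly_usage if typical * max_ratio >= used]


def scenario_cost(hourly_usage, commitment, on_demand_rate, discount):
    """Total cost over the lookback window for one candidate commitment level."""
    committed_rate = on_demand_rate * (1 - discount)
    total = 0.0
    for used in hourly_usage:
        overflow = max(used - commitment, 0.0)  # usage not covered by the commitment
        total += commitment * committed_rate + overflow * on_demand_rate
    return total


def best_commitment(hourly_usage, candidates, on_demand_rate, discount):
    """Pick the candidate commitment with the lowest modeled cost."""
    return min(
        candidates,
        key=lambda level: scenario_cost(hourly_usage, level, on_demand_rate, discount),
    )


# Hypothetical lookback data: steady usage with one spike that would skew the model.
usage = [10.0, 10.0, 10.0, 10.0, 100.0]
cleaned = filter_outliers(usage)
recommended = best_commitment(cleaned, [0.0, 10.0, 20.0], on_demand_rate=1.0, discount=0.25)
```

&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Changing the lookback data, the outlier rule, or the candidate levels and re-running the comparison is the kind of adjustment that scenario modeling is meant to make quick.&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;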
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Speaking of FInOps hub, we’re also adding &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;new idle reservation recommendations! &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Compute Engine reservations guarantee your business access to critical Google Cloud Platform compute resources even during periods of peak demand or unexpected events, helping to ensure uninterrupted operations and preventing costly downtime. However, some customers forget to remove these reservations once they don’t need them any longer. FinOps hub's new Idle Reservation recommendation lets you optimize cloud costs and eliminate waste by analyzing usage patterns. For example, it can identify reservations that haven't been utilized for a customizable period (default: 7 days), so you can delete them and reduce unnecessary spending. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_imLaFtz.max-1000x1000.png"
        
          alt="4"&gt;
        
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="q5m3u"&gt;See unused reservation recommendations and review details now within the FinOps Hub!&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;5. Sending actionable alerts, not noise  &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Next, &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/cost-management/finops-news-from-next24"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;we announced&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; the private preview of our Cost Anomaly Detection solution, which continuously monitors your Google Cloud  projects to identify any unexpected cost overruns, at near real-time. Each unexpected cost spike is explained with a granular root-cause, indicating the top drivers down to the SKU. Now, you can easily configure alert preferences for your anomalies, within the billing console. Through an easy, one-time setup, you can &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;configure email or pubsub alerts&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; either for every individual anomaly or opt-in for a daily summary for your &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;desired set of recipients&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. You can also set up a &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;cost impact threshold&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; to ensure that you only receive alerts for anomalies that you consider &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;significant&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.Further, you can influence our smart, AI-driven anomaly detection algorithm by providing feedback with a single click. Lastly, you can download a CSV for anomalies dating back to three months. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Final thoughts&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Through continual product innovation and evolution, we’re constantly striving to solve real-world FinOps problems for our Google Cloud customers. Please try out these new releases, and sign up for our &lt;/span&gt;&lt;a href="https://docs.google.com/forms/d/e/1FAIpQLScoSGrOoY62-KSXkk8T2Xjcxek7JE1OcwY1FGiEP1UCR9iwug/viewform?resourcekey=0-u22hthMuMNaqbpzpaNwRqQ" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;FinOps User Group&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to be a part of our product development efforts. Let us know what you think. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 21 Jun 2024 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/cost-management/cloud-cost-management-enhancements-at-finops-x-2024/</guid><category>Cost Management</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Leveling up FinOps: 5 cost management innovations from FinOps X 2024</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/cost-management/cloud-cost-management-enhancements-at-finops-x-2024/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sarah McMullin</name><title>Head of Cloud FinOps Product</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Pravir Gupta</name><title>VP &amp; GM, Google Cloud Business Platform</title><department></department><company></company></author></item></channel></rss>