<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/"><channel><title>Infrastructure Modernization</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/</link><description>Infrastructure Modernization</description><atom:link href="https://cloudblog.withgoogle.com/blog/products/infrastructure-modernization/rss/" rel="self"></atom:link><language>en</language><lastBuildDate>Thu, 16 Apr 2026 13:15:02 +0000</lastBuildDate><image><url>https://cloud.google.com/blog/products/infrastructure-modernization/static/blog/images/google.a51985becaa6.png</url><title>Infrastructure Modernization</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/</link></image><item><title>Building the agentic future: A spotlight on Google Cloud’s media &amp; entertainment partner ecosystem</title><link>https://cloud.google.com/blog/products/media-entertainment/agentic-media-and-entertainment-is-here-see-how-our-ecosystem-helps-build-it/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As we gather in Las Vegas for &lt;/span&gt;&lt;a href="https://www.nabshow.com/las-vegas/?gad_source=1&amp;amp;gad_campaignid=23481113509" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;NAB Show 2026&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, the industry conversation has shifted. We are no longer asking if AI works; we’re now focused on how it scales. The era of AI experimentation is over — production-grade execution is here. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Google Cloud, we believe no studio or broadcaster should have to build this future in isolation. Our mission is to provide the agentic platform and AI and cloud tools that allow our partners to innovate at the speed of ideas — from the tools used in the edit suite, to the technology that delivers video to millions of viewers worldwide.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhancing production: From manual tasks to intelligent assistants&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Modern creative workflows are often slowed down by manual technical tasks. Google Cloud is working with ecosystem leaders to integrate advanced AI capabilities directly into the core of production software, so creators can focus on their artistry, not tedious tasks. With AI acting as a proactive assistant within the creative suite, production teams can significantly reduce the time between a raw idea and a finished frame.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.avid.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Avid&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; With the launch of Content Core on Google Cloud, Avid is &lt;/span&gt;&lt;a href="https://www.googlecloudpresscorner.com/2026-04-16-Avid-and-Google-Cloud-Announce-Partnership-to-Bring-Agentic-AI-to-Media-Production" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;delivering&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; a truly cloud-native studio. And by integrating multimodal AI search into Media Composer, editors can find the exact frame they need using natural language, turning hours of logging into seconds of discovery.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.backlight.co/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Backlight&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Backlight makes complex media workflows simple for teams of all sizes, from production through monetization. Built on Google Cloud with the Video Intelligence API, Backlight's Iconik platform automatically adds searchable metadata upon upload. Customers see up to 50% shorter production cycles and save up to 60% on storage by deeply understanding their media libraries, reducing duplications, and optimizing asset placement.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.brahma.io/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Brahma.ai&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Brahma AI, an enterprise AI content platform, is powering high-fidelity digital likenesses across retail, entertainment, and healthcare, making them interactive and intelligence-driven within a secure and governed framework.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Unlocking content value: From static archives to active assets&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Data is only as valuable as the insights you can extract from it. Our partners, listed on the &lt;/span&gt;&lt;a href="https://cloud.google.com/marketplace?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Marketplace&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, are using generative media models to transform massive, static archives into searchable, revenue-generating engines. By making every frame discoverable, we’re helping media companies turn decades of history into immediate opportunities.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.ateme.com/contribution-and-video-distribution-software/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Ateme&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Ateme helps simplify global distribution with its new generative AI-powered subtitling solution, which can significantly reduce the manual labor of localizing different media types.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://perfect-memory.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Perfect Memory&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Perfect Memory helps customers turn traditional storage into a context-aware knowledge engine. The platform understands the relationships between athletes, historical events, and emotional nuances — transforming massive media archives into an intelligent library that lets creative teams instantly surface the perfect content for any story.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.vionlabs.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;VionLabs&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Working with companies like Cineverse, Plex, and Crunchyroll, Vionlabs uses AI to analyze and index content libraries — making video assets more accessible and enabling metadata generation. By understanding the specific mood and aesthetic of each scene, Vionlabs helps streaming platforms move beyond basic genre tags toward more nuanced, sentiment-driven content discovery and marketing.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Scaling global reach: From simple streams to audience growth&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To succeed today, media companies must provide a smooth viewing experience and easy payment options. Our ecosystem provides the tools to grow a company’s reach and maximize the value of every subscriber through reliable, high-quality delivery.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.bendingspoons.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Bending Spoons&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; By leveraging the global scale of Google Cloud, Bending Spoons’ properties such as Brightcove and Vimeo are delivering professional-grade tools for large enterprises, SMBs, the next generation of creators, and more. Its platforms ensure that high-quality video production and distribution are accessible to everyone, from global brands to independent storytellers.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://bitmovin.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Bitmovin&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Bitmovin enables streaming services to scale efficiently while delivering a premium experience across the widest range of devices. By combining real-time observability with AI-driven insights, media teams can proactively optimize engagement and monetization. Furthermore, Bitmovin’s advanced encoding ensures superior visual quality at lower bitrates, supporting everything from high-demand Video on Demand (VOD) to massive, 24/7 live events.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://evergent.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Evergent&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Evergent automates complex billing and monetization workflows for AI-powered revenue management. Media and telecommunications companies can use Evergent’s tools to maximize subscription growth and improve long-term customer retention through personalized and agile payment offers.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://www.harmonicinc.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Harmonic&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Harmonic is helping major broadcasters like Grupo Globo modernize their operations. By integrating new digital broadcast capabilities into their cloud-based streaming solutions, Harmonic provides leaders with a faster, more efficient path to manage video processing and delivery at a massive scale.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Ensuring reliability: From infrastructure to a foundation of trust&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;High-quality content requires a high-performance foundation. We are partnering with infrastructure leaders to ensure that even the most complex global broadcasts remain stable, secure, and responsive. &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://zixi.com/" rel="noopener" target="_blank"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Zixi&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; provides the broadcast-grade transport and workflow automation needed to move professional video across any network. By offering centralized control and complete visibility into the delivery process, Zixi ensures that leaders like Fubo can manage high-stakes, broadcast-quality live events without the risk of a signal drop.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Visit the ecosystem in action&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The strength of our ecosystem is its integration across all aspects of the media and entertainment landscape. From the cameras, to the cloud, to the viewers' screens, these partners represent the future of a more creative, efficient, and agentic media industry.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Visit the &lt;/span&gt;&lt;a href="https://google.jifflenow.com/external-request/nab2026/meeting-request?token=af4443b4fe1d6bcc74cf" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Booth&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (West Hall, #W2731) at NAB Show from April 19-22 to see many of these partners in action through live demonstrations and theater sessions.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Thu, 16 Apr 2026 13:15:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/media-entertainment/agentic-media-and-entertainment-is-here-see-how-our-ecosystem-helps-build-it/</guid><category>AI &amp; Machine Learning</category><category>Customers</category><category>Partners</category><category>Application Modernization</category><category>Infrastructure Modernization</category><category>Media &amp; Entertainment</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/agents_go_to_hollywood.max-600x600.png" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Building the agentic future: A spotlight on Google Cloud’s media &amp; entertainment partner ecosystem</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/agents_go_to_hollywood.max-600x600.png</image><site_name>Google</site_name><url>https://cloud.google.com/blog/products/media-entertainment/agentic-media-and-entertainment-is-here-see-how-our-ecosystem-helps-build-it/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Anshul Kapoor</name><title>Global Lead, Telecommunication, Media, Entertainment &amp; Games</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Buzz Hays</name><title>Global Lead, Entertainment Industry Solutions, Google Cloud</title><department></department><company></company></author></item><item><title>Cool stuff Google Cloud customers built, April edition: BMW big on SLMs, MLB’s Scout Insights AI, personalized resort experiences</title><link>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;AI and cloud technology are reshaping every corner of every industry around the world. Without our customers, who are building the future on our platform, there would be no Google Cloud. In this &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-december-2025"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;regular round-up&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, we dive into some of the exciting projects redefining businesses, shaping industries, and creating new categories. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;For our latest edition, we learn why &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;BMW Group&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; is experimenting with small language models (SLMs); catch AI-powered commentary from &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Major League Baseball&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; hit the slopes with &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Vail Resort&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;’s AI concierge; build an intelligent grid with &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;CTC Global&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; witness how &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;ID.me&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; created secure global scale; and see how &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Manhattan Associates&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; supply chain tools now handle 1 billion daily API calls.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Be sure to check back next month to see how more industry leaders and exciting startups are putting Google Cloud technologies to use. And if you haven’t already, please peruse our list of &lt;/span&gt;&lt;a href="https://workspace.google.com/blog/ai-and-machine-learning/how-our-customers-are-using-ai-for-business" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;1,001 real-world gen AI use cases&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; from our customers.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;BMW tests the big potential of small models&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; As one of the world’s leading providers of premium cars and motorcycles, BMW Group is always at the forefront of automotive technology. This ethos pushed the company to test what type of AI language models are ideally suited to driving situations, where access to cloud-based LLMs isn’t always possible.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/topics/manufacturing/how-bmw-is-testing-slms-not-llms-for-in-vehicle-voice-commands"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;BMW Group wanted to explore &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;the potential of small language models (SLMs)&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, which could run within the limited hardware on a vehicle. Finding the right trade-off between size and capability requires careful optimization, and the sheer volume of viable combinations renders manual searches for the optimal configuration an incredibly tedious, if not impossible, undertaking. To overcome this challenge, BMW and Google built &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;automated, reproducible workflows&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; through executable pipelines using &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The path from a general-purpose LLM to a specialized SLM isn’t straightforward. Every choice — from type of quantization to characteristics and contents of the fine-tuning domain-specific dataset — affects the quality and efficiency of the final model. This creates an exponential range of configurations, each with different trade-offs. It’s a great example of &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;using AI to scale an optimization problem for other AI&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “With automated pipelines, we can rapidly adapt models to our domain and rigorously test and evaluate them against domain-specific benchmarks. This allows us to iterate and optimize models in hours rather than days.” &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;– &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Dr. Céline Laurent-Winter&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, vice president, Connected Vehicle Platforms at BMW Group&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;MLB Scout Insights: AI-powered color commentary&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Major League Baseball is famous for its colorful announcers. Now, MLB is bringing more baseball color straight to your pocket, and Gemini is helping give it a voice.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/transform/mlb-scout-insights-ai-powered-color-commentary-gameday-app"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Each season, millions of baseball fans use the MLB app and tap over to the Gameday feature for live, up-to-the-pitch action across more than a dozen games. Starting this season, the league launched MLB Scout Insights in Gameday, which uses Gemini models to quickly scan decades of game and player data, cross-references it with situational game scenarios, and then delivers game-relevant context during key matchups.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Given the sport’s storied history, 162-game regular season, and global reach, baseball fans are among the most sophisticated and passionate out there. To keep them engaged with Gameday and the MLB app, the league wanted to deliver insights that truly felt meaningful and interesting. Building the tool meant answering a rather squishy question: &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;What makes an insight actually insightful&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, not just an accurate fact, and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;how can an AI learn that distinction?&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The answer came from some clever “&lt;/span&gt;&lt;a href="https://en.wikipedia.org/wiki/Information_content" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;surprisal&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;” analysis.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "With Scout Insights, every fan can feel like the smartest person in the stands, at the water cooler, or on the couch. It’s about deepening connections to the game, and sharing that passion with others. That’s the magic of sports, and we’re making more of it possible with the magic of AI." – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Josh Frost&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, senior vice president of product &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Matt Graser&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, director of engineering, Major League Baseball&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Vail Resorts makes personalized AI assistance easy&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Vail Resorts operates some of the most iconic and beloved mountain destinations in the world, including Whistler Blackcomb, Park City Mountain, Stowe, and Crested Butte.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/ai-machine-learning/how-vail-resorts-built-an-ai-assistant-to-automate-personalized-recommendations"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Vail Resorts launched My Epic Assistant during the 2024-2025 snow season, and expanded it this year to add even more AI-powered chat features powered by Google’s powerful &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Gemini models&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. The result is an agentic guide to the slopes that can help skiers and snowboarders decide on the right season pass, share the latest snow report, check on lesson preparations, or suggest a good stop for cocoa. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Vail Resorts wanted &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;more than a chatbot; they wanted a digital concierge&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; that understands the nuance between a powder day at Whistler and a family trip to Beaver Creek. As the company implemented and refined personalization, improved search, summary capabilities, and conversational flow within My Epic Assistant, the app has delivered &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;a 45% reduction in escalation to human agents&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; since launch.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "Utilizing tooling from Google Cloud, we could lean into agentic design patterns that gave us a way to unlock natural, personalized conversations. These boosted customer satisfaction, while reducing the need for manual intent design. These tools also let us combine flexibility and control to enable the assistant to respond fluidly but always within the boundaries of our brand, policies, and product strategy.”&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;— The Vail Resorts technical team&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;CTC Global turns the smart grid into an intelligent one&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; CTC Global is a leading manufacturer of advanced transmission conductors and power lines. While many nodes in the grid contain IoT sensors, it recognized a literal gap in the transmission lines themselves.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/transform/intelligent-grid-ai-powered-smart-transmission-lines-ctc-grid-vista"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; CTC’s new GridVista platform threads fiber-optic cable into its high-strength carbon fiber composite core, and connects these to monitoring technology built with AI and monitoring technology from Google Cloud and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Tapestry&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. With GridVista, CTC can turn every inch of transmission into a smart sensor.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; GridVista gives CTC grid operators an accurate and reliable view of what’s happening across the entire line — based on actual, real-time data from the entire length of the conductor, not point estimates from a static model of line conditions or the occasional clamped-on sensor. This means they can significantly &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;improve safety&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;manage costs&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, increase the line’s capacity to transmit power, and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;enhance reliability&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; with more precise insights about events that might trigger an outage.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “This awareness allows for a grid that can truly sense its own health in real time and provide unprecedented awareness of conditions on the entire line. Whether that’s real time storm impacts, ice load, wind load, branches on the wire, or temperatures on or under the line. The GridVista system truly represents next generation capabilities. ” — &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;J.D. Sitton&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, CEO, CTC Global&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;ID.me reduces risk while scaling past 160 million users&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; ID.me is transforming digital identity security for the modern era, offering a single login that lets you easily prove you’re you across a wide range of platforms and wallets.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/databases/id-me-scales-and-fights-ai-fraud-with-alloydb"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; ID.me currently serves more than 160 million users, including as many as 40,000 at any time, so they can prove their identity online as easily as flashing their driver’s license in person. Over the last two years, ID.me migrated more than 50 terabytes of data across 15 database instances to Google Cloud with minimal downtime. They also introduced a two-tier architecture with &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud SQL&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; supporting its smaller and more standard services, while &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;AlloyDB&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; runs heavier workflows that form the backbone of the ID.me platform.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://cloud.google.com/alloydb/ai?hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AlloyDB AI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; has allowed ID.me to &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;scale its systems to handle 10X-20X&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; of what was possible before — and at a lower&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; price&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; to boot. That responsiveness and reliability led the U.S. federal government to recognize ID.me for its role in preventing large-scale fraud within national systems.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "We’ve been able to scale both our infrastructure and trust. With a platform that’s faster, smarter, and built to handle portable identity at massive scale, we’re one step closer to our goal: a secure, digital way to prove who you are, wherever you need it, that works everywhere you need it." —&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Kevin Liu&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;,&lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Cloud Platform Architect, &lt;/span&gt;&lt;a href="http://id.me" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;ID.me&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Manhattan Associates powers more than a billion daily API calls&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Manhattan Associates is a global leader in supply chain and omnichannel commerce solutions, offering tools and platforms that reach more than 2 billion people across 20 billion consumer touchpoints.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;a href="https://cloud.google.com/blog/products/databases/how-cloud-sql-powers-manhattan-associates-ai-supply-chain-platform"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;Manhattan Associates modernized its Manhattan Active SaaS platform by migrating from legacy Oracle and DB2 systems to Google Cloud databases. Each capability of Manhattan Active now runs as an independent, containerized service orchestrated by &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Kubernetes Engine (GKE)&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. Data flows through &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Pub/Sub&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; into &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;BigQuery&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; for real-time analytics, while &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud Logging&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud Monitoring&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; deliver observability at scale.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; With its new microservices-first design, Manhattan gained the agility to evolve faster and the confidence that &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;mission-critical operations would remain resilient&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; across regions. With Cloud SQL and BigQuery, the company now processes more than a billion daily API calls with &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;average response times of less than 150 milliseconds&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. This evolution supports hundreds of thousands of monthly active users across tens of thousands of stores and distribution centers. The new platform also created the foundation for Manhattan’s Agentic AI suite, which includes prebuilt agents — like the Intelligent Store Manager and Labor Optimizer — that coordinate real-time decisions across store and distribution center operations.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "Operationally, the platform has become more elastic and efficient. The system automatically handles hundreds of thousands of scaling events per day, ensuring performance remains consistent during peak surges without expensive overprovisioning." —&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Narayana Reddy Kothapu&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;,&lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Senior Director, Manhattan Associates &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Rajkumar Ramani&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;,&lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Technical Director, Manhattan Associates&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 15 Apr 2026 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up/</guid><category>Partners</category><category>AI &amp; Machine Learning</category><category>Data Analytics</category><category>Application Modernization</category><category>Infrastructure Modernization</category><category>Customers</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/cool-stuff-hero-april-2026.max-600x600.png" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Cool stuff Google Cloud customers built, April edition: BMW big on SLMs, MLB’s Scout Insights AI, personalized resort experiences</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/cool-stuff-hero-april-2026.max-600x600.png</image><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Google Cloud Content &amp; Editorial </name><title></title><department></department><company></company></author></item><item><title>Cool stuff Google Cloud customers built, Feb. edition: Telco data reinvention; Golden State’s “G.O.A.T.T.”; John Lewis explores DORA</title><link>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-feb-26/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;AI and cloud technology are reshaping every corner of every industry around the world. Without our customers, there would be no Google Cloud, as they are the ones building the future on our platform. In this &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-december-2025"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;regular round-up&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, we dive into some of the exciting projects redefining businesses, shaping industries, and creating new categories. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;For our latest edition, we explore a&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; new data approach for &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Vodafone and Fastweb&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; evaluating &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;John Lewis Partnership&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;’s developer platforms; the &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Golden State Warrior&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;’s AI playbook; healthy, stable networks at &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Hackensack Meridian Health&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; and &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Ab Initio &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;brings better context to data for AI.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Be sure to check back next year to see how more industry leaders and exciting startups are putting Google Cloud technologies to use. And if you haven’t already, please peruse our list of &lt;/span&gt;&lt;a href="https://workspace.google.com/blog/ai-and-machine-learning/how-our-customers-are-using-ai-for-business" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;1,001 real-world gen AI use cases&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; from our customers.&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Fastweb + Vodafone reimagined data workflows&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Following the acquisition of Vodafone Italy by Swisscom in 2025, these leading European telecom providers wanted to rethink how they serve customers and deliver timely, personalized experiences across mobile, broadband, and digital channels.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/databases/how-fastweb-vodafone-reimagined-data-workflows-with-spanner-bigquery"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Both companies had already begun modernizing customer data workflows with &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, but combining ecosystems exposed certain limits of the existing setup. In order to give every channel real-time access to accurate customer data, they implemented &lt;/span&gt;&lt;a href="https://cloud.google.com/spanner"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Spanner&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; as a service and governance layer, delivering low-latency reads, horizontal scalability, high availability, and a fully managed environment with zero ops overhead. The team is also using &lt;/span&gt;&lt;a href="https://gemini.google.com/app" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to generate clear documentation directly from the code, which saves hours of manual work.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Using &lt;/span&gt;&lt;a href="https://cloud.google.com/products/spanner/graph?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Spanner Graph&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; allowed the organization to map lineage in a way that reflects how its platform actually works: which tables drive specific jobs, how transformations cascade, and where dependencies sit. Call centers now see &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;more complete, up-to-date customer information&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, digital channels can rely on &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;consistent data without custom integrations&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, and partners can access what they need with low latency through Apigee.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Rebuilding our Customer 360 platform with Google Cloud services has already changed how Fastweb + Vodafone works. Workflow monitoring is simpler, pipelines are leaner, and real-time serving is now the norm. ” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Vincenzo Forciniti, &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;IT AI Adoption &amp;amp; Platform Engineering Lead, Fastweb + Vodafone&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;John Lewis measures the value of its developer platform&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The John Lewis Partnership is a major UK retailer operating John Lewis department stores and Waitrose supermarkets. To power their digital transformation, they built the John Lewis Digital Platform (JLDP) to support dozens of product teams building high-quality software for &lt;/span&gt;&lt;a href="http://johnlewis.com" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;johnlewis.com&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/application-development/how-john-lewis-partnership-chose-its-monitoring-metrics"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Moving beyond simple usage metrics, John Lewis developed a sophisticated, multi-stage approach to measuring the real value of their platform. They transitioned from initial speed-based metrics (like "Onboarding Lead Time") to a comprehensive model using &lt;/span&gt;&lt;a href="https://dora.dev/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;DORA metrics&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and subjective engineer feedback via the &lt;/span&gt;&lt;a href="https://getdx.com/connectors/google-cloud/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;DX platform&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This included a custom "Technical Health" feature that uses small, automated jobs to monitor more than 35 health measures — such as &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Kubernetes&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; best practices, security, and operational readiness — providing teams with real-time "traffic light" indicators of their service health.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; By focusing on value rather than just activity, John Lewis ensured the platform was actually &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;reducing friction for developers&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; rather than just being a mandatory tool. Their automated Technical Health checks allow product teams to manage technical debt and security vulnerabilities proactively. This approach has decoupled centralized operations teams from individual services, leading to &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;faster incident resolution&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; (MTTR), fewer outages, and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;significant cost savings&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "Measurement is a journey, not a destination. Start by measuring something meaningful to your stakeholders, but be prepared to adapt as your platform evolves. The things that mattered when you were proving out the platform's viability are unlikely to be what are important several years later when your features are mature." – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Alex Moss&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Principal Platform Engineer, John Lewis Partnership&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Hackensack Meridian Health de-risks network migration using VPC Flow Logs&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Hackensack Meridian Health is a leading not-for-profit healthcare organization and the largest hospital system in New Jersey. System reliability is a cornerstone value for HMH as they manage a vast network of hospitals, urgent care centers, and physician practices.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/networking/using-vpc-flow-logs-to-de-risk-network-migration?e=48754805"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Preparing for a large-scale migration to a new Google Cloud network design, Hackensack Meridian Health used &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/vpc/docs/flow-logs"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;VPC Flow Logs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/network-intelligence-center/docs/flow-analyzer/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Flow Analyzer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to eliminate the "black box" of hybrid traffic. By enabling logs on their Cloud Interconnect VLAN attachments, they captured granular telemetry — including source/destination IPs, ports, and protocols. They then exported this data to create a visual "who-is-talking-to-what" map. This allowed them to identify critical traffic patterns between on-premises data centers and specific Google Cloud regions, VPCs, and applications.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; In a healthcare environment, even minor network disruptions can have major consequences. By mapping traffic proactively, Hudson Meridian Health pinpointed exactly which moments in the cutover carried the highest risk. This preparation allowed them to &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;detect a migration issue in just three minutes&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; and resolve it within five — a process that previously could have taken hours. Beyond migration, this level of visibility enables the organization to better&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; manage capacity planning, cost attribution, and security compliance &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;across their hybrid infrastructure.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "Getting a clear picture of our interconnect traffic always felt like a black box. Enabling VPC Flow Logs and feeding it into Flow Analyzer finally gave us the map we needed. Identifying those critical traffic flows before we changed any routes was key to de-risking the entire migration." &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;— &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Randall Brokaw&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Cloud Engineering Manager, Hackensack Meridian Health&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;The Golden State Warriors’ AI-powered back office&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The Golden State Warriors are one of the NBA’s most successful modern franchises. Behind their on-court wins are a specialized operations team who run what might be called organization’s "G.O.A.T.T." (Greatest of All-Time Technologies), a data and AI platform that helps drive game-time insights, trading decisions, and fan experience enhancements.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/transform/golden-state-warriors-ai-powered-back-office-team-digital-dynasty-informed-trades-line-up-changes"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; The Warriors transitioned from a "gut-feeling" culture to an "analytics-first" strategy by building an internal "digital brain" on Google Cloud. Using BigQuery and Gemini, the team now automates complex workflows that previously took hours, such as generating pre-game scouting reports. They use machine learning to run thousands of trade simulations that prioritize "team fit" over raw individual stats and employ computer vision to track the "shot quality" of every attempt in the NBA. On the business side, they built a content recommendation engine using the &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/docs/discovery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Discovery API&lt;/span&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;to deliver personalized digital experiences to their global fan base.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; This AI-driven approach narrows the decision tree for leadership, allowing them to focus human expertise on the most viable options. By automating the “science” of data processing, &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;coaches and scouts have more time&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; for the "art" of face-to-face training, planning, and player development. This integration has not only influenced on-court strategy — like the three-point revolution — but has also &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;improved business efficiency,&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; with employees now proactively bringing AI-driven ideas to the IT team rather than waiting for top-down mandates.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "You can never reach a point where either humans or machines are making all the decisions. The sweet spot is finding that middle ground where intuition and data converge on the same conclusion. Data helps us narrow our decision tree before we even start evaluating specific options." — &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Nick Manning,&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; Senior Director of Consumer Products &amp;amp; Emerging Technology, Golden State Warriors&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;span style="vertical-align: baseline;"&gt;Ab Initio unlocks enterprise data for the agentic AI era&lt;/span&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Ab Initio is an enterprise software company specializing in high-volume data integration and governance. Their platform is trusted by large-scale organizations to manage complex data lifecycles across hybrid and multi-cloud environments.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/data-analytics/unlocking-enterprise-data-to-accelerate-agentic-ai-how-ab-initio-does-it"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; To solve the challenge of grounding AI agents in accurate data, Ab Initio partnered with Google Cloud to integrate its data fabric with BigQuery, &lt;/span&gt;&lt;a href="https://cloud.google.com/dataplex"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Dataplex Universal Catalog&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and Gemini. They launched a suite of more than 500 metadata and data connectors that bridge the gap between legacy systems (like mainframes, COBOL, and SAS) and modern cloud environments. This integration provides field-level, end-to-end lineage, allowing Gemini to access well-documented, "AI-ready" data regardless of where it resides.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; AI agents are only as effective as the data they can access. By using Ab Initio as a "neutral hub," enterprises can federate data from on-premises and multi-cloud sources into a single unified layer without moving the data itself. This provides the rich semantic context and lineage needed for Gemini to perform grounded, explainable reasoning. For businesses, this means &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;faster transition from experimental AI to production-ready agentic workflows&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; that are auditable, compliant, and capable of making complex, automated decisions.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; "Agentic AI requires trusted, AI-ready data and metadata. Understanding the origin, quality, and meaning of information matters as much as the data itself. Gemini serves as a key component of the agentic layer, using this context to make decisions that are explainable and auditable." —&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Scott Studer&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Head of Development, Ab Initio &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Chai Pydimukkala&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Data Governance, Sharing &amp;amp; Integration Product Lead, Google Cloud&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Thu, 26 Feb 2026 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-feb-26/</guid><category>Partners</category><category>AI &amp; Machine Learning</category><category>Data Analytics</category><category>Application Modernization</category><category>Infrastructure Modernization</category><category>Customers</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/feb-cool-stuff-hero-feb.max-600x600.png" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Cool stuff Google Cloud customers built, Feb. edition: Telco data reinvention; Golden State’s “G.O.A.T.T.”; John Lewis explores DORA</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/feb-cool-stuff-hero-feb.max-600x600.png</image><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-feb-26/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Google Cloud Content &amp; Editorial </name><title></title><department></department><company></company></author></item><item><title>Accelerate migrations with new incentives from the Rapid Migration and Modernization Program (RaMP)</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/new-ramp-incentives-for-cloud-migration/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To lead in 2026, you need to be AI-ready, lean, and optimized across every workload — and for many organizations, that means migrating and modernizing applications like SAP, Oracle, NetApp, and VMware to the cloud. At Google Cloud, we’ve helped thousands of customers with successful migrations through the Rapid Migration and Modernization Program (RaMP), and today, we’re introducing the new RaMP, so that the more you migrate, the more you save. Highlights of the new program include:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud Service Credits&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Earn credits based on your incremental usage of eligible workloads on Google Cloud. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Partner and Google Cloud Professional Services funds&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: We will fund partners and Google Cloud Professional Services to help you assess your needs, build your business case and implement your migration and modernization roadmap.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Earn more with advanced workloads&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Earn additional credits and incentives for advanced workloads (like SAP, Oracle, VMware, Data Analytics, etc.) to offset higher technical costs.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;From technical debt to AI readiness&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You deserve a migration and modernization path that leads to less cost and complexity, not more. Moving to Google Cloud provides the infrastructure foundation you need to replace your technical debt with flexibility, so you can reduce cost and complexity. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;But migrating and modernizing your infrastructure is about more than that — it’s about data accessibility, optimization, and innovation. For example, when you migrate a legacy SAP environment or a massive Oracle database to Google Cloud, you aren't just changing where the data sits; you are making that data accessible to Vertex AI and our Gemini models.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;RaMP can accelerate your migration and replace technical debt with a scalable, secure foundation that can support existing enterprise workloads and the next generation of AI applications — whatever they may be.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Building for the future&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Migrating and modernizing with RaMP gives you immediate access to world-class infrastructure, data, and AI solutions, giving you the foundation you need to succeed in 2026 and beyond. To get started, visit &lt;/span&gt;&lt;a href="https://cloud.google.com/solutions/cloud-migration-program"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;our RaMP page&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn more and &lt;/span&gt;&lt;a href="https://cloud.google.com/resources/migration-assessment-offer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;start your assessment&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Get ready to rapidly enter the AI era. Welcome to the fast lane.&lt;/strong&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 21 Jan 2026 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/infrastructure-modernization/new-ramp-incentives-for-cloud-migration/</guid><category>Cloud Migration</category><category>Infrastructure Modernization</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/ramp.max-600x600.jpg" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Accelerate migrations with new incentives from the Rapid Migration and Modernization Program (RaMP)</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/ramp.max-600x600.jpg</image><site_name>Google</site_name><url>https://cloud.google.com/blog/products/infrastructure-modernization/new-ramp-incentives-for-cloud-migration/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Lindsay Myers</name><title>VP Global Practice GTM, Google Cloud</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Hani Shakeel</name><title>Director, GTM Programs</title><department></department><company></company></author></item><item><title>Cool stuff Google Cloud customers built, Dec. edition: AI for better toys, reliable mapping tech, Gemini stumps an all-star &amp; more</title><link>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-december-2025/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;AI and cloud technology are reshaping every corner of every industry around the world. Without our customers, there would be no Google Cloud, as they are the ones building the future on our platform. In this &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-october-2025"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;regular round-up&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, we dive into some of the exciting projects redefining businesses, shaping industries, and creating new categories. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;For our latest edition, we look into how &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Waze&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; made its network more reliable; NBA superstar &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Stephen Curry &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;gets quizzed by Gemini; a financial market transformation at &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;CME Group&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; a multi-agent business forecasting platform from &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;AppOrchid&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Mattel&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; crunches customer feedback with AI; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;VMO2&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; uses decentralized contracts for reliable data; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Mercado Libre&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;’s strategic use of Spanner; and how &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Ericsson&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;enhances data governance.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Be sure to check back next year to see how more industry leaders and exciting startups are putting Google Cloud technologies to use. And if you haven’t already, please peruse our list of &lt;/span&gt;&lt;a href="https://workspace.google.com/blog/ai-and-machine-learning/how-our-customers-are-using-ai-for-business" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;1,001 real-world gen AI use cases&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; from our customers.&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Waze keeps traffic flowing with Memorystore&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Waze (a division of Google parent company Alphabet) is a community-driven, crowd-sourced navigation app with tens of millions of users who share real-time data to provide optimal driving routes, traffic updates, and alerts for hazards, police, and more.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/databases/how-waze-keeps-traffic-flowing-with-memorystore"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Waze depends on vast volumes of dynamic, real-time user session data to power its core navigation features, but scaling that data to support concurrent users worldwide required a new approach. Their team built a centralized Session Server backed by &lt;/span&gt;&lt;a href="https://cloud.google.com/memorystore"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Memorystore for Redis Cluster&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, a fully managed service with 99.99% availability that supports partial updates and easily scales to Waze’s use case of over 1 million MGET commands per second with ~1ms latency.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Moving from Memcached’s 99.9% SLA to Memorystore for &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Redis Cluster’s 99.99% means higher availability and resiliency&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; from the service. And because Memorystore for Redis supports partial updates, Waze can change individual fields within a session object rather than rewriting the entire record. That reduces network traffic, speeds up write performance, and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;makes the system more efficient overall&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Real-time data drives the Waze app experience. Our turn-by-turn guidance, accident rerouting, and driver alerts depend on up-to-the-millisecond accuracy. But keeping that experience seamless for millions of concurrent sessions requires robust and battle hardened infrastructure that is built to manage a massive stream of user session data.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Eden Levin&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Waze BE infrastructure developer &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Yuval Kamran &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Waze site reliability engineer&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;What Stephen Curry learned from a custom Gemini agent&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Stephen Curry is arguably of the greatest three-point shooter of all-time in the NBA — as well as Google’s performance advisor and an all-around stats-obsessive.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/transform/stephen-curry-gemini-agent-career-stats-analyzed"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; For a special engagement with Curry, the Google Cloud team wanted to showcase the power of Gemini for creative thinking, analysis, and data mining. They &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;took every regular season, play-in, and playoff game from Curry’s career (through the end of the 2024-2025 season) and input the data into a custom-built agent using Google Cloud’s &lt;/span&gt;&lt;a href="https://docs.cloud.google.com/agent-builder/agent-development-kit/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Agent Development Kit&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/inference"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini APIs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.The system could then be queried for obscure stats, to see if the team could stump Curry and teach him more about his game.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For example, it found that his three-point shooting percentage after more than seven dribbles, with a minimum 105 attempts was 40.2%, and how many points Curry generated for his teammates off of screens since 2013: 1,105. Instead of countless hours of manual research, the team got &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;query results in less than a minute&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. Some queries were so obscure, the team wouldn’t have reached a valid answer without the ability of the agent to analyze the rich data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Gemini is going to be in my head this year, cause I'm going to be looking at all these details.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Stephen Curry&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Golden State Warriors point guard and 4x NBA champ&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;How CME Group builds a faster, smarter exchange&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; CME Group has evolved from a nineteenth-century commodities exchange into one of the most advanced financial market infrastructures in the world. To support real-time trading and risk management at a global scale, the company launched a strategic partnership with Google Cloud.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/topics/financial-services/how-cme-group-builds-a-faster-smarter-exchange-on-cloud-sql"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; By migrating to &lt;/span&gt;&lt;a href="https://cloud.google.com/sql"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cloud SQL&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and adopting AI-powered insights, CME Group empowered developers, paid down technical debt, and unlocked new opportunities for data-driven innovation across financial markets.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Cloud SQL has given CME a foundation for increased developer and team agility. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Fewer performance issues mean more time focused on innovation&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: expanding CME’s analytics capabilities, accelerating AI initiatives, and exploring new ways to commercialize data responsibly. When teams stopped chasing outages, they unlocked more time to take bigger bets and build the future.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “With Cloud SQL, we’ve found a way to keep our data layer as fast and dependable as the markets we serve. Cloud SQL gives our teams real-time visibility into what’s happening inside the database. When an application slows, we can identify the root cause in minutes instead of hours. Those insights are built into the platform, which means we don’t need custom tooling or manual analysis to keep operations steady.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Kristofer Shane Sikora&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Executive Director, Cloud Data Engineering, CME Group&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;AppOrchid’s multi-agent system for superior business forecasting&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; App Orchid is an enterprise AI builder and a leader in making data actionable with AI, with a mission to make AI a force for good. Their goal is to empower every employee with trusted, understandable, and accessible data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/ai-machine-learning/how-we-built-a-multi-agent-system-for-superior-business-forecasting"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; The business forecasting agent is actually built on the foundation of two powerful, specialized AI agents: a prediction agent built by Google Cloud and App Orchid’s Data Agent offering. These agents work in concert to solve complex business problems, acting as complementary specialists. App Orchid’s agent possesses unparalleled understanding of an enterprise's past and present, while Google’s agent brings world-class capabilities in predicting the future.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Adopting a multi-agent approach provides clear, tangible advantages that directly address the forecasting problems that often plague businesses, including improved accuracy; increased operational efficiency; faster insights; and reduced costs and increased revenue; and greater agility and adaptability. Neither of the underlying agents could achieve these results on their own, but &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;working together, this agent is more than the sum of its subagents&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “As the agentic era gets underway, it is evolving quickly. Our multi-agent approach demonstrates both how true agentic systems are most successful when multiple agents are at play, and the importance of finding strong partners with distinct capabilities to help build and assemble these agentic systems.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Brian Mills&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Director, Enterprise AI, Google Cloud&lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;&amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Taka Shinagawa&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;,&lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Gen AI Field Solution Architect, Google Cloud&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;How Mattel uses AI for real-time product updates&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;span style="vertical-align: baseline;"&gt;Since its humble beginnings in a garage in 1945, Mattel has consistently been reshaping play for children and families across the globe with iconic franchises like Barbie, Hot Wheels, and Fisher-Price.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/transform/mattel-gen-ai-customer-feedback-real-time-product-improvements"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;To improve its understanding of consumer sentiment, Mattel developed an AI-powered feedback classification system, which can analyze millions of customer interactions from a diverse range of sources in a matter of seconds. At its core, the system relies on &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for storing and efficiently processing its massive customer datasets and then utilizes &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://deepmind.google/models/gemini/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google’s multimodal Gemini models&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to refine and train the sophisticated consumer feedback model.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Already, the new AI-powered system has delivered significant wins, delivering &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;a staggering 100x boost in data processing capacity&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; and reducing analysis times from a month to a single minute. By automating the analysis of many processes, analysts are now &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;freed from the noise of everyday tasks&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, enabling them to focus on deeper research across the company’s iconic portfolio brand.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Our big motto is ‘From months to minutes,’ but it’s real. We were literally spending months-worth of analysis and just getting data into the place that an analyst could tally up all the sentiment — and now it’s just at our fingertips.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Shaun Applegate&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Director of Product Quality Analytics, Mattel&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Virgin Media O2 uses data contracts for scalable AI products&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Virgin Media O2 is one of Europe’s largest telecommunications and media providers, with 45.8 million broadband, mobile, phone, and home subscribers across the UK. To build AI products that are adaptable and data-driven, they needed a decentralized system that internal customers could count on for clean, reliable data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/products/data-analytics/vmo2-uses-data-contracts-to-build-scalable-ai-and-data-products?e=48754805"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; New decentralized data contracts, built with &lt;/span&gt;&lt;a href="https://cloud.google.com/dataplex"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Dataplex&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, serve as the data quality and assurance layer for VMO2’s data products; these ensure every dataset they publish is reliable, documented, and ready for consumption. Defined at the asset level, such as individual BigQuery tables or Google Cloud Storage buckets, data contracts are redefining how VMO2 manage and share data, enabling the creation of trusted and scalable AI products across their data mesh.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The power of this approach lies in moving beyond static documentation. Because they are machine-readable, data contracts become &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;living guarantees with continuous enforcement and real-time validation&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; directly within data pipelines. This proactive monitoring allows teams to detect schema changes or SLA breaches early, transforming data quality from a reactive fix into a scalable, automated mechanism.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “By operationalizing trust through data contracts, we are fostering a culture of shared responsibility and data-first thinking. This federated model does more than simply fix pipelines; it builds the trusted foundation needed to scale next-generation AI. It ensures that the resilient AI tools empowering our teams are built on data that is reliable, consistent, and well-defined.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Chandu Bhuman&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Head of Data Strategy, Cloud &amp;amp; Engineering, Virgin Media O2 &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Dženan Softić&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Data &amp;amp; AI Architect, Google Cloud&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Inside Mercado Libre's multi-faceted Spanner architecture&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Mercado Libre, an e-commerce and fintech pioneer across Latin America, operates at a staggering scale, demanding an infrastructure that's not just resilient and scalable, but also a catalyst for rapid innovation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/topics/retail/inside-mercado-libres-multi-faceted-spanner-foundation-for-scale-and-ai"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;At the heart of Mercado Libre's strategy is Fury, an in-house middleware platform designed to abstract away the complexities of various backend technologies, providing developers with standardized, simplified interfaces to build applications. &lt;/span&gt;&lt;a href="https://cloud.google.com/spanner"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Spanner&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; provides Fury with an always-on, globally consistent, multi-model database with virtually unlimited scale. By designating Spanner as a choice within Fury, Mercado Libre ensures that applications built on the platform using Spanner stay consistent globally, scale without breaking, and rarely go down.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The strategic adoption of Spanner, amplified by internal platforms like Fury and sophisticated data workflows, has yielded significant benefits, including: &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;significant cost savings&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; and low total cost of ownership; business impact and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;agility&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;for developers&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;; and &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;low operational overhead&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; thanks to automation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Mercado Libre's adoption of Spanner demonstrates how to use a powerful, globally consistent database not just for its core capabilities, but as a strategic enabler for developer productivity, operational efficiency, advanced analytics, and future AI ambitions.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Pablo Leopoldo Arrojo&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Software Technical Leader, Mercado Libre&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Ericsson achieves data integrity and superior governance with Dataplex&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Who:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Ericsson is one of the world's leading providers of telecommunications and networking technology and solutions. Its Managed Services unit provides network operations and optimization, including field operations, for various telecom and enterprise customers, including outsourcing network performance management, future provisioning, network vulnerability management, and network energy infrastructure management. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/blog/topics/telecommunications/how-ericsson-achieves-data-integrity-and-superior-governance-with-dataplex"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;What they did:&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;To power the future of its autonomous network operations and deliver on its strategic priorities &lt;span style="vertical-align: baseline;"&gt;across &lt;span style="vertical-align: baseline;"&gt;a global network of more than 710,000 sites&lt;/span&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;,&lt;/span&gt; Ericsson's Managed Services has been on a transformative data journey with governance at the center of its strategy. Ericsson moved from foundational practices to a sophisticated, business-enabling data governance framework using &lt;/span&gt;&lt;a href="https://cloud.google.com/dataplex"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;the Dataplex Universal Catalo&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;g — turning data from a simple resource into a strategic asset.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; With Dataplex as the governance foundation, Ericsson began implementing the core pillars of its governance program, moving from manual processes to an automated, intelligent data fabric. More specifically, Ericsson established a unified business vocabulary within Dataplex, which helped eliminate ambiguity and ensure their teams — from data scientists to data analysts — were speaking the same language.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn from us:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; “Governance is a value enabler, not a blocker. A modern data governance program should focus on business enablement first, driving value and innovation in order to complement policies, rules and risk management. Also remember this work is a journey, not a destination. Be prepared to fail fast, learn, and adapt. The landscape is constantly changing at breakneck speed.” – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;William McCann Murphy&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Head of Data Authority, Ericsson &amp;amp; &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Akanksha Bhagwanani&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, EMEA Data Analytics Solution Lead, Google Cloud&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 31 Dec 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-december-2025/</guid><category>Partners</category><category>AI &amp; Machine Learning</category><category>Data Analytics</category><category>Application Modernization</category><category>Infrastructure Modernization</category><category>Customers</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_W93en4g.max-600x600.png" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Cool stuff Google Cloud customers built, Dec. edition: AI for better toys, reliable mapping tech, Gemini stumps an all-star &amp; more</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_W93en4g.max-600x600.png</image><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/customers/cool-stuff-google-cloud-customers-built-monthly-round-up-december-2025/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Google Cloud Content &amp; Editorial </name><title></title><department></department><company></company></author></item><item><title>How CME Group builds a faster, smarter exchange on Cloud SQL</title><link>https://cloud.google.com/blog/topics/financial-services/how-cme-group-builds-a-faster-smarter-exchange-on-cloud-sql/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Editor’s note: &lt;/span&gt;&lt;a href="https://www.cmegroup.com/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;The Chicago Mercantile Exchange (CME Group)&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; has evolved from a nineteenth-century commodities exchange into one of the most advanced financial market infrastructures in the world. To support real-time trading and risk management at a global scale, the company launched a strategic partnership with Google Cloud. By migrating to Cloud SQL and adopting AI-powered insights, CME Group empowered developers, paid down technical debt, and unlocked new opportunities for data-driven innovation across financial markets.&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;From butter and eggs to bandwidth&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;CME Group is where risk meets opportunity. Every transaction that happens in our exchange — every order placed, trade executed, or risk calculated — relies on data moving flawlessly and instantly. The integrity of our markets depends on it.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Behind each of those trades is a database storing valuations, ownership, and so much more information, all of which can shift from millisecond to millisecond throughout the day. At our scale, those databases have to store and retrieve that information under relentless demand. We’re processing millions of messages a day with no margin for latency or error. That level of precision doesn’t come easily, especially in a highly regulated industry where performance has to coexist with security and reporting. Every change we make must align with strict compliance standards and global regulatory frameworks. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Speed has always been our currency, but scale became a challenge. CME Group's legacy database estate required significant engineering effort to maintain performance and meet regulatory demands. We needed to reduce operational overhead while improving our security posture. This required a managed database solution that offered transparent observability and clear compliance controls.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;When Cloud SQL meets the trading floor&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our 10-year &lt;/span&gt;&lt;a href="https://www.cmegroup.com/media-room/press-releases/2021/11/04/cme_group_signs_10-yearpartnershipwithgooglecloudtotransformglob.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;strategic partnership with Google Cloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; aims to address this by migrating all our technology to the cloud, enabling us to innovate and collaborate on pushing the boundaries of what cloud infrastructure can support. Together, we’re experimenting with new ways to achieve ultra-low-latency performance in the cloud. As data volumes surge and AI becomes increasingly central to risk management, the ability to move and interpret information in milliseconds is a technical requirement. We’re building systems with Google Cloud that let us keep the market running, even as we lead it into the future.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With &lt;/span&gt;&lt;a href="https://cloud.google.com/sql?hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Cloud SQL&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, we’ve found a way to keep our data layer as fast and dependable as the markets we serve. Cloud SQL gives our teams real-time visibility into what’s happening inside the database. When an application slows, we can identify the root cause in minutes instead of hours. Those insights are built into the platform, which means we don’t need custom tooling or manual analysis to keep operations steady.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;But for us, the value of Cloud SQL goes beyond performance tuning. It’s about confidence. Our database administrators can focus on strategic improvements, and our developers can validate and optimize queries without waiting for escalation. Taken together, we have faster troubleshooting and a data foundation ready for the always-on demands of global trading.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Build smarter with Google Cloud databases!&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e2707d8e0&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud SQL is our new favorite teammate&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The more we use Cloud SQL, the more it feels like we’ve added a new member to the team. AI-assisted insights from Cloud SQL have changed how the CME Group team works. When an application slows, Cloud SQL tells us why. It surfaces anomalies, walks us through guided analysis, and even suggests query optimizations that restore performance in minutes. Developers can see those recommendations right in their workflows, test fixes, then move on. No waiting, no hand-offs, no firefights.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In other words, AI-assisted troubleshooting has made performance management into a shared responsibility. And because Cloud SQL delivers a consistent experience, our teams can move seamlessly between environments. There’s less training – and a lot more collaboration. The end result is a smarter, more unified data culture at CME Group.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Performance is our competitive advantage&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The work we’re doing with Google Cloud is about more than modernization. Every improvement in speed, reliability, and visibility translates directly into business confidence. CME Group can now deploy new features faster while maintaining the continuity our clients depend on.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Cloud SQL has given us a foundation for that agility. Fewer performance issues mean more time focused on innovation: expanding our analytics capabilities, accelerating AI initiatives, and exploring new ways to commercialize data responsibly. When you stop chasing outages, it turns out you have more time to take bigger bets and build the future.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For us at CME Group, performance has always been the product. Now, it’s also the platform. We’re building the infrastructure with Google Cloud that keeps global markets moving and the intelligence that will shape what comes next.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Learn more:&lt;/strong&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;a href="https://docs.cloud.google.com/sql/docs/mysql/create-free-trial-instance"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Sign up for the new Cloud SQL free trial&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, a dedicated 30-day program designed to give both new and existing Google Cloud users hands-on access to premium, enterprise-grade features of Cloud SQL (PostgreSQL and MySQL).&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Download this &lt;/span&gt;&lt;a href="https://cloud.google.com/resources/idc-business-value-cloud-sql-analyst-report"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;IDC report&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn how migrating to Cloud SQL can lower costs, boost agility, and speed up deployments.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="vertical-align: baseline;"&gt;Learn how &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/databases/ford-reduces-routine-database-management-with-google-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Ford&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/infrastructure-modernization/how-yahoo-calendar-broke-free-from-hardware-queues-and-dba-bottlenecks"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Yahoo&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; gained high performance and cut costs by modernizing with Cloud SQL.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</description><pubDate>Wed, 03 Dec 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/financial-services/how-cme-group-builds-a-faster-smarter-exchange-on-cloud-sql/</guid><category>Databases</category><category>Data Analytics</category><category>Infrastructure Modernization</category><category>Customers</category><category>Financial Services</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/cme-cloud-sql-header.max-600x600.png" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How CME Group builds a faster, smarter exchange on Cloud SQL</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/cme-cloud-sql-header.max-600x600.png</image><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/financial-services/how-cme-group-builds-a-faster-smarter-exchange-on-cloud-sql/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Kristofer Shane Sikora</name><title>Executive Director, Cloud Data Engineering, CME Group</title><department></department><company></company></author></item><item><title>AWS and Google Cloud collaborate to simplify multicloud networking</title><link>https://cloud.google.com/blog/products/networking/aws-and-google-cloud-collaborate-on-multicloud-networking/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As organizations increasingly adopt multicloud architectures, the need for interoperability between cloud service providers has never been greater. Historically, however, connecting these environments has been a challenge, forcing customers to take a complex "do-it-yourself" approach to managing global multi-layered networks at scale.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To address these challenges and advance a more open cloud environment, Amazon Web Services (AWS) and Google Cloud collaborated to transform how cloud service providers could connect with one another in a simplified manner. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Today, AWS and Google Cloud are excited to announce a jointly engineered multicloud networking solution that uses both &lt;/span&gt;&lt;a href="https://aws.amazon.com/interconnect/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AWS Interconnect - multicloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/hybrid-connectivity#multicloud-networking-connectivity"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud’s Cross-Cloud Interconnect&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This collaboration also introduces a new open specification for network interoperability, enabling customers to establish private, high-speed connectivity between Google Cloud and AWS with high levels of automation and speed.&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“Integrating Salesforce Data 360 with the broader IT landscape requires robust, private connectivity. AWS Interconnect - multicloud allows us to establish these critical bridges to Google Cloud with the same ease as deploying internal AWS resources, utilizing pre-built capacity pools and the tools our teams already know and love. This native, streamlined experience — from provisioning through ongoing support — accelerates our customers' ability to ground their AI and analytics in trusted data, regardless of where it resides.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;strong&gt;- Jim Ostrognai, SVP Software Engineering, Salesforce&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Previously, to connect cloud service providers, customers had to manually set up complex networking components including physical connections and equipment; this approach required lengthy lead times and coordinating with multiple internal and external teams. This could take weeks or even months. AWS had a vision for developing this capability as a unified specification that could be adopted by any cloud service provider, and collaborated with Google Cloud to bring it to market.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Now, this new solution reimagines multicloud connectivity by moving away from physical infrastructure management toward a managed, cloud-native experience. By integrating AWS with Google Cloud’s Cross-Cloud Network architecture, we are abstracting the complexity of physical connectivity, network addressing, and routing policies. Customers no longer need to wait weeks for circuit provisioning: they can now provision dedicated bandwidth on demand and establish connectivity in minutes through their preferred cloud console or API. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Reliability and security are the cornerstone of this collaboration. We have collaborated on this solution to deliver high resiliency by leveraging quad-redundancy across physically redundant interconnect facilities and routers. Both providers engage in continuous monitoring to proactively detect and resolve issues. And this solution is built on a foundation of trust, utilizing MACsec encryption between the Google Cloud and AWS edge routers. &lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“This collaboration between AWS and Google Cloud represents a fundamental shift in multicloud connectivity. By defining and publishing a standard that removes the complexity of any physical components for customers, with high availability and security fused into that standard, customers no longer need to worry about any heavy lifting to create their desired connectivity. When they need multicloud connectivity, it's ready to activate in minutes with a simple point and click.”&lt;/span&gt;&lt;strong&gt;&lt;span style="vertical-align: baseline;"&gt; - Robert Kennedy, VP of Network Services, AWS&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“We are excited about this collaboration which enables our customers to move their data and applications between clouds with simplified global connectivity and enhanced operational effectiveness. Today's announcement further delivers on Google Cloud’s Cross-Cloud Network solution focused on delivering an open and unified multicloud experience for customers.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;strong&gt;- Rob Enns, VP/GM of Cloud Networking, Google Cloud&lt;/strong&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This collaboration between AWS and Google Cloud is more than a multicloud solution: it’s a step toward a more open cloud environment. The &lt;/span&gt;&lt;a href="https://github.com/aws/AWSInterconnect" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;API specifications&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; developed for this product are open for other providers and partners to adopt, as we aim to simplify global connectivity for everyone. We invite you to explore this new capability today. To learn more about how to streamline your multicloud operations please visit the in-depth &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/networking/extending-cross-cloud-interconnect-to-aws-and-partners"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Cross-Cloud Interconnect blog&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and the &lt;/span&gt;&lt;a href="https://aws.amazon.com/interconnect/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AWS Interconnect - multicloud website&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to get started.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Sun, 30 Nov 2025 19:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/networking/aws-and-google-cloud-collaborate-on-multicloud-networking/</guid><category>Hybrid &amp; Multicloud</category><category>Infrastructure Modernization</category><category>Partners</category><category>Networking</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>AWS and Google Cloud collaborate to simplify multicloud networking</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/networking/aws-and-google-cloud-collaborate-on-multicloud-networking/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Rob Enns</name><title>VP/GM of Cloud Networking, Google Cloud</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Robert Kennedy</name><title>VP of Network Services, Amazon Web Services</title><department></department><company></company></author></item><item><title>How Lightricks trains video diffusion models at scale with JAX on TPU</title><link>https://cloud.google.com/blog/products/media-entertainment/how-lightricks-trains-video-diffusion-models-at-scale-with-jax-on-tpu/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Training large video diffusion models at scale isn't just computationally expensive — it can become impossible when your framework can't keep pace with your ambitions. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://jax.dev" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;JAX&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; has become a popular computational framework across AI applications, now recognized for its capabilities in training large-scale AI models, such as LLMs and &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/customers/escalante-uses-jax-on-tpus-for-ai-driven-protein-design"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;life sciences models&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;. Its strength lies not just in performance but in an expressive, scalable design that gives innovators the tools to push the boundaries of what's possible. We're consistently inspired by how researchers and engineers leverage JAX's ecosystem to solve unique, domain-specific challenges — including applications for generative media.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Today, we're excited to share the story of &lt;/span&gt;&lt;a href="https://www.lightricks.com/" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Lightricks&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, a company at the forefront of the creator economy. Their &lt;/span&gt;&lt;a href="https://ltx.studio/blog/ltx-2-the-complete-ai-creative-engine-for-video-production" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;LTX-Video&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; team is building high-performance video generation models, and their journey is a masterclass in overcoming technical hurdles. I recently spoke with Yoav HaCohen and Yaki Bitterman, who lead the video and scaling teams, respectively. They shared their experience of hitting a hard scaling wall with their previous framework and how a strategic migration to JAX became the key to unlocking the performance they needed.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Here, Yoav and Yaki tell their story in their own words. – &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Srikanth Kilaru&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, Senior Product Manager, Google ML Frameworks&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;The creator's challenge&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;At Lightricks, our goal has always been to bring advanced creative technology to consumers. With apps like &lt;/span&gt;&lt;a href="https://www.facetuneapp.com/?srsltid=AfmBOoo8ZXXKPBsz1wyL8Rvq9ZtL65N9K51p_yyRjM1DoH6EqZ1oEkLQ" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Facetune&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, we saw the power of putting sophisticated editing tools directly into people's hands. When generative AI emerged, we knew it would fundamentally change content creation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We launched &lt;/span&gt;&lt;a href="https://ltx.studio/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;LTX Studio&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to build generative video tools that truly serve the creative process. Many existing models felt like a "prompt and pray" experience, offering little control and long rendering times that stifled creativity. We needed to build our own models—ones that were not only efficient but also gave creators the controllability they deserve.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our initial success came from training our first real-time video generation model on &lt;/span&gt;&lt;a href="https://cloud.google.com/tpu"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud TPUs &lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;with &lt;/span&gt;&lt;a href="https://docs.pytorch.org/xla/release/r2.8/index.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;PyTorch/XLA&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. But as our ambitions grew, so did the complexity. When we started developing our &lt;/span&gt;&lt;a href="https://www.prnewswire.com/news-releases/lightricks-launches-13b-parameters-ltx-video-model-breakthrough-rendering-approach-generates-high-quality-efficient-ai-video-30x-faster-than-comparable-models-302447660.html#:~:text=LTXV%2D13B%20introduces%20%22multiscale%20rendering,LTX%20Video%20in%20the%20marketplace." rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;13-billion-parameter model&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, we hit a wall.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Hitting the wall and making the switch&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our existing stack wasn’t delivering the training step times and scalability we needed. After exploring optimization options, we decided to shift our approach. We paused development to rewrite our entire training codebase in JAX, and the results were immediate. Switching to JAX felt like a magic trick, instantly providing the necessary runtimes.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This transition enabled us to effectively scale our tokens per sample (the amount of data processed in each training step), model parameters, and chip count. With JAX, sharding strategies (sharding divides large models across multiple chips) that previously failed now work out of the box on both small and large pods (clusters of TPU chips).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;These changes delivered linear scaling that translates to 40% more training steps per day — directly accelerating model development and time to market. Critical issues with FlashAttention and data loading also worked reliably. As a result, our team's productivity skyrocketed, doubling the number of pull requests we could merge in a week.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Why JAX worked: A complete ecosystem for scale&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The success wasn't just about raw speed; it was about the entire &lt;/span&gt;&lt;a href="https://docs.jax.dev/en/latest/index.html#ecosystem" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;JAX stack&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, which provided the building blocks for scalable and efficient research.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;A clear performance target with MaxText:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; We used the open-source &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/maxtext" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;MaxText &lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;framework as a baseline to understand what acceptable performance looked like for a large model on TPUs. This gave us a clear destination and the confidence that our performance goals were achievable on the platform.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;A robust toolset:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; We built our new stack on the core components of the JAX ecosystem based on the MaxText blueprint. We used &lt;/span&gt;&lt;a href="https://flax.readthedocs.io/en/v0.8.3/index.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Flax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for defining our models, &lt;/span&gt;&lt;a href="https://optax.readthedocs.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Optax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for implementing optimizers, and &lt;/span&gt;&lt;a href="https://orbax.readthedocs.io/en/latest/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Orbax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for robust checkpointing — all core components that work together natively.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Productive development and testing:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The transition was remarkably smooth. We implemented unit tests to compare our new JAX implementation with the old one, ensuring correctness every step of the way. A huge productivity win was discovering that we could test our &lt;/span&gt;&lt;a href="https://docs.jax.dev/en/latest/notebooks/Distributed_arrays_and_automatic_parallelization.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;sharding&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; logic on a single, cheap CPU before deploying to a large TPU slice. This allowed for rapid, cost-effective iteration.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Checkpointing reliability:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; For sharded models, JAX’s checkpointing is much more reliable than before, making training safer and more cost-effective.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Compile speed &amp;amp; memory:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; JAX compilation with &lt;/span&gt;&lt;a href="https://docs.jax.dev/en/latest/_autosummary/jax.lax.fori_loop.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;lax.fori_loop&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is fast and uses less memory, freeing capacity for tokens and gradients.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong style="vertical-align: baseline;"&gt;Smooth scaling on a supercomputer:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; With our new JAX codebase, we were able to effectively train on a reservation of thousands of TPU cores. We chose TPUs because Google provides access to what we see as a "&lt;/span&gt;&lt;a href="https://cloud.google.com/solutions/ai-hypercomputer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;supercomputer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;" — a fully integrated system where the &lt;/span&gt;&lt;a href="https://cloud.google.com/tpu/docs/system-architecture-tpu-vm"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;interconnects and networking&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; were designed first, not as an afterthought. We manage these large-scale training jobs with our own custom Python scripts on &lt;/span&gt;&lt;a href="https://cloud.google.com/products/compute"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Compute Engine (GCE)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, giving us direct control over our infrastructure. We also use &lt;/span&gt;&lt;a href="https://cloud.google.com/storage"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Storage&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and stream the training data to the TPU virtual machines.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/JAX-Stack-Lightricks-Architecture.max-1000x1000.png"
        
          alt="JAX-Stack-Lightricks-Architecture"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="9djnu"&gt;Architectural diagram showing the Lightricks stack&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;hr/&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Build your models with the JAX ecosystem&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Lightricks' story is a great example of how JAX's powerful, modular, and scalable design can help teams overcome critical engineering hurdles. Their ability to quickly pivot, rebuild their stack, and achieve massive performance gains is a testament to both their talented team and the tools at their disposal.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The JAX team at Google is committed to supporting innovators like Lightricks and the entire scientific computing community.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Share your story&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Are you using JAX to tackle a challenging scientific problem? We would love to learn how JAX is accelerating your research.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Help guide our roadmap&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Are there new features or capabilities that would unlock your next breakthrough? Your feature requests are essential for guiding the evolution of JAX.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Please reach out to the team via&lt;/span&gt; &lt;a href="https://github.com/google/jax" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GitHub&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to share your work or discuss what you need from JAX. Check out documentation, examples, news, events and more at &lt;/span&gt;&lt;a href="http://jaxstack.ai" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;jaxstack.ai&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="http://jax.dev" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;jax.dev&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Sincere thanks to Yoav, Yaki, and the entire Lightricks team for sharing their insightful journey with us. We're excited to see what they create next.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Tue, 11 Nov 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/media-entertainment/how-lightricks-trains-video-diffusion-models-at-scale-with-jax-on-tpu/</guid><category>AI &amp; Machine Learning</category><category>Infrastructure Modernization</category><category>Customers</category><category>Media &amp; Entertainment</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How Lightricks trains video diffusion models at scale with JAX on TPU</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/media-entertainment/how-lightricks-trains-video-diffusion-models-at-scale-with-jax-on-tpu/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Yaki Bitterman</name><title>Research Team Lead, Model Scaling, Lightricks</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Yoav HaCohen, PhD</name><title>Director of Research, Model Foundations, Lightricks</title><department></department><company></company></author></item><item><title>11 ways to reduce your Google Cloud compute costs today</title><link>https://cloud.google.com/blog/products/compute/cost-saving-strategies-when-migrating-to-google-cloud-compute/</link><description>&lt;div class="block-paragraph"&gt;&lt;p data-block-key="t3t6l"&gt;As the saying goes, "a penny saved is a penny earned," and this couldn't be more true when it comes to cloud infrastructure. In today's competitive business landscape, you need to maintain the performance to meet your business needs. Luckily, Google Cloud’s &lt;a href="https://cloud.google.com/products/compute"&gt;Compute Engine&lt;/a&gt; and block storage services offer numerous opportunities to reduce costs without sacrificing performance, especially in the context of your migration and modernization initiatives.&lt;/p&gt;&lt;p data-block-key="fodf8"&gt;In this article, we'll explore &lt;b&gt;11 key ways&lt;/b&gt; to optimize your infrastructure spending on Google Cloud, from simple adjustments to strategic decisions that can result in significant long-term savings.&lt;/p&gt;&lt;h3 data-block-key="58qqa"&gt;&lt;b&gt;1. Choose the right VM instances&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="bhado"&gt;One of the most effective ways to reduce Compute Engine costs is to ensure that you’ve properly selected and right-sized your virtual machines (VMs) for their workloads to support your migration and modernization efforts. Whether you're new to Google Cloud or already using Compute Engine, adopting the latest-generation VMs — such as &lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines#n4_series"&gt;N4&lt;/a&gt;, &lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines#c4_series"&gt;C4&lt;/a&gt;, &lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines#c4d_series"&gt;C4D&lt;/a&gt;, and &lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines#c4a_series"&gt;C4A&lt;/a&gt; — can deliver substantial savings and improved price-performance.&lt;/p&gt;&lt;p data-block-key="8rrjo"&gt;Powered by Google Cloud’s &lt;a href="https://cloud.google.com/titanium?e=48754805&amp;amp;hl=en"&gt;Titanium&lt;/a&gt; architecture, our latest-generation VMs offer faster CPUs, higher memory bandwidth, and more efficient virtualization than their predecessors, so you can handle the same workloads with fewer resources. For existing customers, migrating from older VM generations to the newest VMs can significantly lower total costs while helping you exceed current performance levels. Organizations that have made the switch often report 20–40% better performance along with meaningful reductions in cloud compute spend. For example, &lt;a href="https://www.elastic.co/blog/elasticsearch-runs-faster-google-axion-processors" target="_blank"&gt;Elastic&lt;/a&gt; leveraged the general-purpose C4A machine series based on &lt;a href="https://cloud.google.com/blog/products/compute/introducing-googles-new-arm-based-cpu?e=48754805"&gt;Google Cloud's Arm-based Axion CPUs&lt;/a&gt;, to achieve a compelling efficiency and performance uplift for their workloads.&lt;/p&gt;&lt;p data-block-key="febac"&gt;Beyond &lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines"&gt;general-purpose VMs&lt;/a&gt;, we also offer specialized machine types to address unique customer requirements. Compute-optimized HPC VMs like &lt;a href="https://cloud.google.com/blog/products/compute/new-h4d-vms-optimized-for-hpc?e=48754805"&gt;H4D&lt;/a&gt; are designed for high-performance computing and data analytics, offering extreme performance for demanding workloads. &lt;a href="https://cloud.google.com/compute/docs/memory-optimized-machines#m4_series"&gt;M4&lt;/a&gt; and &lt;a href="https://cloud.google.com/compute/docs/memory-optimized-machines#x4_series"&gt;X4&lt;/a&gt; instances cater to memory-intensive applications, while &lt;a href="https://cloud.google.com/compute/docs/storage-optimized-machines#z3_series"&gt;Z3&lt;/a&gt; instances are ideal for storage-intensive workloads. Furthermore, if you need complete control over your hardware environment and maximum performance isolation, we offer &lt;a href="https://cloud.google.com/compute/docs/instances/bare-metal-instances#:~:text=Bare%20metal%20instances%20provide%20direct,same%20way%20as%20VM%20instances."&gt;bare metal instances&lt;/a&gt;.&lt;/p&gt;&lt;p data-block-key="eqplt"&gt;These options help ensure that even the most specialized and performance-sensitive workloads can find an optimal and cost-effective home within the Compute Engine portfolio.&lt;/p&gt;&lt;h3 data-block-key="5i8hm"&gt;&lt;b&gt;2. Optimize your block storage selections&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="bsaae"&gt;The best way to lower your block storage TCO, while ensuring your workloads remain successful, is to drive high resource efficiency. &lt;a href="https://cloud.google.com/compute/docs/disks/hyperdisks"&gt;Hyperdisk&lt;/a&gt; makes it simple to drive high performance and high efficiency by enabling you to optimize your block storage to your workload and through Storage Pools. We’ll discuss each of these capabilities, and how you can use them to lower your block storage TCO below.&lt;/p&gt;&lt;p data-block-key="6kmjp"&gt;Workload Optimized: With Hyperdisk, you can independently tune capacity and performance to match your block storage resources to your workload. Hyperdisk enables you to independently provision performance and capacity at the volume level. You can leverage this capability to purchase just the capacity and performance you need, no more and no less. You can also take advantage of Hyperdisk Balanced’s “baseline” performance (i.e. included free with every volume), you can serve the vast majority of your VMs without purchasing any extra performance.&lt;/p&gt;&lt;p data-block-key="at87k"&gt;Storage Pools: Hyperdisk is the only hyperscale cloud block storage to offer thin-provisioned performance and capacity. With Hyperdisk Storage Pools, you can provision the aggregate performance and capacity your workload requires, while still provisioning the volume level capacity performance your workloads request (also known as &lt;a href="https://en.wikipedia.org/wiki/Thin_provisioning" target="_blank"&gt;thin-provisioning&lt;/a&gt;). This allows you to pay for the resources you need, not the sum of the volumes you’ve provisioned. As a result, you can &lt;a href="https://cloud.google.com/blog/products/storage-data-transfer/hyperdisk-storage-pools-is-now-generally-available#:~:text=Infrastructure%20Manager%2C%20REWE-,Get%20started,use%20and%20manage%20your%20pools."&gt;lower your overall block storage TCO by as much as 50%.&lt;/a&gt;&lt;/p&gt;&lt;p data-block-key="929m4"&gt;For more information on how to select the right block storage for your workload and to see how customers have benefitted from Hyperdisk, read this &lt;a href="https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case?e=48754805"&gt;blog&lt;/a&gt;.&lt;/p&gt;&lt;h3 data-block-key="e9sir"&gt;&lt;b&gt;3. Consider custom compute classes&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="616a2"&gt;To get the most out of our latest-generation VMs, Google Kubernetes Engine (GKE) &lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes"&gt;&lt;b&gt;custom compute classes&lt;/b&gt;&lt;/a&gt; (CCC) offer an advanced way to optimize compute choices and provide high availability. Instead of being limited to a single machine type for your workloads, you can define a prioritized list of VM instance types. This allows you to set the newest, most price-performant VMs — including our latest-generation VMs — as your top priority. GKE custom compute classes provide the capability to automatically and seamlessly spin up instances based on your specified priority list. This feature helps you maximize the availability of your compute capacity while still aiming for the most cost-effective options, so your workloads can scale reliably without manual intervention.&lt;/p&gt;&lt;p data-block-key="b9b8c"&gt;Here are some specific use cases for how custom compute classes can help you optimize costs:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="b747a"&gt;&lt;b&gt;Autoscaling cost-performant fallbacks:&lt;/b&gt; When demand peaks, you might be tempted to autoscale using a highly available but less cost-efficient VM type. CCC allows you to take a tiered approach. You can set up several cost-efficient fallback alternatives, so that as demand increases, GKE first attempts to use the most cost-effective options, and progressively moves to the other choices in your list when necessary to meet demand.&lt;/li&gt;&lt;li data-block-key="smd6"&gt;&lt;b&gt;AI/ML inference:&lt;/b&gt; Running AI/ML inference workloads often involves significant compute resources. Instead of maintaining a large, static reservation that might sit idle during off-peak times, CCC lets you provision a minimal base reservation and leverage more cost-effective capacity types, such as &lt;a href="https://docs.google.com/document/d/1KLJ97-xgtX9pDaodkMsXN18xJOYiBFZUSqkdYb--_44/edit?tab=t.0" target="_blank"&gt;Spot VMs&lt;/a&gt;, to handle peak inference demand — all orchestrated through your CCC configuration.&lt;/li&gt;&lt;li data-block-key="4pm95"&gt;&lt;b&gt;Adopting new VM generations:&lt;/b&gt; Combine the power of GKE custom compute classes with &lt;a href="https://cloud.google.com/compute/docs/instances/committed-use-discounts-overview#spend_based"&gt;Compute Flexible committed use discounts&lt;/a&gt; (Flex CUDs) to de-risk the adoption of new, cost-efficient VM series like N4 and C4. With CCC, you can define fallback options, providing workload resilience, while Flex CUDs offer financial adaptability, as the discounts apply across your total eligible compute spend, regardless of the specific VM series you use. This dual approach is a safe, cost-effective strategy for leveraging the latest hardware without disruption. For more information, read this &lt;a href="https://cloud.google.com/blog/products/compute/adopt-new-vm-series-with-gke-compute-classes-flexible-cuds/?e=48754805"&gt;blog&lt;/a&gt;.&lt;/li&gt;&lt;li data-block-key="5fieq"&gt;&lt;b&gt;Using flexible Spot VMs:&lt;/b&gt; Spot VMs offer significant savings but can be preempted. Being constrained to a single Spot VM shape increases the risk that capacity will not be available. With CCC, you can define multiple fallback Spot VM types. This "spot surfing" capability allows the application to remain on cost-efficient Spot capacity by automatically pivoting to alternative Spot instance types if the primary choice is unavailable.&lt;/li&gt;&lt;/ul&gt;&lt;p data-block-key="9pj6j"&gt;In short, by leveraging GKE CCC, you can artfully mix and match various VM types and consumption models, including On-Demand, Spot, DWS FlexStart, and instances covered by CUDs, to build a resilient and highly cost-optimized infrastructure that adapts to the unique needs and patterns of your workloads.&lt;/p&gt;&lt;h3 data-block-key="4f7sf"&gt;&lt;b&gt;4. Leverage custom machine types (CMT)&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="44on9"&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/creating-instance-with-custom-machine-type"&gt;Custom machine types&lt;/a&gt;, available on N4 VMs, allow you to precisely configure virtual machines to your exact specifications. Rather than selecting from predefined machine types that might include excess capacity, you can tailor the CPU-to-memory ratio specifically for your workloads, so you only pay for resources you actually use. This targeted approach minimizes waste and can significantly reduce your cloud spend, especially when migrating from on-premises to Google Cloud or from other cloud providers.&lt;/p&gt;&lt;p data-block-key="ditbj"&gt;This flexibility becomes particularly valuable if your applications have unique resource profiles that don't align well with our standard offerings. Custom machine types let you create the perfect environment for your needs. By avoiding the compromise of over-provisioning certain resources while potentially constraining others, you can achieve both better performance and more efficient spending across your Compute Engine deployment.&lt;/p&gt;&lt;p data-block-key="16cdr"&gt;As an example, take a memory-intensive workload that runs best with 16 vCPU, and 70 GB memory. Normally, you would need to pick a VM with 128 GB memory with our standard shapes, or in other cloud contexts, resulting in higher costs to run your workload due to the extra provisioned resources. Instead, with custom machine types, you can easily launch a VM with 16 vCPU and 70 GB memory, resulting in an 18% cost savings vs standard N4-highmem-16 VMs.&lt;/p&gt;&lt;h3 data-block-key="ei6g2"&gt;&lt;b&gt;5. Make the most of committed use discounts&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="3vd4p"&gt;CUDs are a strategic cost-saving opportunity for organizations with steady, predictable computing needs. By committing to resource usage over one- or three-year periods, you can reduce cloud costs by up to 70% compared to on-demand pricing. This approach not only helps ensure budget predictability but also converts fixed infrastructure spending into a financial advantage, making it ideal for stable workloads that support core business functions.&lt;/p&gt;&lt;p data-block-key="bnjpk"&gt;Google Cloud offers flexible CUD structures to align with various operational models. Resource-based commitments target specific machine types and regions, flexible commitments apply discounts across projects, regions, and machine series — great for dynamic environments. By analyzing historical usage and forecasting future needs, you can identify workloads suited for these discounts, reinvesting the savings into innovation and scaling initiatives.&lt;/p&gt;&lt;h3 data-block-key="einu0"&gt;&lt;b&gt;6. Manage unused disk space&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="5n36i"&gt;You pay for the total provisioned disk space, regardless of how much you actually use. Many organizations tend to over-provision storage "just in case," which often leads to unnecessary and costly waste. For instance, if you provision a 100GB disk but only use 20GB, you're still paying for the entire 100GB. Being intentional and precise with your storage allocations — rather than rounding up to common sizes — can lead to significant cost savings.&lt;/p&gt;&lt;p data-block-key="d58ss"&gt;To optimize spending, it's important to adopt a few best practices. Using &lt;a href="https://cloud.google.com/stackdriver/docs/solutions/agents/ops-agent"&gt;Ops Agent&lt;/a&gt;, regularly audit disk usage across your infrastructure to identify and eliminate inefficiencies. Resize disks to align with actual consumption, allowing a reasonable buffer for growth. Implement automated alerts in &lt;a href="https://cloud.google.com/monitoring?e=48754805&amp;amp;hl=en"&gt;Cloud Monitoring&lt;/a&gt; to detect underutilized disks and take corrective action. For stateless applications, consider using smaller boot disk images to minimize overhead and reduce costs even further.&lt;/p&gt;&lt;p data-block-key="4hu1t"&gt;In addition, consider the following optimization strategies to further reduce costs and improve efficiency:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="as6ik"&gt;Use Google Cloud’s monitoring tools to track CPU, memory, and disk usage over time.&lt;/li&gt;&lt;li data-block-key="3uj3j"&gt;Establish a regular review cycle to identify and right-size over-provisioned resources.&lt;/li&gt;&lt;li data-block-key="7jobf"&gt;Test workloads across different VM configurations to find the optimal balance between cost and performance.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="a49bh"&gt;&lt;b&gt;7. Use Spot VMs&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="52fp7"&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/spot"&gt;Spot VMs&lt;/a&gt; provide the same machine types and configuration​​ options as standard virtual machines but at a significantly reduced cost — typically offering a 60% to 91% discount. This cost efficiency comes with the tradeoff of potential preemption at short notice, making them most suitable for workloads that are fault-tolerant and can recover quickly from unexpected interruptions. Spot VMs are designed to take advantage of unused compute capacity, allowing you to optimize your cloud spending without compromising access to high-performance resources.&lt;/p&gt;&lt;p data-block-key="8abo0"&gt;Strong use cases for Spot VMs include batch processing jobs, big data and analytics workloads, continuous integration and deployment (CI/CD) pipelines, stateless web servers running in autoscaling groups, and compute-heavy tasks. When properly architected to handle interruptions — for example, by using job checkpointing, load balancing, task queues, or via GKE custom compute classes (see more above) — &lt;a href="https://cloud.google.com/solutions/spot-vms?e=48754805&amp;amp;hl=en"&gt;Spot VMs&lt;/a&gt; can play a critical role in minimizing infrastructure costs while maintaining high availability and system resilience. Leveraging Spot VMs in these scenarios lets you scale cost-effectively, especially when compute demand is variable or time-flexible.&lt;/p&gt;&lt;h3 data-block-key="9not2"&gt;&lt;b&gt;8. Use optimization recommendations&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="bjf0l"&gt;Google Cloud's &lt;a href="https://cloud.google.com/recommender/docs/recommenders"&gt;Recommenders&lt;/a&gt; are a powerful tool designed to help you optimize your cloud resources efficiently. When browsing the Google Cloud console, you may see lightbulb icons next to specific resources — these indicate potential improvements identified by Google's recommendation engine. By analyzing real-time usage patterns and current resource configurations, the &lt;a href="https://cloud.google.com/recommender/docs/key-concepts#recommenders"&gt;Recommender&lt;/a&gt; delivers actionable insights tailored to each user's unique environment. This intelligent system highlights opportunities not only to reduce costs but also to enhance security, performance, reliability, management efficiency, and environmental sustainability.&lt;/p&gt;&lt;p data-block-key="91nm7"&gt;For example, there are &lt;a href="https://cloud.google.com/compute/docs/instances/idle-vm-recommendations-overview"&gt;idle VM recommendations&lt;/a&gt; to help you identify VM instances that have not been used over the last 1 to 14 days. Common recommendations include switching to more suitable machine types, rightsizing underutilized compute instances, or adopting more cost-effective storage solutions. The tool allows you to apply many of these changes directly, streamlining the optimization process. By continuously evaluating workloads and offering these automated, data-driven suggestions, the Recommendation Hub helps organizations maintain cloud performance while managing costs more effectively.&lt;/p&gt;&lt;h3 data-block-key="35ft8"&gt;&lt;b&gt;9. Take advantage of auto-scaling and scheduling&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="5i72v"&gt;Matching your compute resources to actual demand patterns is one of the most effective ways to reduce cloud waste and improve overall cost efficiency. Many organizations over-provision their resources to handle peak workloads, leaving machines underutilized during off-peak periods. By aligning compute capacity more closely with real-time or predictable usage patterns, such as business hours or seasonal trends, you can significantly cut unnecessary spending without sacrificing performance.&lt;/p&gt;&lt;p data-block-key="c7ge7"&gt;&lt;a href="https://cloud.google.com/compute/docs/autoscaler"&gt;Autoscaling&lt;/a&gt; is the key to achieving this efficiency. In fact, customers who leverage Google Compute Engine autoscaling for their virtual machines have seen average infrastructure cost savings of more than 40%.&lt;/p&gt;&lt;p data-block-key="70opn"&gt;You can implement autoscaling strategies to dynamically adjust resources based on CPU utilization, load balancing capacity, or custom application metrics, so that workloads receive the necessary compute power when needed, while scaling down automatically during low-demand periods.&lt;/p&gt;&lt;p data-block-key="cj1lq"&gt;For workloads with predictable patterns, such as those that fluctuate with business hours or planned seasonal events, &lt;a href="https://cloud.google.com/compute/docs/autoscaler/scaling-schedules"&gt;schedule-based scaling&lt;/a&gt; is a particularly powerful tool. This approach allows you to proactively increase resources in anticipation of high demand and scale them down during lulls, for the performance you need without constant over-provisioning.&lt;/p&gt;&lt;p data-block-key="1i6kf"&gt;In addition to autoscaling, several practical implementation techniques can further optimize your resource usage. &lt;a href="https://cloud.google.com/scheduler/docs/start-and-stop-compute-engine-instances-on-a-schedule"&gt;Setting up instance scheduling&lt;/a&gt; lets you automatically start and stop development and test environments according to business hours — a simple yet highly effective approach that can lead to cost savings of up to 70%. You can also leverage maintenance windows to reduce disruptions and resource consumption, by concentrating updates and system changes into low-usage periods. Together, these tactics help maintain high availability and performance while keeping infrastructure costs under control.&lt;/p&gt;&lt;h3 data-block-key="evivu"&gt;&lt;b&gt;10. Understand your spend with detailed billing analysis&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="34tqj"&gt;Before implementing any cost-saving strategies in Google Cloud, it’s essential to &lt;a href="https://cloud.google.com/billing/docs/concepts"&gt;understand your current spending in detail&lt;/a&gt;. Google Cloud’s billing panel offers granular visibility into your expenses, including costs broken down by individual SKUs. This level of transparency lets you track where your money is going and identify potential inefficiencies. Begin by regularly reviewing your billing dashboard to monitor usage trends and spot anomalies. Applying labels and tags to your resources can further help categorize and attribute costs accurately, especially in complex environments with multiple projects or departments.&lt;/p&gt;&lt;p data-block-key="een0q"&gt;In addition, &lt;a href="https://cloud.google.com/billing/docs/how-to/budgets"&gt;setting up budget alerts&lt;/a&gt; is a practical way to stay ahead of overspending by notifying you when costs approach or exceed predefined thresholds. It’s also important to identify and eliminate unused or idle resources, such as virtual machines or persistent disks that are no longer in active use — these can often be shut down or deleted to immediately reduce costs. By thoroughly analyzing your cost structure, you can uncover “low-hanging fruit” — resources that provide little or no value — and make data-driven decisions to optimize your cloud usage efficiently.&lt;/p&gt;&lt;h3 data-block-key="9jfbk"&gt;&lt;b&gt;11. Consider serverless alternatives&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="e7aev"&gt;Last but not least, Google Cloud's &lt;a href="https://cloud.google.com/discover/what-is-serverless-computing?e=48754805&amp;amp;hl=en"&gt;serverless computing&lt;/a&gt; offerings provide a compelling alternative to traditional virtual machines, can deliver better cost efficiency, simplified operations, and greater scalability. By abstracting away infrastructure management, serverless platforms allow teams to focus on writing and deploying code without worrying about provisioning, scaling, or maintaining servers. This shift can not only reduce operational overhead but also cut costs by aligning compute spending directly with application usage.&lt;/p&gt;&lt;p data-block-key="4c30g"&gt;There are multiple serverless options available, each tailored to different workloads. &lt;a href="https://cloud.google.com/run?e=48754805&amp;amp;hl=en"&gt;Cloud Run&lt;/a&gt; is designed for running containerized applications that need rapid scaling and flexible deployment. &lt;a href="https://cloud.google.com/run/docs/write-event-driven-functions"&gt;Cloud Run Functions&lt;/a&gt; supports lightweight, event-driven code execution for microservices or automation tasks. &lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/autopilot-overview"&gt;GKE (Autopilot Mode)&lt;/a&gt; simplifies Kubernetes operations by automatically managing nodes and scaling, allowing you to run Kubernetes workloads without handling the underlying infrastructure. All these options charge based on usage not allocation, significantly reducing costs associated with idle resources and over-provisioning. This makes them especially beneficial for variable or unpredictable workloads. Cloud Run and GKE both support GPU’s and flexibility to move between the two. You can start with &lt;a href="https://www.youtube.com/watch?v=nGFXKTz2jZM&amp;amp;t=2s&amp;amp;pp=ygUabW92ZSBmcm9tIGNsb3VkIHJ1biB0byBHS0U%3D" target="_blank"&gt;Cloud Run then move to GKE&lt;/a&gt; or &lt;a href="https://www.youtube.com/watch?v=x12EOsVt2oU&amp;amp;t=1s&amp;amp;pp=ygUabW92ZSBmcm9tIGNsb3VkIHJ1biB0byBHS0U%3D" target="_blank"&gt;vice-versa&lt;/a&gt;. Some customers also leverage both offerings for workloads. The rule of thumb is to start with GKE if you need access to the Kubernetes API. Otherwise, start with Cloud Run.&lt;/p&gt;&lt;h2 data-block-key="6n8fn"&gt;&lt;b&gt;Start reducing your costs today&lt;/b&gt;&lt;/h2&gt;&lt;p data-block-key="buet9"&gt;Migrate to Google Cloud and optimize your infrastructure costs without compromising on what your workloads need. If you are new to Google Cloud, start with &lt;a href="http://g.co/cloud/assess" target="_blank"&gt;a migration assessment&lt;/a&gt;. Google Cloud’s &lt;a href="https://cloud.google.com/migration-center/docs"&gt;Migration Center&lt;/a&gt; can help you with a clear understanding of your potential savings by migrating to Google Cloud, with detailed recommended paths for your workloads, along with TCO reports. Apply the strategies in this article and unlock substantial cost savings.&lt;/p&gt;&lt;/div&gt;</description><pubDate>Mon, 06 Oct 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/compute/cost-saving-strategies-when-migrating-to-google-cloud-compute/</guid><category>Infrastructure Modernization</category><category>Storage &amp; Data Transfer</category><category>Serverless</category><category>Compute</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>11 ways to reduce your Google Cloud compute costs today</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/compute/cost-saving-strategies-when-migrating-to-google-cloud-compute/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Alex Bestavros</name><title>Group Product Manager</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sai Gopalan</name><title>Product Management, Google Cloud</title><department></department><company></company></author></item><item><title>How Baseten achieves 225% better cost-performance for AI inference (and you can too)</title><link>https://cloud.google.com/blog/products/ai-machine-learning/how-baseten-achieves-better-cost-performance-for-ai-inference/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Baseten is one of a growing number of AI infrastructure providers, helping other startups run their models and experiments at speed and scale. Given the importance of those two factors to its customers, Baseten has just passed a significant milestone. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By leveraging the latest Google Cloud A4 virtual machines (VMs) based on NVIDIA Blackwell, and Google Cloud’s Dynamic Workload Scheduler (‘DWS’) Baseten has achieved 225% better cost-performance for high-throughput inference and 25% better cost-performance for latency-sensitive inference. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Why it matters&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: This breakthrough in performance and efficiency enables companies to move powerful &lt;/span&gt;&lt;a href="https://cloud.google.com/discover/what-is-agentic-ai"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;agentic AI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and reasoning models out of the lab and into production affordably. For technical leaders, this provides a &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/ai-machine-learning/real-world-gen-ai-use-cases-with-technical-blueprints"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;blueprint&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for building &lt;/span&gt;&lt;a href="https://cloud.google.com/transform/101-real-world-generative-ai-use-cases-from-industry-leaders"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;next-generation AI products&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; — such as real-time voice AI, search, and agentic workflows — at a scale and cost-efficiency that has been previously unattainable.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;The big picture&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: &lt;/span&gt;&lt;a href="https://cloud.google.com/discover/what-is-ai-inference?e=48754805#:~:text=AI%20inference%20is%20the%20execution,highly%20optimized%20and%20scalable%20infrastructure."&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Inference&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is the cornerstone of enterprise AI. As models for multi-step reasoning and decision-making demand exponentially greater compute, the challenge of serving them efficiently has become the primary bottleneck. Enter Baseten, a six-year-old Series C company that &lt;/span&gt;&lt;a href="https://www.baseten.co/blog/baseten-partners-with-google-cloud-to-deliver-high-performance-ai-infra/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;partners&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with Google Cloud and NVIDIA to provide enterprise companies a scalable inference platform for their proprietary models as well as open models like &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai/generative-ai/docs/open-models/use-gemma"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemma&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, DeepSeek ,and Llama, with an emphasis on performance and cost efficiency. Their success hinges on a dual strategy: maximizing the potential of cutting-edge hardware and orchestrating it with a highly optimized, open software stack.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We wanted to share more about how Baseten architected its stack — and what this new level of cost-efficiency can unlock for your inference applications.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;$300 in free credit to try Google Cloud AI and ML&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e26dcc3a0&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Hardware optimization with the latest NVIDIA GPUs&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Baseten delivers production-grade inference by leveraging a wide range of NVIDIA GPUs on Google Cloud, from NVIDIA T4s through the recent A4 VMs (NVIDIA HGX B200). This access to the latest hardware is critical for achieving new levels of performance.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;With A4 VMs, Baseten now serves three of the most popular open-source models — DeepSeek V3, DeepSeek R1, and Llama 4 Maverick — directly on their Model APIs with over 225% better cost-performance for high throughput inference, and 25% better cost-performance for latency- sensitive inference.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;In addition to its production-ready model APIs, Baseten provides additional flexibility with NVIDIA B200-powered dedicated deployments for customers seeking to run their own custom AI models with the same reliability and efficiency.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Advanced software for peak performance&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Baseten’s approach is rooted in coupling the latest accelerated hardware with leading and open-source software to extract the most value possible from every chip. This integration is made possible with Google Cloud’s &lt;/span&gt;&lt;a href="https://cloud.google.com/solutions/ai-hypercomputer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI Hypercomputer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, which includes a broad suite of advanced inference frameworks, including NVIDIA’s open-source software stack — NVIDIA Dynamo and TensorRT-LLM — as well as SGLang and vLLM.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Using TensorRT-LLM, Baseten optimizes and compiles custom LLMs for one of its largest AI customers, &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/databases/writer-wins-generative-ai-success-with-google-cloud-databases/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Writer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This has boosted their throughput by more than 60% for Writer’s Palmyra LLMs. The flexibility of TensorRT-LLM also enabled Baseten to develop a custom model builder that speeds up model compilation.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;To serve reasoning models like DeepSeek R1 and Llama 4 on NVIDIA Blackwell GPUs, Baseten uses NVIDIA Dynamo. The combination of NVIDIA’s HGX B200 and Dynamo dramatically lowered latency and increased throughput, propelling Baseten to the top GPU performance spot on OpenRouter’s LLM ranking leaderboard.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;The team leverages techniques such as kernel fusion, memory hierarchy optimization, and custom attention kernels to increase tokens per second, reduce time to first token, and support longer context windows and larger batch sizes — all while maintaining low latency and high throughput.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Building a backbone for high availability and redundancy&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For mission-critical AI services, resilience is non-negotiable. Baseten runs globally across multiple clouds and regions, requiring an infrastructure that can handle ad hoc demand and outages. Flexible consumption models, such as the &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Dynamic Workload Scheduler&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; within the AI Hypercomputer, help Baseten manage capacity similar to on-demand with additional price benefits. This allows them to scale up on Google Cloud if there are outages across other clouds.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;"Baseten runs globally across multi-clouds and Dynamic Workload Scheduler has saved us more than once when we encounter a failure,” said &lt;/span&gt;&lt;strong style="font-style: italic; vertical-align: baseline;"&gt;Colin McGrath&lt;/strong&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, head of infrastructure at Baseten.  “Our automated system moves affected workloads to other resources including Google Cloud Dynamic Workload scheduler and within minutes, everyone is up and running again. It is impressive — by the time we’re paged and check-in, everything is back and healthy. This is amazing and would not be possible without DWS. It has been the backbone for us to run our business.”&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--medium
      
      
        h-c-grid__col
        
        h-c-grid__col--4 h-c-grid__col--offset-4
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_YjWWGkP.max-1000x1000.png"
        
          alt="image1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="qd5tr"&gt;Baseten’s scalable inference platform architecture&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;&lt;strong style="vertical-align: baseline;"&gt;Unlocking new AI applications for end-users&lt;/strong&gt;&lt;/h2&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Baseten's collaboration with Google Cloud and NVIDIA demonstrates how a powerful combination of cutting-edge hardware and flexible, scalable cloud infrastructure can solve the most pressing challenges in AI inference through Google Cloud’s AI Hypercomputer. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This unique combination enables end-users across industries to bring new applications to market, such as powering agentic workflows in financial services, generating real-time audio and video content in media, and accelerating document processing in healthcare. And it’s all happening at a scale and cost that was previously unattainable.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;You can easily get started with Baseten's platform through the &lt;/span&gt;&lt;a href="https://pantheon.corp.google.com/marketplace/product/baseten-public/baseten?project=baseten-public&amp;amp;pli=1" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Marketplace&lt;/span&gt;&lt;/a&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;, or read more about their technical architecture &lt;/span&gt;&lt;a href="https://www.baseten.co/blog/accelerating-inference-nvidia-b200-gpus/?utm_term=nvidia%20b200&amp;amp;utm_campaign=Search+-+Self-Hosted&amp;amp;utm_source=adwords&amp;amp;utm_medium=ppc&amp;amp;hsa_acc=9990356727&amp;amp;hsa_cam=21607833837&amp;amp;hsa_grp=179204207220&amp;amp;hsa_ad=747940377782&amp;amp;hsa_src=g&amp;amp;hsa_tgt=kwd-2321080405420&amp;amp;hsa_kw=nvidia%20b200&amp;amp;hsa_mt=p&amp;amp;hsa_net=adwords&amp;amp;hsa_ver=3&amp;amp;gad_source=1&amp;amp;gad_campaignid=21607833837&amp;amp;gbraid=0AAAAAqCKh1vk1KRnJYRTFUBfqNlDv-N4W&amp;amp;gclid=CjwKCAjwravBBhBjEiwAIr30VA1EhznOvQhLwNS13cdOeKLRicBeZJLEZFc9sN2ZzXqzvL-frMzJMBoC#performance-boosts-using-b200s" rel="noopener" target="_blank"&gt;&lt;span style="font-style: italic; text-decoration: underline; vertical-align: baseline;"&gt;in their own post.&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Thu, 04 Sep 2025 17:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/ai-machine-learning/how-baseten-achieves-better-cost-performance-for-ai-inference/</guid><category>Infrastructure Modernization</category><category>Customers</category><category>Partners</category><category>AI &amp; Machine Learning</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How Baseten achieves 225% better cost-performance for AI inference (and you can too)</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/ai-machine-learning/how-baseten-achieves-better-cost-performance-for-ai-inference/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Philip Kiely</name><title>Head of Developer Relations, Baseten</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Chelsie Czop</name><title>Sr. Product Manager, AI Infrastructure, Google Cloud</title><department></department><company></company></author></item><item><title>From clicks to clusters: Expanding Confidential Computing with Intel TDX</title><link>https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Privacy-protecting Confidential Computing has come a long way since we introduced Confidential Virtual Machines (VMs) five years ago. The technology, which can protect data while in use, strengthens a security gap beyond data encryption at rest and in transit. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Since then, customers have used Confidential Computing to &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/securing-medical-device-software-with-google-confidential-vm?utm_medium=email&amp;amp;_hsmi=211594676&amp;amp;_hsenc=p2ANqtz-8RFEdL02azftGFen13A8grVu3HSDunpb3ryQVxVID_mUBNWwKzFAWEjoS3QFpbrnWpdHvOxVSCJwlQ0KljdXhh9QzBVw&amp;amp;utm_content=211594676&amp;amp;utm_source=hs_email"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;protect patient medical data&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://services.google.com/fh/files/misc/google_amd_zonar_case_study.pdf" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;comply with privacy guidance&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; of GDPR and Schrems II for U.S.-Europe data transfers, and run &lt;/span&gt;&lt;a href="https://yellowdog.ai/news_article/yellowdog-amd-cvms-with-google/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;high-performance computing (HPC) &lt;/span&gt;&lt;/a&gt;&lt;a href="http://google.com/url?q=https://yellowdog.co/2022/05/25/yellowdog-amd-cvms-with-google/&amp;amp;sa=D&amp;amp;source=editors&amp;amp;ust=1753151129521660&amp;amp;usg=AOvVaw3LtBSPGHf9Gkshlc5SKIdB" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;workloads securely&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;By isolating workloads in hardware-based Trusted Execution Environments (TEEs), Confidential Computing empowers customers to process their most sensitive information in the public cloud with assurance. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As part of the advancements we’ve made with Confidential Computing, we added even more security capabilities with the introduction of &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/new-confidential-computing-updates-for-more-hardware-security-options/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential VMs with Intel Trust Domain Extensions&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (TDX) last year. Intel TDX creates an isolated trust domain (TD) in a VM, uses hardware extensions for managing and encrypting memory to protect cloud workloads, and offers hardware-based &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/attestation"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;remote attestation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for verification. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Today, we are excited to highlight our greatly expanded, and generally available, Intel TDX-based offerings, which includes &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential GKE Nodes&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-space/docs/confidential-space-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Space&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/create-a-confidential-vm-instance-with-gpu"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential GPU&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/supported-configurations#supported-zones"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;more regions and zones&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; where customers can use Confidential Computing.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Click to create a Confidential VM&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/cloud-console"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Console&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; now offers Google Compute Engine (GCE) customers a new interface for Intel TDX — no code changes required. To get started, follow these steps:&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Start at the GCE &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Create an instance&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; page&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Go to the &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Security&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; tab and under Confidential VM service, click &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Enable&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Then select Intel TDX from the dropdown menu and click &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Confirm&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;It’s that simple to create a Confidential VM.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/image1_6YVpf1C.gif"
        
          alt="image1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="tzsle"&gt;Create a new Confidential VM with Intel TDX in the Google Cloud console.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Get Confidential Computing in more regions and zones&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Confidential VMs with Intel TDX were first available with support for three regions (and nine zones.) To accommodate growing demand, we’ve expanded support for Intel TDX on the C3 machine series to 10 regions (and 21 zones,) and we are planning more for the future. The full list is &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/supported-configurations#intel-tdx_1"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;available here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. As regional availability and scalability are critical, your account team is available to help you plan early to ensure your capacity needs are met. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Confidential GKE Nodes with Intel TDX, now generally available&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential GKE Nodes&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; are built on top of Confidential VM and deliver hardware-based protections to your Google Kubernetes Engine (GKE) clusters and node pools to ensure that your containerized workloads remain encrypted in memory. Today, Confidential GKE Nodes are generally available with Intel TDX on GKE Standard and GKE Autopilot. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Confidential GKE Nodes with Intel TDX on the C3 machine series can be created on &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/choose-cluster-mode"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GKE Standard&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; via CLI, API, UI, and Terraform. The confidential setting can be set at the cluster level or the node pool level with no code changes. You can learn more &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Confidential GKE Nodes with Intel TDX on the C3 machine series can also be created on &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/choose-cluster-mode"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GKE Autopilot&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. It can be enabled through the use of &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;custom compute classes&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. In GKE, a &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;compute class&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; is a profile that consists of a set of node attributes that GKE uses to provision the nodes that run your workloads during autoscaling events. Check out our &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;documentation to get started&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;$300 in free credit to try Google Cloud security products&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2dfccc4d60&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Confidential Space with Intel TDX, now generally available&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Also built on Confidential VM, our &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-space/docs/confidential-space-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Space&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; offering is a robust solution for many common issues including addressing insider threats, enabling joint machine-learning training and private gen AI inference, and fostering multi-party collaboration on sensitive data. Here are just a few examples of what our customers have built with Confidential Space:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Confidential matching enabled customers to &lt;/span&gt;&lt;a href="https://blog.google/products/ads-commerce/google-confidential-matching-data-privacy/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;securely connect their first-party data&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for Google Ads measurement and audience solutions.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Symphony demonstrated with its Confidential Cloud how SaaS companies can &lt;/span&gt;&lt;a href="https://www.youtube.com/watch?v=oxjCeghgGtY" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;guarantee isolation of customer data&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; from privileged insiders in the highly regulated financial industry.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Duality delivered &lt;/span&gt;&lt;a href="https://services.google.com/fh/files/misc/duality_case_study_v3.pdf" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;privacy-preserving federated learning&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; solutions for a broad range of use cases in healthcare, financial services, and the public sector.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;Flare spearheaded innovation in&lt;/span&gt;&lt;a href="https://flare.network/news/flare-hackathon-winners" rel="noopener" target="_blank"&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;verifiable AI on blockchain&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Previously, Confidential Space was only available with &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/confidential-vm-overview#amd_sev"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AMD-based technology&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and hardware (on the &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;N2D, C2D, C3D, and C4D machine series), but now it is also available &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;with &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-space/docs/confidential-space-overview#requirements"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Intel-based technology&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and hardware&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. This is ideal for those wanting &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/innovate-with-confidential-computing-attestation-live-migration-on-google-cloud?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;attestation guarantees&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with a hardware root of trust and for those focused on Intel’s &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/general-purpose-machines#c3_series"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;C3 machine series&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Additionally, Confidential Space with Intel TDX is measured into &lt;/span&gt;&lt;a href="https://www.intel.com/content/www/us/en/developer/articles/community/runtime-integrity-measure-and-attest-trust-domain.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;runtime measurement registers&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (RTMR) and the measurements are verified by &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/docs/attestation"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Attestation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. Note that for Confidential VMs with Intel TDX, RTMRs are now populated as well. Confidential Space benefits are highlighted in the NCC Group’s latest &lt;/span&gt;&lt;a href="https://www.nccgroup.com/us/research-blog/public-report-google-confidential-space-security-assessment/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;independent security evaluation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Confidential VM and Confidential GKE Nodes with NVIDIA H100 GPUs, now generally available&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;If you’re looking for performance and security while protecting data in use, Confidential VM and Confidential GKE Nodes with &lt;/span&gt;&lt;a href="https://www.nvidia.com/en-us/data-center/h100/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;NVIDIA H100 GPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; on the accelerator-optimized &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-high-vms"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;A3 machine series&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; are now generally available. These offerings deliver Google Cloud’s first Confidential GPUs, focus on ease of use to meet the demand for secure computing, and extend security to data-intensive, AI and ML workloads by having Intel TDX enabled on the CPU and &lt;/span&gt;&lt;a href="https://www.nvidia.com/en-us/data-center/solutions/confidential-computing/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;NVIDIA Confidential Computing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; enabled on the GPU. You now have the ability to secure your data performantly during inference and training across models.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/create-a-confidential-vm-instance-with-gpu"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential VM with NVIDIA H100 GPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is available with the &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/accelerator-optimized-machines#a3-high-vms"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;a3-highgpu-1g&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; machine type and in &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/supported-configurations#nvidia-confidential-computing_1"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;three zones&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;: europe-west4-c, us-central1-a, and us-east5-a. No code changes are needed for most AI and ML workloads. For pricing details, see &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/pricing"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/gpus-confidential-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential GKE Nodes with NVIDIA H100 GPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; are generally available on both GKE Standard and &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/autopilot-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GKE Autopilot&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (through &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;custom compute class&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;). To get started, click &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/gpus-confidential-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;And, we also have &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-space/docs/confidential-space-overview#requirements"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Space&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with NVIDIA H100 GPUs &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-space/docs/deploy-workloads#gpu-based-workloads"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;in preview&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Intel has a free tier for independent attestation&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Intel’s attestation verifier service, &lt;/span&gt;&lt;a href="https://www.intel.com/content/www/us/en/security/trust-authority.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Intel Tiber Trust Authority&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, now has a free tier. Google Cloud Confidential VMs and Confidential Space are both integrated with Intel Tiber Trust Authority as a third party attestation service, but now Intel Tiber Trust Authority is making secure attestation more accessible for all by offering a free tier (with optional paid support). &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;When Confidential VM and &lt;/span&gt;&lt;a href="https://community.intel.com/t5/Blogs/Products-and-Solutions/Security/Intel-Tiber-Trust-Authority-Integrates-with-Google-Cloud/post/1691578" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Space customers use Intel Tiber Trust Authority&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, they can gain stronger separation of duties security guarantees. Click &lt;/span&gt;&lt;a href="https://docs.trustauthority.intel.com/main/articles/articles/ita/introduction.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn more. &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;What our customers say&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;"Thanks to the joint efforts of Super Protocol, Google Cloud, and NVIDIA, the world now gains a new layer of possibility — unlocking Confidential AI without cloud borders. With A3 Confidential VMs built on NVIDIA H100 GPUs now integrated into Super’s decentralized infrastructure and marketplace, companies can securely run, monetize, and collaborate on sensitive AI and data — across any environment. This enables seamless collaboration between Google Cloud customers and partners in other clouds — with no need for shared trust, manual agreements, or compromise. For the broader market, A3 instances at scale accelerate global access, while Super ensures confidentiality, verifiability, and self-sovereignty — fully automated and requiring no expertise in confidential computing. We are excited to open this next chapter of Confidential AI, built to work wherever you and your partners are," said Nukri Basharuli, founder and CEO, &lt;/span&gt;&lt;a href="https://superprotocol.com/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Super Protocol&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;“We’re proud to have partnered with Google Cloud to validate their Confidential Computing-enabled GPU solution — a major step forward in securing sensitive data for AI and machine learning workloads, without compromising on performance or scalability. Confidential Computing allows organizations to process sensitive workloads in the cloud while protecting sensitive data and models from both the cloud provider and the organization's insiders and internal threats. However, for gen AI and agentic AI use cases, protecting the CPU alone isn’t enough — both CPU and GPU must also run in confidential mode with mutual trust. With Google Cloud’s new offering, Anjuna can now launch Confidential Containers that leverage Intel TDX and NVIDIA H100 GPUs in confidential mode. This ensures that data, configurations, secrets, and code remain protected end-to-end from any untrusted entity, bringing state-of-the-art security for sensitive data.” said Steve Van Lare, CTO, &lt;/span&gt;&lt;a href="https://www.anjuna.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Anjuna Security&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;“With data processing worldwide growing up to three times faster than ever before and doubling every six months, the future of cloud computing must be built on trust. In collaboration with Google, Modelyo leverages Confidential VMs on the A3 machine series with NVIDIA H100 GPUs, transforming Confidential Computing into a seamless, intuitive, and fully integrated cloud experience. This enables us to deliver end-to-end managed solutions across interconnected environments, empowering organizations to innovate confidently knowing their data remains effortlessly protected at every stage.” said Benny Meir, CEO, &lt;/span&gt;&lt;a href="https://modelyo.com/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Modelyo&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;How to get started with Confidential Computing&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To add that extra layer of protection and privacy to your sensitive workloads, check out our documentation for &lt;/span&gt;&lt;a href="https://cloud.google.com/confidential-computing/confidential-vm/docs/confidential-vm-overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential VMs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/kubernetes-engine/docs/how-to/confidential-gke-nodes"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential GKE Nodes&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; today.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 29 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/</guid><category>Containers &amp; Kubernetes</category><category>Compute</category><category>Infrastructure Modernization</category><category>Security &amp; Identity</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>From clicks to clusters: Expanding Confidential Computing with Intel TDX</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Joanna Young</name><title>Senior Product Manager, Google</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sam Lugani</name><title>Product Lead, Confidential Computing, Google Cloud</title><department></department><company></company></author></item><item><title>Run Gemini anywhere, including on-premises, with Google Distributed Cloud</title><link>https://cloud.google.com/blog/topics/hybrid-cloud/gemini-is-now-available-anywhere/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Earlier this year, we announced our commitment to bring &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/ai-machine-learning/run-gemini-and-ai-on-prem-with-google-distributed-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini to on-premises environments&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with &lt;/span&gt;&lt;a href="https://cloud.google.com/distributed-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Distributed Cloud (GDC&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;). Today, we are excited to announce that Gemini on GDC is now available to customers.&lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;For years, enterprises and governments with the strictest data security and sovereignty requirements have faced a difficult choice: adopt modern AI or protect their data. Today, that compromise ends. We are announcing the general availability of Gemini on GDC air-gapped and preview of Gemini on GDC connected, bringing Google's most advanced models directly into your data center. &lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;We are inspired by initial feedback from customers, including Singapore’s Centre for Strategic Infocomm Technologies (CSIT), Government Technology Agency of Singapore (GovTech Singapore), Home Team Science and Technology Agency (HTX), KDDI, and Liquid C2, who are excited to gain the advantages of generative AI with Gemini on GDC. &lt;/span&gt;&lt;/p&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Transformative AI capabilities, on-premises&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Gemini models offer groundbreaking capabilities, from processing extensive context to native multimodal understanding of text, images, audio, and video. This unlocks a wide array of high-impact use cases on secure infrastructure:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Unlock new markets and global collaboration: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Instantly break down language barriers across your international operations, creating a more connected and efficient global workforce.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Accelerate decision-making: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Make faster, data-driven decisions by using AI to automatically summarize documents, analyze sentiment, and extract insights from your proprietary datasets.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Improve employee efficiency and customer satisfaction: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Deliver instant, 24/7 support and enhance user satisfaction by developing intelligent chatbots and virtual assistants for customers and employees.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Increase development velocity: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Ship higher-quality software faster by using Gemini for automated code generation, intelligent code completion, and proactive bug detection.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Strengthen safety &amp;amp; compliance&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Protect your users with AI-powered safety tools that automatically filter harmful content and ensure adherence to industry policies.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e29576d30&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Secure AI infrastructure where you need it&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;It takes more than just a model to drive business value with generative AI; you need a complete platform that includes scalable AI infrastructure, a library with the latest foundational models, high-performance inferencing services, and pre-built AI agents like Agentspace search. GDC provides all that and more with an end-to-end AI stack combining our latest-generation AI infrastructure with the power of Gemini models to accelerate and enhance all your AI workloads.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Gemini_on_GDC_GA_Launch_-_August_2025.max-1000x1000.jpg"
        
          alt="Gemini on GDC (GA) Launch - August 2025"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="text-align: justify;"&gt;&lt;span style="vertical-align: baseline;"&gt;Delivering these transformative capabilities securely requires a complete, end-to-end platform that only Google is providing today :&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Performance at scale&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: GDC utilizes the latest NVIDIA GPU accelerators, including the NVIDIA Hopper and Blackwell GPUs. A fully managed Gemini endpoint is available within a customer or partner data center, featuring a seamless, zero-touch update experience. High performance and availability are maintained through automatic load balancing and auto-scaling of the Gemini endpoint, which is handled by our L7 load balancer and advanced fleet management capabilities.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Foundation of security and control: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Security is a core component of our solution, with audit logging and access control capabilities that provide full transparency for customers. This allows them to monitor all data traffic in and out of their on-premises AI environment and meet strict compliance requirements. The platform also features Confidential Computing support for both CPUs (with Intel TDX) and GPUs (with NVIDIA's confidential computing) to secure sensitive data and prevent tampering or exfiltration.&lt;/span&gt;&lt;/li&gt;
&lt;li role="presentation" style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Flexibility and speed for your AI strategy:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; the platform supports a variety of industry-leading models including Gemini 2.5 Flash and Pro, Vertex AI task-specific models (translation, optical character recognition, speech-to-text, and embeddings generation), and Google’s open-source Gemma models. GDC also provides managed VM shapes (A3 &amp;amp; A4 VMs) and Kubernetes clusters giving customers the ability to deploy any open-source or custom AI model, and custom AI workloads of their choice. This is complemented by Vertex AI services that provide an end-to-end AI platform including a managed serving engine, data connectors, and pre-built agents like Agentspace search (in preview) for a unified search experience across on-premises data.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;What our customers are saying&lt;/strong&gt;&lt;/h3&gt;
&lt;p style="text-align: justify; padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“As a key GDC collaboration partner in shaping the GDC air-gapped product roadmap and validating the deployment solutions, we’re delighted that this pioneering role has helped us grow our cutting-edge capabilities and establish a proven deployment blueprint that will benefit other agencies with similar requirements. This is only possible with the deep, strategic collaboration between CSIT and Google Cloud. We’re also excited about the availability of Gemini on GDC, and we look forward to building on our partnership to develop and deploy agentic AI applications for our national security mission.”  &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;- Loh Chee Kin, Deputy Chief Executive, Centre for Strategic Infocomm Technologies (CSIT)&lt;/strong&gt;&lt;/p&gt;
&lt;p style="text-align: justify; padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“One of our priorities is to harness the potential of AI while ensuring that our systems and the services citizens and businesses rely on remain secure. Google Cloud has demonstrated a strong commitment to supporting the public sector with initiatives that enable the agile and responsible adoption of AI. We look forward to working more closely with Google Cloud to deliver technology for the public good.” - &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Goh Wei Boon, Chief Executive, Government Technology Agency of Singapore&lt;/strong&gt;&lt;/p&gt;
&lt;p style="text-align: justify; padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“The ability to deploy Gemini on Google Distributed Cloud will allow us to bridge the gap between our on-premises data and the latest advancements in AI. Google Distributed Cloud gives us a secure, managed platform to innovate with AI, without compromising our strict data residency and compliance requirements.&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;”&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; - Ang Chee Wee, Chief AI Officer, Home Team Science &amp;amp; Technology Agency (HTX)&lt;/strong&gt;&lt;/p&gt;
&lt;p style="text-align: justify; padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“The partnership with Google Cloud and the integration of Google's leading Gemini models will bring cutting-edge AI capabilities, meet specific performance requirements, address data locality and regulatory needs of Japanese businesses and consumers.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; - &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Toru Maruta, Executive Officer, Head of Advancing Business Platform Division, KDDI&lt;/strong&gt;&lt;/p&gt;
&lt;p style="text-align: justify; padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;"Data security and sovereignty are paramount for our customers. With Gemini on Google Distributed Cloud, our Liquid Cloud and Cyber Security solution would deliver strategic value to ensure our customers in highly regulated industries can harness the power of AI while keeping their most valuable data under their control." &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;- Oswald Jumira, CEO Liquid C2&lt;/strong&gt;&lt;/p&gt;
&lt;h3 style="text-align: justify;"&gt;&lt;strong style="vertical-align: baseline;"&gt;Gemini everywhere is here&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The era of on-premises AI without compromise is here. To bring the power of Gemini to your on-premises environment, &lt;/span&gt;&lt;a href="https://cloud.google.com/contact?hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;request a strategy session with our experts&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Thu, 28 Aug 2025 05:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/topics/hybrid-cloud/gemini-is-now-available-anywhere/</guid><category>AI &amp; Machine Learning</category><category>Infrastructure Modernization</category><category>Hybrid &amp; Multicloud</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Run Gemini anywhere, including on-premises, with Google Distributed Cloud</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/topics/hybrid-cloud/gemini-is-now-available-anywhere/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Vithal Shirodkar</name><title>VP/GM, Google Distributed Cloud</title><department></department><company></company></author></item><item><title>An efficient path to production AI: Kakao’s journey with JAX and Cloud TPUs</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/kakaos-journey-with-jax-and-cloud-tpus/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;When your messaging platform serves 49 million people – 93% of South Korea’s population – every technical decision carries enormous weight. The engineering team at &lt;/span&gt;&lt;a href="https://www.kakaocorp.com/page/service/service/KakaoTalk?lang=en" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Kakao&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; faced exactly this challenge when their existing infrastructure hit critical limitations. Their solution? A strategic shift to &lt;/span&gt;&lt;a href="https://cloud.google.com/tpu"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud TPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; using the &lt;/span&gt;&lt;a href="http://jax.dev" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;JAX&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; framework that not only solved their immediate scalability needs but opened new possibilities for advanced AI model development.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Kakao’s approach provides a compelling example of leveraging the high-performance array computing framework JAX for AI model development at scale. While their primary training environment was GPU-based, the team made a strategic decision to adopt the JAX stack on Google Cloud TPUs to optimize for cost and efficiency.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This work laid the groundwork for the development of their proprietary Kanana model family, and several Kanana models — including Kanana-MoE — have recently been released as open source on &lt;/span&gt;&lt;a href="https://kko.kakao.com/kananallm" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hugging Face Hub&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In this post, Minho Ryu and Nayeon Kim detail Kakao’s technical journey. They cover their specific implementation details, from adapting the JAX large language model framework and &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/maxtext" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;MaxText&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for custom data pipelines to their work on mixture-of-experts (MoE) model training.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e2431f250&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Kakao’s journey by Minho and Nayeon:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As engineers at Kakao, we develop models that serve &lt;/span&gt;&lt;a href="https://www.kakaocorp.com/page/service/service/KakaoTalk?lang=en" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;KakaoTalk&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, a platform supporting services that extend far beyond text. Our rich ecosystem includes chat with over 700,000 images and stickers (emojis), voice and video calls, finance, and navigation. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_H9JmMnM.max-1000x1000.png"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;KakaoTalk’s massive scale and complexity demand that our language models are not only highly efficient but also excel at understanding the Korean language and are flexible enough for diverse applications. These real-world product requirements directly influenced our technical decisions and our need for a customizable training framework.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our journey with JAX began at an important inflection point. Our existing GPU-based infrastructure was reaching power and budget capacity constraints. We had two options: expand our GPU infrastructure and maintain our existing codebase, or adopt Cloud TPUs, which offered cost-performance advantages while requiring adoption of a new toolchain. We chose Cloud TPUs, viewing the short-term investment as worthwhile for long-term cost-performance benefits, and built our stack on JAX. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We use &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/xpk" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;XPK&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for Kubernetes cluster management, which simplifies job creation and management on GKE without requiring Kubernetes expertise. For the data pipeline, we adopted &lt;/span&gt;&lt;a href="https://google-grain.readthedocs.io/en/latest/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Grain&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; due to its deterministic behavior, which is essential for the stability of long-running AI model training jobs.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--medium
      
      
        h-c-grid__col
        
        h-c-grid__col--4 h-c-grid__col--offset-4
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image4_5MDva7c.max-1000x1000.png"
        
          alt="2"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We focused on adapting the MaxText framework to fit our specific research and compatibility needs. We made two key customizations to the pipeline:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;1. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Multi-source data blending:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; When we began exploring training with MaxText, it assumed a single, pre-mixed corpus. Our research requires blending different data sources — such as web text, code, and math — with specific, dynamically-adjusted weights during different training phases. To achieve this flexibility without reprocessing terabytes of data for each experiment, we implemented a solution using &lt;/span&gt;&lt;a href="https://google-grain.readthedocs.io/en/latest/tutorials/dataset_advanced_tutorial.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Grain's mix&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; function. This approach allows us to define blending ratios in our configuration, providing the adaptability essential for our iterative research process. We filed a &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/maxtext/pull/1801" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;PR for this feature&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to be supported in MaxText natively, and it has been incorporated &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/maxtext/blob/main/MaxText/input_pipeline/_grain_data_processing.py#L55-L63" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; since. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_lmZuUrp.max-1000x1000.png"
        
          alt="3"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p role="presentation"&gt;&lt;span style="vertical-align: baseline;"&gt;2. &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Token Processing for Efficiency and Compatibility:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; To maintain compatibility with our existing &lt;/span&gt;&lt;a href="https://github.com/NVIDIA/Megatron-LM" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Megatron-LM&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; pipeline and improve efficiency, we modified MaxText's token processing logic. Our data preparation method constructs each training sequence by appending the first token of the subsequent sequence. This creates overlapping, continuous sequences, ensuring that no information is lost at the boundaries and maximizing data utilization.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To validate our new TPU-based workflow, we trained two models. First, we trained the Kanana 2.1 billion parameter model from scratch, and the results demonstrated that our MaxText implementation achieved performance comparable to our existing GPU-based Megatron-LM pipeline at each stage. Second, we performed depth upscaling with continued pre-training from our existing 8B model to a 9.8B architecture. Both approaches succeeded  and showed consistent improvements across various benchmarks, confirming that the results on GPU were effectively reproduced on TPU.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Advancing our approach: Training Mixture-of-Experts (MoE) models with MaxText&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With the core pipeline validated, we began experimenting with more advanced architectures, specifically MoE models, to build inference-efficient models that maintain strong performance. Our objectives were to explore upcycling an existing dense model into an MoE structure and to evaluate the suitability of the TPU and MaxText stack for this task.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For the experiment, we upcycled our 2.1B dense model into a 13.4B parameter (2.3B active) MoE architecture with 64 experts and 8 active experts per token. We trained this model on the exact same dataset as the original dense model to isolate the impact of the architectural change. The training was performed on &lt;/span&gt;&lt;a href="https://cloud.google.com/tpu/docs/v5e"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;v5e TPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; using MaxText with Fully Sharded Data Parallelism (FSDP).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The implementation process was straightforward. We found that MaxText's flexible design, built on &lt;/span&gt;&lt;a href="https://flax.readthedocs.io/en/latest/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Flax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://optax.readthedocs.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Optax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://orbax.readthedocs.io/en/latest/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Orbax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, was well-suited for the wide range of ablations required for MoE research. Specifically:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Integrated Kernels:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; &lt;/span&gt;&lt;a href="https://arxiv.org/pdf/2211.15841" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Megablocks MoE kernels&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; which support optimized MoE features like &lt;/span&gt;&lt;a href="https://triton-lang.org/main/getting-started/tutorials/08-grouped-gemm.html" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Group GEMM&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; were already integrated into JAX.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong style="vertical-align: baseline;"&gt;Combining Schedules:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; We used the &lt;/span&gt;&lt;a href="https://optax.readthedocs.io/en/latest/api/optimizer_schedules.html#optax.schedules.join_schedules" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;optax.join_schedules&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; function to combine multiple learning rate schedules (e.g. warmup, constant, and annealing) into a single, custom schedule for our training run. This ability to combine different schedules is very useful to experiment with different training strategies.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_j16a08s.max-1000x1000.png"
        
          alt="4"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;ul&gt;
&lt;li&gt;&lt;strong style="vertical-align: baseline;"&gt;Code Customization:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; We needed to enable the load balancing loss for our sparse &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;matmul&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; implementation. This required inserting a single line of code in the&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; permute&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; function within the MoE block of MaxText to calculate the loss directly from the router logits.&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/5_jbEUhND.max-1000x1000.png"
        
          alt="5"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The results showed performance improvements, particularly in code and math benchmarks, suggesting domain specialization among the experts. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/6_rOvXBXi.max-1000x1000.png"
        
          alt="6"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="eo8rb"&gt;Performance Evaluation&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This met our objectives and further demonstrated the JAX stack's utility for advanced model development. We are now extending this work by experimenting with shared experts and replacing initial MoE layers with dense layers, modifications which are simple to implement within the MaxText framework.&lt;/span&gt;&lt;/p&gt;
&lt;h4&gt;&lt;strong style="vertical-align: baseline;"&gt;Performance improvements and key takeaways&lt;/strong&gt;&lt;/h4&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;During our work, we gained early access to &lt;/span&gt;&lt;a href="https://blog.google/feed/trillium-tpus/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Trillium TPUs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. We managed the transition from v5e by changing a few parameters in our XPK cluster and workload configurations. We observed an immediate and substantial throughput increase of 2.7x across our models, along with improved cost-performance efficiency.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/1_w3A2Df0.png"
        
          alt="7"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Based on our experience, the JAX stack on TPUs provides a comprehensive and efficient environment for AI model development. The key advantages for our team include:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Performance and scalability:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The &lt;/span&gt;&lt;a href="http://jax.dev" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;JAX&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://openxla.org/xla" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;XLA&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; combination provides just-in-time compilation, and &lt;/span&gt;&lt;a href="https://github.com/AI-Hypercomputer/maxtext" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;MaxText&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is optimized for large-scale parallel computing with support for paradigms like SPMD and FSDP.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Customizability and control:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The codebase, being pure Python and built on libraries like &lt;/span&gt;&lt;a href="https://flax.readthedocs.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Flax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://optax.readthedocs.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Optax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://orbax.readthedocs.io/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Orbax&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is intuitive and easy to modify. This allows us to implement custom data pipelines, training strategies, and novel architectures with minimal overhead.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Rapid feature adoption:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The MaxText framework is updated quickly with features from new state-of-the-art models, allowing us to stay current with our research.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;These strengths have made the JAX stack a powerful and flexible foundation for our work in training large language models at Kakao.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Build your Language Models with the JAX Ecosystem:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Kakao's journey demonstrates how  the JAX ecosystem’s modular design — including MaxText, Flax, Optax, and Orbax — enables the customization required for both production pipelines and advanced research, from tailored data blending to rapid experimentation with MoE architectures.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our sincere thanks to Minho, Nayeon and their team for sharing their insightful engineering work. We look forward to seeing how they and other leading enterprises worldwide continue to use the JAX ecosystem to build the next generation of powerful and efficient language models.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Tue, 19 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/infrastructure-modernization/kakaos-journey-with-jax-and-cloud-tpus/</guid><category>AI &amp; Machine Learning</category><category>Customers</category><category>Infrastructure Modernization</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>An efficient path to production AI: Kakao’s journey with JAX and Cloud TPUs</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/infrastructure-modernization/kakaos-journey-with-jax-and-cloud-tpus/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Minho Ryu, Nayeon Kim </name><title>Language Model Research Engineers, Kakao</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Srikanth Kilaru</name><title>Senior Product Manager, Google ML Frameworks</title><department></department><company></company></author></item><item><title>How Yahoo Calendar broke free from hardware queues and DBA bottlenecks</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/how-yahoo-calendar-broke-free-from-hardware-queues-and-dba-bottlenecks/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;em&gt;&lt;strong&gt;Editor's note&lt;/strong&gt;: &lt;a href="https://overview.mail.yahoo.com/" rel="noopener" target="_blank"&gt;Yahoo Mail&lt;/a&gt; is in the midst of one of its largest infrastructure transformations to date: a multi-year effort to modernize hundreds of petabytes of services by moving to Google Cloud.The Yahoo Mail migration - a high-scale always-on workload - began with Yahoo Calendar, a product that is an essential part of the experience for hundreds of millions of Yahoo Mail users.  It was a massive undertaking with no room for error, and the result was a smooth cutover with no customer impact that proved &lt;a href="https://cloud.google.com/sql"&gt;Cloud SQL&lt;/a&gt; could handle the complexity and pace of Yahoo’s operations. It also marked a shift in how Yahoo works by reducing manual overhead, unlocking developer agility, and laying the foundation for what comes next.&lt;/em&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;p&gt;At Yahoo, we knew migrating our cornerstone platform — Yahoo Mail — to the cloud would be one of the most significant infrastructure efforts we’d ever taken on. With over 500 petabytes of interconnected systems, we knew we needed to start with a smaller, high-impact workload to build early confidence. That’s how Yahoo Calendar,  a product that is an essential part of the experience for hundreds of millions of Yahoo Mail users, , became the first production service to make the move. &lt;/p&gt;
&lt;p&gt;We needed to migrate a high-scale, always-on service without disrupting the experience users rely on every day — or risk millions of people missing standups, birthday dinners, or that dentist appointment they actually remembered to schedule.&lt;/p&gt;
&lt;p&gt;We chose Google Cloud to help us modernize our operations with managed infrastructure, reduce manual effort, and tap into a trusted ecosystem for large-scale transformation. Migrating Yahoo Calendar became our proving ground for running mission-critical services on Cloud SQL and would set the pace for the rest of our multi-year migration plan for Yahoo Mail.&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Build smarter with Google Cloud databases!&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e2706eb80&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;Modernizing infrastructure without skipping a single invite&lt;/h2&gt;
&lt;p&gt;The infrastructure we were replacing included tens of on-premises MySQL (Percona) instances. It was solid but not built for operational speed. Scaling meant filing hardware requests and often waiting weeks or even months. Routine tasks like backups or upgrades had to go through a separate database administration (DBA) team. And as demand grew, the need for agility grew with it. To meet that growing need with more flexibility and speed, we took on a massive lift:&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Migrating tens of database shards across multiple regions&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Moving over 20+ TB of storage (excluding replicas)&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Supporting peak traffic of 1 million QPS reads and 2,500 QPS writes&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Replatforming our application stack to run on &lt;a href="https://cloud.google.com/kubernetes-engine/"&gt;Google Kubernetes Engine&lt;/a&gt; (GKE) to support the Calendar experience for the hundreds of millions of Yahoo Mail users&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Cloud SQL's support for our existing MySQL workloads with minimal changes lets us replicate our on-prem shards without a full re-architecture. That compatibility provided the foundation to restructure our full stack. To make it all work, we migrated the UI, API, and backend to GKE and connected everything to  Cloud SQL deployments in multiple Google Cloud regions. All of this had to be migrated incrementally, with no downtime for public users. Traffic continued flowing through existing endpoints, and our proxy layer routed requests based on each user’s location and migration state. As database shards became ready, we carefully flipped them into read-write mode on Cloud SQL to keep Calendar users running on schedule while shifting the backend in stages.&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/1_Zu3j3bD.jpg"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="y1uhk"&gt;Fig. 1 - MySQL On-Prem to Cloud SQL Initial Load + CDC&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;A migration this big needed backup&lt;/h2&gt;
&lt;p&gt;Google Cloud’s Professional Services Organization (PSO) played a critical role in getting us there. From the earliest stages, they were embedded with our team. They helped us evaluate Cloud SQL and &lt;a href="https://cloud.google.com/database-migration"&gt;Database Migration Service&lt;/a&gt; (DMS), guide proof-of-concept work, and stress-test our migration architecture.&lt;/p&gt;
&lt;p&gt;When we hit a roadblock replicating data with DMS, PSO worked closely with Cloud SQL engineering and our internal security and DBA teams to design a custom workaround. During cutover, they were right there with us to help with hiccups like debugging capacity constraints or troubleshooting connection spikes during shadow traffic. They also helped us resolve reverse replication failures caused by permission changes — an edge case we wouldn’t have anticipated without their guidance.&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_cSrdJe5.max-1000x1000.png"
        
          alt="2"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="y1uhk"&gt;Fig. 2 - Yahoo Calendar migration diagram&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;Cloud SQL helped us block time for what matters&lt;/h2&gt;
&lt;p&gt;With managed infrastructure, we’ve significantly reduced manual operations, reduced  database admin overhead, and gained the agility to scale up without the wait. Our application teams now deploy and manage database shards ourselves using infrastructure as code (IaC), without relying on manual processes. Backups, patching, and failovers are automated to reduce risk and manual effort. Usage and cost monitoring are built-in, helping us optimize across the board. And thanks to tight integration with our security protocols, we’re able to maintain high confidence in operating a large-scale public-facing service.&lt;/p&gt;
&lt;p&gt;Today, Yahoo Calendar processes hundreds of thousands of queries per second, operates 26 Cloud SQL instances with disaster recovery (DR), and runs on infrastructure that includes 2,500 virtual CPUs and 17 TB of memory for databases alone. Our application tier spans 850 pods and 2,200 vCPUs, with 10 TB of memory to match. We now run at scale, with confidence — and without waiting on hardware or handoffs.&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_0rwvuTl.max-1000x1000.png"
        
          alt="3"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="y1uhk"&gt;Fig .3 - Architecture diagram of Yahoo Calendar’s services&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h2&gt;Up next on our calendar&lt;/h2&gt;
&lt;p&gt;We’re seeing the benefits of infrastructure that works with us, not against us. And we’re doing it all without compromising on scale, performance, or security. Now that we've pressure-tested our migration strategy and refined how we operate in the cloud, we're ready to take on Yahoo Mail’s full environment — 500 petabytes and counting.&lt;/p&gt;
&lt;p&gt;The next couple of years will be about scaling smart, staying nimble, and proving that modernization doesn’t have to mean disruption. But with the hardest part of any journey behind us (starting), and a calendar that runs on Cloud SQL, we’re in sync and right on schedule.&lt;/p&gt;
&lt;h2&gt;Learn more:&lt;/h2&gt;
&lt;ul&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Discover how &lt;a href="https://cloud.google.com/sql"&gt;Cloud SQL&lt;/a&gt; can transform your business! &lt;a href="https://console.cloud.google.com/freetrial?redirectPath=sql"&gt;Start a free trial today&lt;/a&gt;!&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;Download this &lt;a href="https://cloud.google.com/resources/idc-business-value-cloud-sql-analyst-report"&gt;IDC report&lt;/a&gt; to learn how migrating to Cloud SQL can lower costs, boost agility, and speed up deployments.&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1"&gt;
&lt;p role="presentation"&gt;&lt;span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;Learn how &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/databases/ford-reduces-routine-database-management-with-google-cloud" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;Ford&lt;/a&gt;&lt;span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/databases/lightricks-delivers-dynamic-search-with-cloud-sql-vector-support" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;Lightricks&lt;/a&gt;&lt;span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt; gained high performance and cut costs by modernizing with Cloud SQL.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</description><pubDate>Mon, 11 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/infrastructure-modernization/how-yahoo-calendar-broke-free-from-hardware-queues-and-dba-bottlenecks/</guid><category>Databases</category><category>Customers</category><category>Infrastructure Modernization</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>How Yahoo Calendar broke free from hardware queues and DBA bottlenecks</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/infrastructure-modernization/how-yahoo-calendar-broke-free-from-hardware-queues-and-dba-bottlenecks/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sunitha Reddy</name><title>Director, Software Applications Engineering, Yahoo</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Lee Atkins</name><title>Senior Engineering Manager, Yahoo Mail</title><department></department><company></company></author></item><item><title>Google is a Leader in the Gartner® Magic Quadrant for Strategic Cloud Platform Services</title><link>https://cloud.google.com/blog/products/compute/google-is-a-leader-in-gartner-magic-quadrant-for-scps/</link><description>&lt;div class="block-paragraph"&gt;&lt;p data-block-key="pztel"&gt;For the eighth consecutive year, Gartner® has named Google a Leader in the Gartner Magic Quadrant™ for Strategic Cloud Platform Services. This year, however, marks a major milestone: Google is now positioned furthest for completeness of vision.&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_zyImNXV.max-1000x1000.png"
        
          alt="image1"&gt;
        
        &lt;/a&gt;
      
        &lt;figcaption class="article-image__caption "&gt;&lt;p data-block-key="wa9s9"&gt;&lt;a href="https://cloud.google.com/resources/content/2025-gartner-magic-quadrant-strategic-cloud-platform-services"&gt;Download the complimentary 2025 Magic Quadrant for Strategic Cloud Platform Services&lt;/a&gt;.&lt;/p&gt;&lt;/figcaption&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph"&gt;&lt;p data-block-key="pztel"&gt;How do we keep getting recognized year over year? In our opinion, and what our customers consistently relay, are three major advantages of working with Google Cloud infrastructure:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="5jsg5"&gt;A purpose-built modern cloud, specifically designed for AI-first apps and services.&lt;/li&gt;&lt;li data-block-key="1da93"&gt;Workload-optimized compute, storage, and the most versatile cloud networking that make it easier to migrate and modernize key enterprise workloads — SAP, VMware, Microsoft, Oracle, OpenShift, and even mainframes.&lt;/li&gt;&lt;li data-block-key="14n4"&gt;Leading reliability and security among hyperscalers — customers see Google as an extension of their security team.&lt;/li&gt;&lt;/ul&gt;&lt;p data-block-key="996bl"&gt;But that’s not the full story. What does Google Cloud do behind the scenes to provide these advantages to our customers? Let’s dive into the five principles that Google Cloud lives by as we continue to build and enhance our infrastructure:&lt;/p&gt;&lt;h3 data-block-key="17k07"&gt;&lt;b&gt;1. Run workloads on precisely optimized infrastructure&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="covvu"&gt;At Google Cloud, we believe that simply adding more hardware is an unsustainable and ineffective approach to scaling application performance. That’s why we develop strategic, workload-optimized infrastructure technologies. For example, &lt;b&gt;Titanium&lt;/b&gt; is a system of hardware and software offloads, boosting performance, reliability, and security and maximizing workload efficiency.&lt;/p&gt;&lt;p data-block-key="6qfj5"&gt;In the past 12-months we also &lt;a href="https://cloud.google.com/blog/products/compute/delivering-new-compute-innovations-and-offerings"&gt;enhanced our Compute Engine platform&lt;/a&gt; to drive enterprise transformation with more massive developments:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="5r1h9"&gt;&lt;b&gt;Fourth-generation compute instances&lt;/b&gt; and &lt;b&gt;Hyperdisk block storage,&lt;/b&gt; focusing on optimizing performance and costs across workloads, while delivering enterprise-grade scalability, reliability, security, and workload consistency, ultimately enabling businesses to grow efficiently and invest more in innovation.&lt;/li&gt;&lt;li data-block-key="ep23l"&gt;We also developed our own custom &lt;a href="https://cloud.google.com/blog/products/compute/google-axion-powers-cloud-sql-and-alloydb"&gt;&lt;b&gt;Arm®-based CPUs, Google Axion processors&lt;/b&gt;&lt;/a&gt;, aimed at maximizing performance, reducing infrastructure costs, and supporting sustainability goals for general-purpose workloads.&lt;/li&gt;&lt;li data-block-key="c5pbm"&gt;Continued enhancements to our &lt;b&gt;fully managed database services&lt;/b&gt;, such as AlloyDB for PostgreSQL and Cloud SQL, and easy migration from traditional databases to Google Cloud infrastructure.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="ed9pv"&gt;&lt;b&gt;2. Build AI-centric infrastructure&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="5npbu"&gt;Google, like in 2024, is again ranked #1 for AI/ML in the 2025 Gartner Critical Capabilities for Strategic Cloud Platform Services. And we’ve only added to our depth and breadth since then. For evidence of our commitment to AI, there’s &lt;a href="https://cloud.google.com/blog/products/compute/whats-new-with-ai-hypercomputer"&gt;&lt;b&gt;AI Hypercomputer&lt;/b&gt;, built on Google AI technology developed over the past decade&lt;/a&gt;, underpinning nearly every AI workload run on Google Cloud.&lt;/p&gt;&lt;p data-block-key="beue8"&gt;This integrated supercomputing system delivers more intelligence at a consistently low price for both training and serving AI workloads. It powers &lt;b&gt;Vertex AI&lt;/b&gt;, our managed machine learning (ML) platform, that unifies the entire ML lifecycle—from data preparation to model deployment and monitoring. From an infrastructure perspective, it allows data scientists and developers to focus on building models rather than provisioning and managing the infrastructure required to run them.&lt;/p&gt;&lt;p data-block-key="1e2v9"&gt;Beyond core infrastructure, we’re also dedicated to fostering an open and innovative generative AI partner ecosystem, helping companies rapidly transform their software development, business processes, and information retrieval.&lt;/p&gt;&lt;p data-block-key="7rl85"&gt;We’ve also released a plethora of new AI products and services, including:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="arm3b"&gt;&lt;b&gt;Ironwood&lt;/b&gt;, announced at Next ‘25, is our 7th generation TPU, built specifically for inference, offering 5x more peak compute capacity and 6x more high-bandwidth memory than its predecessor and achieving a staggering 42.5 exaFLOPS of compute in its larger configuration while being 2x more power efficient.&lt;/li&gt;&lt;li data-block-key="1iaab"&gt;&lt;b&gt;Google Cloud Managed Lustre&lt;/b&gt;, a fully managed parallel file system built on the DDN EXAScaler Lustre file system, which provides PB scale at under 1ms latency, millions of IOPS, and TB/s of throughput for AI workloads; &lt;b&gt;Rapid Storage,&lt;/b&gt; a new Cloud Storage zonal bucket that provides industry-leading &amp;lt;1ms random read and write latency, 20x faster data access, 6 TB/s of throughput, and 5x lower latency for random reads and writes compared to other leading hyperscalers; &lt;b&gt;Anywhere Cache&lt;/b&gt;, which provides an SSD-backed zonal read cache for Cloud Storage buckets; &lt;b&gt;Hyperdisk Exapools&lt;/b&gt;, entering preview this quarter, providing exabytes of block storage capacity and TB/s of throughput for AI clusters.&lt;/li&gt;&lt;li data-block-key="fd6hk"&gt;&lt;a href="https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/" target="_blank"&gt;&lt;b&gt;Gemini CLI&lt;/b&gt; (Command Line Interface)&lt;/a&gt;, an open-source AI agent that brings Gemini 2.5 Pro directly into the terminal with unmatched free usage limits (60 model requests per minute, 1,000 requests per day) and integrates with Gemini Code Assist for AI-first coding and problem-solving.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="3usrh"&gt;&lt;b&gt;3. Meet the AI moment with containers&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="2a6sf"&gt;With Google Kubernetes Engine (GKE), Google is once again a Leader in the Gartner Magic Quadrant for Container Management this year. And like our advancements elsewhere, we feel it's only gotten better. Built on years of experience and a deep commitment to Kubernetes, GKE has emerged as the foundation for the next generation of AI workloads. In fact, we use GKE to power our own leading AI services at scale, including Vertex AI, leveraging the same cutting-edge technologies and best practices we share with you.&lt;/p&gt;&lt;p data-block-key="48ps4"&gt;To continue empowering your AI innovation, &lt;a href="https://cloud.google.com/blog/products/containers-kubernetes/how-gke-powers-ai-innovation"&gt;we've rolled out several impressive new releases&lt;/a&gt;:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="6uqb0"&gt;&lt;b&gt;GKE Autopilot&lt;/b&gt;, with an impressive 30% of active GKE clusters created in 2024 operating in Autopilot mode, showcases its effectiveness in simplifying operations and enhancing resource efficiency for critical workloads.&lt;/li&gt;&lt;li data-block-key="3373u"&gt;&lt;b&gt;Cluster Director&lt;/b&gt;, now GA, allows for the deployment and management of massive clusters of accelerators (GPUs and TPUs) as a single, high-performance unit, crucial for demanding AI models and distributed workloads.&lt;/li&gt;&lt;li data-block-key="87rpt"&gt;For optimizing AI inference, the new &lt;b&gt;GKE Inference Quickstart&lt;/b&gt; and &lt;b&gt;GKE Inference Gateway&lt;/b&gt; (both in public preview) simplify model deployment with benchmarked profiles and provide intelligent routing, leading to up to a 30% reduction in serving costs and up to a 60% decrease in tail latency.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="1ga0r"&gt;&lt;b&gt;4. Empower customers with true sovereignty&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="5uap7"&gt;At Google Cloud, we believe that &lt;a href="https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud"&gt;&lt;b&gt;digital sovereignty&lt;/b&gt;&lt;/a&gt; is about more than just compliance; it's about providing organizations with &lt;b&gt;flexibility, control, choice, and robust security&lt;/b&gt; without compromising functionality to enable innovation. We do that by delivering technology capabilities that align with our customers' diverse needs, backed by extensive engagement with local partners and policymakers.&lt;/p&gt;&lt;p data-block-key="8brgu"&gt;Our strength in this area comes from our &lt;a href="https://cloud.google.com/about/locations"&gt;&lt;b&gt;massively scaled global infrastructure&lt;/b&gt;&lt;/a&gt;, encompassing over 42 cloud regions, 127 zones, and 202 network edge locations, alongside significant subsea cable investments. Two of our most important recent developments here include:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="egi1e"&gt;Forging &lt;b&gt;key partnerships&lt;/b&gt; with independent local and regional partners across Asia, Europe, the Middle East, and the United States, such as Schwarz Group, T-Systems, S3NS (with Thales), Minsait, Telecom Italia, and World Wide Technology&lt;/li&gt;&lt;li data-block-key="4u0re"&gt;Designing a portfolio of sovereign solutions designed to fit your unique business needs, regulatory requirements, and risk profiles. With &lt;a href="https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud"&gt;three precisely designed solutions&lt;/a&gt;, &lt;b&gt;Google Cloud Data Boundary, Google Cloud Dedicated, and Google Cloud Air-Gapped&lt;/b&gt;, organizations can choose what you need based on their unique requirements.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="bia0k"&gt;&lt;b&gt;5. Deliver a “planet-scale” network&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="db2ts"&gt;Our strength in networking is built upon &lt;a href="https://cloud.google.com/blog/products/networking/networking-innovations-at-google-cloud-next25"&gt;&lt;b&gt;over 25 years of Google's foundational innovations&lt;/b&gt;&lt;/a&gt;, connecting billions of people globally to essential services like Gmail, YouTube, and Search. This deep expertise has allowed us to build planet-scale network infrastructure that powers Google Cloud and our Cross-Cloud Network solutions. Our vast backbone network features 202 points of presence (PoPs), over 2 million miles of fiber, and 33 subsea cables, all backed by a 99.99% reliability SLA, providing a robust and resilient global platform.&lt;/p&gt;&lt;p data-block-key="20i4k"&gt;With the rapid emergence of AI, today’s industries are demanding unprecedented network capabilities for training, inference, and serving AI models, including massive capacity, seamless connectivity, and robust security. We're addressing this with continuous innovation in our cloud networking products and Cross-Cloud Network solutions, enabling customers to easily build and deliver distributed AI applications.&lt;/p&gt;&lt;p data-block-key="87fvq"&gt;To meet these demanding requirements, we've launched a suite of impressive new networking solutions, including:&lt;/p&gt;&lt;ul&gt;&lt;li data-block-key="3uri9"&gt;&lt;a href="https://cloud.google.com/blog/products/networking/connect-globally-with-cloud-wan-for-the-ai-era"&gt;&lt;b&gt;Cloud WAN&lt;/b&gt;&lt;/a&gt;, our new, fully managed, reliable, and secure enterprise backbone designed to transform wide area network (WAN) architectures by leveraging Google's planet-scale network. It offers up to 40% faster performance compared to the public internet and up to a 40% savings in total cost of ownership (TCO) over customer-managed solutions.&lt;/li&gt;&lt;li data-block-key="8pcom"&gt;&lt;b&gt;Cloud Interconnect&lt;/b&gt; and the new &lt;b&gt;Cross-Site Interconnect&lt;/b&gt; (options in Cloud WAN, currently in preview), make Google Cloud the first major cloud provider to offer transparent Layer 2 connectivity over its network.&lt;/li&gt;&lt;li data-block-key="bci6c"&gt;For AI-optimized networking, &lt;b&gt;400G Cloud Interconnect&lt;/b&gt; (4x more bandwidth for massive data ingestion), networking support for up to tens of thousands of GPUs per cluster, and &lt;b&gt;Zero-Trust RDMA security&lt;/b&gt; for high-performance GPU/TPU traffic.&lt;/li&gt;&lt;/ul&gt;&lt;h3 data-block-key="b5lp1"&gt;&lt;b&gt;Take the next steps on your journey with Google Cloud&lt;/b&gt;&lt;/h3&gt;&lt;p data-block-key="fbnbp"&gt;From optimizing legacy apps to building next-gen AI and everything in between, Google Cloud lets you tackle your biggest challenges with a platform that works across your data centers, other clouds, and the edge. With decades of experience and a proven track record of reliability, Google Cloud infrastructure is equipped to handle your most visionary workloads and is the ideal partner to drive your business transformation.&lt;/p&gt;&lt;p data-block-key="7c4vm"&gt;You can download a complimentary copy of the &lt;a href="https://cloud.google.com/resources/content/2025-gartner-magic-quadrant-strategic-cloud-platform-services"&gt;2025 Magic Quadrant for Strategic Cloud Platform Services&lt;/a&gt; on our website.&lt;/p&gt;&lt;/div&gt;</description><pubDate>Fri, 08 Aug 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/compute/google-is-a-leader-in-gartner-magic-quadrant-for-scps/</guid><category>Infrastructure Modernization</category><category>Compute</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Google is a Leader in the Gartner® Magic Quadrant for Strategic Cloud Platform Services</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/compute/google-is-a-leader-in-gartner-magic-quadrant-for-scps/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Brad Calder</name><title>VP &amp; GM, Google Cloud Platform</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Amin Vahdat</name><title>SVP and Chief Technologist, AI and Infrastructure</title><department></department><company></company></author></item><item><title>Selecting the right Hyperdisk block storage for your workloads</title><link>https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As you adopt Google Cloud or migrate to the latest Compute Engine VMs or to Google Kubernetes Engine (GKE), selecting the right block storage for your workload is crucial. &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hyperdisks"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk,&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; Google Cloud's workload-optimized block storage that’s designed for our latest &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/machine-resource"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;VM families&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (C4, N4, M4, and more), delivers high-performance storage volumes that are cost-efficient, easily managed at scale, and enterprise-ready. In this post, we guide you through the basics and help you choose the optimal Hyperdisk for your environment.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Introduction to Hyperdisk block storage&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With Hyperdisk, you can independently tune capacity and performance to match your block storage resources to your workload. &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hyperdisks#when-to-use"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk is available in a few flavors:&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Hyperdisk Balanced:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Designed to fit most workloads and offers the best combination and balance of price and performance. This is also the boot disk for your compute instances. With Hyperdisk Balanced, you can independently configure the capacity, throughput, and IOPS of each volume. Hyperdisk Balanced is available in High Availability and Multi-writer mode.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Hyperdisk Extreme:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Delivers the highest IOPS of all Hyperdisk offerings and is suited for high-end, performance-critical databases. With Hyperdisk Extreme, you can drive up to 350K IOPS from a single volume. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Hyperdisk Throughput:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Delivers capacity at the cost of cold object storage with the semantics of a disk. Hyperdisk Throughput offers high throughput for bandwidth and capacity-intensive workloads that do not require low latency. It also can be used to deliver cost-effective disks for cost-sensitive workloads (e.g., cold disks).&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Hyperdisk ML:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Purpose-built for loading static data into your compute clusters. With Hyperdisk ML, you hydrate the disk with a fixed data set (such as model weights or binaries), then connect  &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;up to 2,500 compute instances to the same volume, so a single volume can serve over 150x more compute instances than competitive block storage volumes&lt;/span&gt;&lt;sup&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: super;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="vertical-align: baseline;"&gt; in read-only mode&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. You get exceptionally high aggregate throughput across all of those nodes, enabling you to accelerate inference startup, train models faster, and ensure your valuable compute resources are highly utilized. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can also leverage Hyperdisk &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/storage-pools"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Storage Pools&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, which lowers TCO and simplifies operations by pre-provisioning an aggregate amount of capacity and performance, which is then dynamically consumed by volumes in that pool. You create a storage pool with the aggregate capacity and performance that your workloads will need, and then create disks in the storage pool. You can then attach the disks to your VMs. When you create the disks, you can create them with a much larger size or provisioned performance limit than is needed. This simplifies planning and provides room for growth later, without needing to change the disk's provisioned size or performance. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;You can also use a set of comprehensive data protection capabilities such as high availability, cross-region replication and recovery, backup, and snapshots to protect your business critical workloads.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For specifics around capabilities, capacity, machine support, and performance, please &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hyperdisks#when-to-use"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;visit the documentation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e294cdd00&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Recommendations for the most common workloads&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To make choosing the right Hyperdisk architecture simpler, here are high-level recommendations for some of the most common workloads we see. For an enterprise, the Hyperdisk portfolio lets you optimize an entire three-tier application matching the needs of each component of your application to the different flavors of Hyperdisk.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Enterprise applications including general-purpose databases&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hd-types/hyperdisk-balanced"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk Balanced&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; combined with Storage Pools offers an excellent solution for a wide variety of general-purpose workloads, including common database workloads. Hyperdisk Balanced can meet the IOPS and throughput needs for most databases including Clickhouse, MySQL, and PostgreSQL, at general-purpose pricing. Hyperdisk Balanced offers 160K IOPS per volume — better than AWS EBS gp3 volumes&lt;/span&gt;&lt;sup&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: super;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="vertical-align: baseline;"&gt;. With Storage Pools you can enhance efficiency and radically simplify planning. Storage Pools allows customers to save approximately 20-40% on storage costs for typical database workloads when compared to Hyperdisk Balanced Volumes or AWS EBS gp3 volumes&lt;/span&gt;&lt;sup&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: super;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“At Sentry.io, a platform used by over 4 million developers and 130,000 teams worldwide to quickly debug and resolve issues, adopting Google Cloud's Hyperdisk has enabled us to create a flexible architecture for the next-generation of our Event Analytics Platform, a product at the core of our business. Hyperdisk Storage Pools with advanced capacity and performance enabled us to reduce our planning cycles from weeks to minutes, while saving 37% in storage costs, compared to persistent disks.” &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;- Dave Rosenthal, CTO, Sentry&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-video"&gt;



&lt;div class="article-module article-video "&gt;
  &lt;figure&gt;
    &lt;a class="h-c-video h-c-video--marquee"
      href="https://youtube.com/watch?v=jjU8F8FjzCM"
      data-glue-modal-trigger="uni-modal-jjU8F8FjzCM-"
      data-glue-modal-disabled-on-mobile="true"&gt;

      
        &lt;img src="//img.youtube.com/vi/jjU8F8FjzCM/maxresdefault.jpg"
             alt="Sentry: Creating the next generation data platform with Hyperdisk and Storage Pools"/&gt;
      
      &lt;svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white"&gt;
        &lt;use xlink:href="#mi-youtube-icon"&gt;&lt;/use&gt;
      &lt;/svg&gt;
    &lt;/a&gt;

    
  &lt;/figure&gt;
&lt;/div&gt;

&lt;div class="h-c-modal--video"
     data-glue-modal="uni-modal-jjU8F8FjzCM-"
     data-glue-modal-close-label="Close Dialog"&gt;
   &lt;a class="glue-yt-video"
      data-glue-yt-video-autoplay="true"
      data-glue-yt-video-height="99%"
      data-glue-yt-video-vid="jjU8F8FjzCM"
      data-glue-yt-video-width="100%"
      href="https://youtube.com/watch?v=jjU8F8FjzCM"
      ng-cloak&gt;
   &lt;/a&gt;
&lt;/div&gt;

&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“High Availability is essential for Blackline — we run database failover clustering, at massive scale, for our global and mission-critical deployment of Financial Close Management. We are excited to bring this workload to Google Cloud leveraging Hyperdisk Balanced High Availability to meet the performance, capacity, cost efficiency, and resilience requirements that our customers demand, and helps us address our customer’s financial regulatory needs globally.” &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;- Justin Brodley, SVP of Cloud Engineering and Operations, Blackline&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Tier-0 databases&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For high-end, performance-critical databases like SAP HANA, SQL Server, and Oracle Database, &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hd-types/hyperdisk-extreme"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk Extreme&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; delivers uncompromising performance. With Hyperdisk Extreme, you can obtain up to 350K IOPS and 10 GiB/s of throughput from a single volume.     &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;AI, analytics, and scale-out workloads&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Hyperdisk offers excellent solutions for the most demanding next-generation machine learning and high performance computing workloads. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Dynamically scaling AI and analytics workloads and high-performance file systems&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Workloads with fluctuating demand, and high peak throughput and IOPS, benefit from Hyperdisk Balanced and Storage Pools. These workloads can include customer-managed parallel file systems and scratch disks for accelerator clusters. Storage Pools’ dynamic resource allocation helps ensure that these workloads get the performance they need during peak times without requiring constant manual adjustments or inefficient over-provisioning. Further, once your Storage Pool is set up, planning at the per-disk level is significantly simpler. Note: If you want a fully managed file system, &lt;/span&gt;&lt;a href="https://cloud.google.com/managed-lustre/docs/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Managed Lustre&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is an excellent option for you to consider.  &lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“Combining our use of cutting-edge machine learning in quantitative trading at Hudson River Trading (HRT) with Google Cloud's accelerator-optimized machines, Dynamic Workload Scheduler (DWS) and Hyperdisk has been transformative in enabling us to develop [state-of-the-art] models. Hyperdisk storage pools have delivered substantial cost savings, lowering our storage expenses by approximately 50% compared to standard Hyperdisk while minimizing the amount of planning needed.&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;” &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;- Ragnar Kjørstad, Systems Engineer, Hudson River Trading&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-video"&gt;



&lt;div class="article-module article-video "&gt;
  &lt;figure&gt;
    &lt;a class="h-c-video h-c-video--marquee"
      href="https://youtube.com/watch?v=U1NbkumODpg"
      data-glue-modal-trigger="uni-modal-U1NbkumODpg-"
      data-glue-modal-disabled-on-mobile="true"&gt;

      
        &lt;img src="//img.youtube.com/vi/U1NbkumODpg/maxresdefault.jpg"
             alt="Hudson River Trading: Powering cutting-edge quantitative research models with Google Cloud"/&gt;
      
      &lt;svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white"&gt;
        &lt;use xlink:href="#mi-youtube-icon"&gt;&lt;/use&gt;
      &lt;/svg&gt;
    &lt;/a&gt;

    
  &lt;/figure&gt;
&lt;/div&gt;

&lt;div class="h-c-modal--video"
     data-glue-modal="uni-modal-U1NbkumODpg-"
     data-glue-modal-close-label="Close Dialog"&gt;
   &lt;a class="glue-yt-video"
      data-glue-yt-video-autoplay="true"
      data-glue-yt-video-height="99%"
      data-glue-yt-video-vid="U1NbkumODpg"
      data-glue-yt-video-width="100%"
      href="https://youtube.com/watch?v=U1NbkumODpg"
      ng-cloak&gt;
   &lt;/a&gt;
&lt;/div&gt;

&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;AI/ML and HPC data-load acceleration&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Hyperdisk ML is specifically optimized for accelerating data load times for inference, training and HPC workloads —  Hyperdisk ML accelerates model load time by 3-5x compared to common alternatives&lt;/span&gt;&lt;sup&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: super;"&gt;4&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="vertical-align: baseline;"&gt;. Hyperdisk ML is particularly well-suited for serving tasks compared to other storage services on Google Cloud because it can concurrently provide to many VMs exceptionally high aggregate throughput (up to 1.2 TiB/s of aggregate throughput per volume, offering greater than 100x higher performance than competitive offerings)&lt;/span&gt;&lt;sup&gt;&lt;span style="vertical-align: baseline;"&gt;&lt;span style="vertical-align: super;"&gt;5&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="vertical-align: baseline;"&gt;. You write once (up to 64 TiB per disk) and attach multiple VM instances to the same volume in a read-only mode. With Hyperdisk ML you can accelerate data load times for your most expensive compute resources, like GPUs and TPUs. For more, check out &lt;/span&gt;&lt;a href="http://g.co/cloud/storage-design-ai" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;g.co/cloud/storage-design-ai&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“At Resemble AI, we leverage our proprietary deep-learning models to generate high-quality AI audio through text-to-speech and speech-to-speech synthesis. By combining Google Cloud’s A3 VMs with NVIDIA H100 GPUs and Hyperdisk ML, we’ve achieved significant improvements in our training workflows. Hyperdisk ML has drastically improved our data loader performance, enabling 2x faster epoch cycles compared to similar solutions. This acceleration has empowered our engineering team to experiment more freely, train at scale, and accelerate the path from prototype to production." &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;-&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt; &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Zohaib Ahmed, CEO, Resemble AI&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;High-capacity analytics workloads: &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;For large-scale data analytics workloads like Hadoop and Kafka, which are less sensitive to disk latency fluctuations, Hyperdisk Throughput provides a cost-effective solution with high throughput. Its low cost per GiB and configurable throughput are ideal for processing large volumes of data with low TCO.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;How to size and set up your Hyperdisk&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To select and size the right Hyperdisk volume types for your workload, answer a few key questions:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Storage management.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Decide if you want to manage the block storage for your workloads in a pool or individually. If your workload will have more than 10 TiB of capacity in a single project and zone, you should consider using Hyperdisk Storage Pools to lower your TCO and simplify planning. &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;Note that Storage Pools do not affect disk performance; some data protection features such as Replication and High Availability are not supported in Storage Pools. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Latency.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; If your workload requires SSD-like latency (i.e., sub-millisecond), it likely should be served by Hyperdisk Balanced or Hyperdisk Extreme. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;IOPS or throughput.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; If your application requires less than 160K IOPS or 2.4 GiB/s of throughput from a single volume, Hyperdisk Balanced is a great fit. If it needs more than that, consider Hyperdisk Extreme. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Sizing performance and capacity.&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Hyperdisk offers independently configurable capacity and performance, allowing you to pay for just the resources you need. You can leverage this capability to lower your TCO by understanding how much capacity your workload needs (i.e., how much data, in GiB or TiB, is stored on the disks which serve this workload) and the peak IOPS and throughput of the disks. If the workload is already running on Google Cloud, you can see many of these metrics in your console under “&lt;/span&gt;&lt;a href="https://cloud.google.com/monitoring/charts/metrics-explorer"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Metrics Explorer&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.” &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Another important consideration is the level of business continuity and data protection required for your workloads. Different workloads have different Recovery Point Objective (RPO) and Recovery Time Objective (RTO) requirements, each with different costs. Think about your workload tiers when making data-protection decisions. The more critical an application or workload, the lower the tolerance for data loss and downtime. Applications critical to business operations likely require zero RPO and RTO in the order of seconds. Hyperdisk business continuity and data protection helps customers meet the performance, capacity, cost efficiency, and resilience requirements they demand, and helps them address their financial regulatory needs globally. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Here are a few questions to consider when selecting which variety of Hyperdisk to use for a workload:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;How do I protect my workloads from attack and malicious insiders? &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Use&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;a href="https://cloud.google.com/backup-disaster-recovery/docs/concepts/backup-vault"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud Backup vault&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;for&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;cyber resilience, backup immutability, and indelibility for managed backup reporting and compliance. If you want to self-manage your own backups, Hyperdisk standard snapshots are an option for your workloads.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;How do I protect data from user errors and bad upgrades cost efficiently with low RPO / RTO?&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; You can use our &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;point-in-time recovery with &lt;/strong&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/instant-snapshots"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Instant Snapshots&lt;/strong&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This feature minimizes the risk of data loss from user error and bad upgrades with ultra-low RPO and RTO — creating a checkpoint is nearly instantaneous.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;How do I easily deploy my critical workload (e.g., MySQL) with resilience across multiple locations?&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; You can utilize &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Hyperdisk HA. &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;This is a great fit for scenarios that require high availability and fast failover, such as SQL Server that leverages failover clustering. For such workloads, you can also choose our new capability with &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hd-types/hyperdisk-balanced-ha"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk Balanced High Availability&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/sharing-disks-between-vms"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Multi-Writer support&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This allows you to run clustered compute with workload-optimized storage in two zones with RPO=0 synchronous replication. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;When a disaster occurs, how do I recover my workload elsewhere quickly and reliably, and run drills to confirm my recovery process?&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Utilize our &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;disaster recovery capabilities with &lt;/strong&gt;&lt;a href="https://cloud.google.com/compute/docs/disks/hyperdisks#hd-sync-rep"&gt;&lt;strong style="text-decoration: underline; vertical-align: baseline;"&gt;Hyperdisk Async Replication&lt;/strong&gt;&lt;/a&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;which&lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;enables cross-region continuous replication and recovery from a regional failure, with fast validation support for disaster recovery drills via cloning. Further, consistency group policies help ensure that workload data that’s distributed across multiple disks is recoverable when a workload needs to fail over between regions.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In short, Hyperdisk provides a wealth of options to help you optimize your block storage to the needs of your workloads. Further, selecting the right Hyperdisk and leveraging features such as Storage Pools can help you lower your TCO and simplify management. To learn more, please visit our &lt;/span&gt;&lt;a href="https://cloud.google.com/products/block-storage?e=48754805&amp;amp;hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;website&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. For tailored recommendations, always consult your Google Cloud account team.&lt;/span&gt;&lt;/p&gt;
&lt;hr/&gt;
&lt;p role="presentation"&gt;&lt;sub&gt;&lt;em&gt;&lt;span style="vertical-align: baseline;"&gt;1. As of March 2025 based on published information for &lt;/span&gt;&lt;a href="https://docs.aws.amazon.com/ebs/latest/userguide/ebs-volumes-multi.html#considerations" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Amazon EBS&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-shared#ultra-disk-ranges" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Azure managed disks&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;br/&gt;2. &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;As of May 2025, compared to &lt;/span&gt;&lt;a href="https://aws.amazon.com/ebs/general-purpose/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Amazon EBS gp3&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; volumes max iops/volume&lt;br/&gt;3. &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;As of March 2025, at list price, 50 to 150 TiB, peak IOPS of 25K to 75K and 25% compressibility, compared to &lt;/span&gt;&lt;a href="https://aws.amazon.com/ebs/general-purpose/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Amazon EBS gp3&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; volumes.&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;4. As of March 2025, based on internal Google benchmarking, compared to Rapid Storage, GCSFuse with Anywhere Cache, Parallelstore and Lustre for larger node sizes. &lt;br/&gt;5. As of March 2025 based on published performance for &lt;a href="https://learn.microsoft.com/en-us/azure/virtual-machines/disks-types" rel="noopener" target="_blank"&gt;Microsoft Azure Ultra SSD&lt;/a&gt; and &lt;a href="https://aws.amazon.com/ebs/features/" rel="noopener" target="_blank"&gt;Amazon EBS io2 BlockExpress&lt;/a&gt;&lt;/span&gt;&lt;/em&gt;&lt;/sub&gt;&lt;/p&gt;
&lt;p role="presentation"&gt;&lt;em&gt;&lt;sup&gt;The authors would like to thank David Seidman and Ruwen Hess for their contributions on this blog.&lt;/sup&gt;&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 11 Jun 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case/</guid><category>Compute</category><category>Infrastructure Modernization</category><category>Storage &amp; Data Transfer</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Selecting the right Hyperdisk block storage for your workloads</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Ben Gitenstein</name><title>Group Product Manager, Google</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Sai Gopalan</name><title>Product Management, Google Cloud</title><department></department><company></company></author></item><item><title>Advancing sovereignty, choice, and security in the cloud for our customers</title><link>https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Like most organizations, Google Cloud is continually engaging with customers, partners, and policymakers to deliver technology capabilities that reflect their needs. When it comes to digital sovereignty solutions, Google Cloud has worked with customers for nearly a decade. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Today, we’re pleased to announce significant technical and commercial updates on our &lt;/span&gt;&lt;a href="https://cloud.google.com/sovereign-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;sovereign cloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; solutions for customers, and details on how we’re helping them achieve greater control, choice, and security in the cloud — without compromising functionality. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Building on the &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/how-google-cloud-is-addressing-data-sovereignty-in-europe-2020?e=48754805"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;first sovereign solutions&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; we introduced years ago, we’ve massively scaled our infrastructure footprint globally, now consisting of more than 42 cloud regions, 127 zones, 202 network edge locations, and 33 subsea cable investments.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We have also forged key partnerships in Asia, Europe, the Middle East, and the United States to help deliver these sovereign solutions, including &lt;/span&gt;&lt;a href="https://gruppe.schwarz/en/press/archive/2024/companies-of-schwarz-group-and-google-to-sign-partnership" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Schwarz Group&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/t-systems-sovereign-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;T-Systems&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Germany), &lt;/span&gt;&lt;a href="https://www.s3ns.io/en/news/partnership-announcement" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;S3NS&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (France), &lt;/span&gt;&lt;a href="https://www.minsait.com/en/news/media-room/google-cloud-partners-minsait-boost-digital-sovereignity-spain" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Minsait&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Spain), &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/infrastructure/new-google-cloud-region-in-milan-italy-now-open"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Telecom Italia&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Italy), &lt;/span&gt;&lt;a href="https://clarence-cloud.com/en/home/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Clarence&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Belgium and Luxembourg), &lt;/span&gt;&lt;a href="https://cntxt.com/products/google-cloud-solutions/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;CNTXT&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Saudi Arabia), &lt;/span&gt;&lt;a href="https://cloud.google.com/find-a-partner/partner/kddi-corporation"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;KDDI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (Japan), and &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/public-sector/google-public-sector-and-wwt-team-up-to-enhance-cloud-sovereignty/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;World Wide Technology&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (United States). &lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;A commitment to customer choice&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Digital sovereignty is about more than just controlling encryption keys. At its core, it’s about giving customers the &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/cloud-ciso-perspectives-digital-sovereignty-builds-better-borders-future"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;flexibility their global businesses require&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. It’s about enabling them to operate on multiple clouds. And it’s about securing data with the most advanced technologies.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We’ve long been committed to enabling customers to choose the cloud provider and solution that best fit their needs, and not locking them into a single option. Sovereignty in the cloud is not one-size-fits-all. We offer customers a portfolio of solutions that align with their business needs, regulatory requirements, and risk profiles. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our strong contractual commitments to our customers are backed by robust sovereign controls and solutions that are all available &lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;today&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;. Our updated sovereign cloud solution portfolio includes:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud Data Boundary&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; gives customers the ability to &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/assured-workloads"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;deploy a sovereign data boundary&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and control where their content is stored and processed. This boundary also allows customers to store and manage their encryption keys outside Google’s infrastructure, which can help customers meet their specific data access and control requirements no matter what market.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud Data Boundary customers have access to a large set of Google Cloud products, including AI services, and can enable capabilities, including &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/confidential-computing" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Computing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/security-key-management" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;External Key Management&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; with &lt;/span&gt;&lt;a href="https://cloud.google.com/assured-workloads/key-access-justifications/docs/overview" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Key Access Justifications&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to control access to their data and deny access for any reason. &lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;In addition, &lt;/span&gt;&lt;a href="https://workspace.google.com/security/digital-sovereignty/" rel="noopener" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Workspace&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; customers can take advantage of Google Cloud Data Boundary’s sovereign controls to limit the processing of their content to the United States or EU, choose a country to locally store data, and use client-side encryption to prevent unauthorized access (even by Google) to their most critical content.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Today, we are also announcing &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;User Data Shield&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;, a solution that adds Mandiant services to validate the security of customer applications built on top of Google Cloud Data Boundary. User Data Shield provides recurring security testing of customer applications to validate sovereignty postures. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud Dedicated &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;delivers a solution designed to meet local sovereignty requirements, enabled by independent local and regional partners. As an example, Google Cloud has partnered with &lt;/span&gt;&lt;a href="https://www.s3ns.io/en/news/partnership-announcement" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Thales&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; since 2021 to build a first-of-its-kind &lt;/span&gt;&lt;a href="https://www.s3ns.io/en/offres/trusted-cloud-by-S3NS" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Trusted Cloud by S3NS&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; for Europe. &lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;This offering with Thales is designed to offer a rich set of Google Cloud services with GPUs to support AI workloads and is operated by S3NS, a standalone French entity. Currently in preview, S3NS’ solution is designed to meet the rigorous security and operational resilience requirements of France’s &lt;/span&gt;&lt;a href="https://cyber.gouv.fr/secnumcloud-pour-les-fournisseurs-de-services-cloud" rel="noopener" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;SecNumCloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; standards. We are expanding our Google Cloud Dedicated footprint globally, launching next in Germany.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;“For France to truly embrace digital sovereignty, it is essential to have a cloud solution that marries the immense power of hyperscale technology with the strictest local security and operational controls. S3NS is committed to providing French organizations with access to advanced cloud services, including critical AI capabilities, all operated within France by a European operator to meet and exceed the rigorous SecNumCloud standards,” said Christophe Salomon, EVP, Information Systems and Secured Communication, at Thales.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Cloud Air-Gapped &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;offers a fully standalone and &lt;/span&gt;&lt;a href="https://cloud.google.com/distributed-cloud-air-gapped"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;air-gapped solution&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; that does not require connectivity to an external network. This solution is tailored for customers in the intelligence, defense, and other sectors with strict data security and residency requirements. The air-gapped solution can be deployed and operated by Google, the customer, or a Google partner.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;It is built with open-source components and comes with a targeted set of &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/infrastructure-modernization/unlock-ai-anywhere-with-google-distributed-cloud" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, database, and infrastructure services. Because air-gapped solutions run on open-source components, they are designed to provide business continuity and survivability in the event of service disruptions. Google Cloud Air-Gapped &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/topics/public-sector/google-public-sector-achieves-top-secret-and-secret-cloud-authorization" style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;received authorization&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; in 2024 to host U.S. government Top Secret and Secret-level data.&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;"&gt;“Working with Google Cloud to introduce sovereign offerings can give our joint clients greater control, choice, and security in the cloud, without compromising the functionality of their underlying cloud architectures,” said Scott Alfieri, Senior Managing Director and Google Business Group Lead at Accenture. “Google Cloud's extensive global infrastructure, coupled with Accenture’s transformation and industry expertise, helps organizations build an agile and scalable foundation, unlocking opportunities for growth and continuous innovation.”&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Local control, global security&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Security and sovereignty are two sides of the same coin. Local control of data and operations can provide customers a greater level of confidence in their security, but it’s also true that no organization can be considered sovereign if dependencies on legacy infrastructure leave its data vulnerable to loss or theft. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Analysis from the Google Threat Intelligence Group and Google Cloud’s Office of the CISO suggests that the global cyber threat landscape &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/cloud-ciso-perspectives-our-2025-cybersecurity-forecast-report/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;will only become more complex&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; as malicious actors tap into AI-powered tools and techniques to prey on older software products, platforms, and outdated infrastructures.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;With Google Cloud, customers not only get sovereign solutions, but also gain access to our leading security capabilities. This includes our rigorous focus on &lt;/span&gt;&lt;a href="https://blog.google/technology/safety-security/google-secure-by-design-pledge/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;secure by design technology&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and deep expertise from &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/threat-intelligence"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Threat Intelligence Group&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; and &lt;/span&gt;&lt;a href="https://cloud.google.com/security/mandiant"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Mandiant Consulting&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, who operate on the frontlines of cyber conflicts worldwide and maintain trusted partnerships with more than 80 governments around the world. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In addition, &lt;/span&gt;&lt;a href="https://cloud.google.com/security/solutions/secops-cybershield"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud CyberShield&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; provides AI and intelligence-driven cyber defense to help governments defend against threats at national scale. And &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/managed-defense"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Mandiant Managed Defense&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; services make it easy for customers worldwide to extend their security teams with our security team. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Sovereign Cloud solutions ultimately enable customers to leverage the secure foundation of Google Cloud, while gaining access to advanced security features — such as &lt;/span&gt;&lt;a href="https://cloud.google.com/security/products/confidential-computing"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Confidential Computing&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://cloud.google.com/learn/what-is-zero-trust"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Zero Trust&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/cloud-ciso-perspectives-prepare-early-for-PQC-resilient-cryptographic-threats/"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;post-quantum cryptography&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://cloud.google.com/security/ai"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;AI-powered platform defenses&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; — faster and more cost-effectively than they could achieve on their own.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Sovereign solutions for any organization&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We remain dedicated to fostering an environment of trust and control for our customers, empowering organizations globally to navigate the complex landscape of digital sovereignty with confidence. We continue to work with customers, partners, and policymakers around the world to refine our sovereign cloud offerings and deliver technologies that address their needs. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;To learn more about how we are enabling our customers’ digital sovereignty capabilities, visit our &lt;/span&gt;&lt;a href="https://cloud.google.com/sovereign-cloud?e=48754805&amp;amp;hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;web page&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; or contact your account manager.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 21 May 2025 08:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud/</guid><category>Infrastructure Modernization</category><category>Hybrid &amp; Multicloud</category><category>Security &amp; Identity</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/sovereign-cloud-blog.jpg" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Advancing sovereignty, choice, and security in the cloud for our customers</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/original_images/sovereign-cloud-blog.jpg</image><site_name>Google</site_name><url>https://cloud.google.com/blog/products/identity-security/google-advances-sovereignty-choice-and-security-in-the-cloud/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Hayete Gallot</name><title>President, Customer Experience, Google Cloud</title><department></department><company></company></author></item><item><title>SandboxAQ: Accelerating drug discovery through cloud integration</title><link>https://cloud.google.com/blog/products/infrastructure-modernization/sandboxaq-speeds-up-drug-discovery-with-the-cloud/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The traditional drug discovery process involves massive capital investments, prolonged timelines, and is plagued with daunting failure rates. From initial research to obtaining regulatory approval, bringing a new drug to market can take decades. During this time, many drug candidates that had seemed very promising fail to deliver, either due to inefficacy or safety concerns. Only a small fraction of candidates successfully make it through clinical trials and regulatory hurdles. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Enter &lt;/span&gt;&lt;a href="https://www.sandboxaq.com/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;SandboxAQ&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, which is helping researchers explore vast chemical spaces, gain deep insights into molecular interactions, and predict biological outcomes with precision. It does so with cutting-edge computational approaches such as active learning, &lt;/span&gt;&lt;a href="https://pubs.acs.org/doi/10.1021/acs.jctc.4c00399" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;absolute free energy perturbation solution (AQFEP)&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://arxiv.org/abs/2405.11785" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;generative AI&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, structural analysis, and predictive data analytics, ultimately reducing drug discovery and development timelines. And it does all this on a cloud-native foundation. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Drug design involves an iterative cycle of designing, synthesizing, and testing molecules referred to as the Design-Make-Test cycle. Many customers approach SandboxAQ during the design phase, often when their computational methods are falling short. By improving and accelerating this part of the cycle, SandboxAQ helps medicinal chemists bring innovative and effective molecules to market. For example, in a project related to neurodegenerative disease, SandboxAQ’s approach expanded chemical space from 250,000 to 5.6 million molecules, achieving a 30-fold increase in hit rate and dramatically accelerating the discovery of candidate molecules. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_OZJ38Qu.max-1000x1000.png"
        
          alt="1"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Cloud-native development for scientific insight&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;SandboxAQ’s software relies on large-scale computation and to maximize flexibility and scale, they use a cloud strategy,  which includes Google Cloud infrastructure and tools. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;The technologies in large-scale virtual screening campaigns need to be agile and scale cost-effectively. Specifically, SandboxAQ engineers need to be able to quickly iterate on scientific code, immediately run that code at scale cost-effectively, and store and organize all of the data it produces. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;SandboxAQ achieved a significant boost in efficiency and scalability with Google Cloud infrastructure. They scaled their computational throughput by 100X to leverage tens of thousands of virtual machines (VMs) in parallel. They also improved utilization by reducing idle time by 90%. By consolidating development and deployment on Google Cloud, SandboxAQ streamlined its workflows, from code development and testing to large-scale batch processing and machine-learning model training. &lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;Try Google Cloud for free&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e271a8b20&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Get started for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;https://console.cloud.google.com/freetrial?redirectPath=/welcome&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;All of SandboxAQ’s development and deployment takes place in the cloud. Code and data live in cloud-based services, and development is done on a cloud-based platform that provides scientists and engineers with self-service VMs with standardized and centrally maintained environments and tools. This is important, because scientific code often requires heavy-duty computing hardware. Scientists have access to hefty 96-core machines, or instances with large GPUs. They can also create new machines with alternate configurations or CPU types as depicted below, enabling low-friction testing and development processes across heterogeneous resources.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-image_full_width"&gt;






  
    &lt;div class="article-module h-c-page"&gt;
      &lt;div class="h-c-grid"&gt;
  

    &lt;figure class="article-image--large
      
      
        h-c-grid__col
        h-c-grid__col--6 h-c-grid__col--offset-3
        
        
      "
      &gt;

      
      
        
        &lt;img
            src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_mgPMly4.max-1000x1000.png"
        
          alt="2"&gt;
        
        &lt;/a&gt;
      
    &lt;/figure&gt;

  
      &lt;/div&gt;
    &lt;/div&gt;
  




&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;SandboxAQ scientists and developers manage and access their Bench machines (see above) using the company’s `bench` client. They can connect to machines via SSH or use any number of managed tools, for example a browser-based VNC service for instant remote desktop, or JupyterLab for a familiar notebook development flow.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;As code is ready to be run at a larger scale, researchers can dispatch SandboxAQ parameterized sets of computations as jobs on an internal tool powered by &lt;/span&gt;&lt;a href="https://cloud.google.com/batch?e=48754805&amp;amp;hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Batch&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, a fully managed service to schedule, queue, and execute batch jobs on Google infrastructure. With development and batch runtime environments closely synced, changes can be quickly run at scale. Code developed on bench machines is pushed to GitHub and immediately available for batch execution. Then, as tools are reviewed and merged into `main` of the company’s monorepo, the new tools become automatically available on SandboxAQ scientists’ bench machines, who can launch parallel jobs processing millions of molecules on any kind of Google Cloud VM resource in any global zone, utilizing either on-demand or &lt;/span&gt;&lt;a href="https://cloud.google.com/compute/docs/instances/spot"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Spot VMs&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;SandboxAQ's implementation of a globally resolved transitive dependency tree, enables simple package and dependency management. With this practice, Google Batch can seamlessly integrate with individual tools developed by engineers to train many instances of a model in parallel.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Machine learning is a core component of SandoxAQ’s strategy, making easy data access especially important. At the same time, SandboxAQ’s Drug Discovery team also works with clients who have sensitive data. To secure customers’ data, bench and batch workloads read and write data from a unified interface that’s managed via IAM, allowing granular control of different data sources within the organization.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Meanwhile, Google Cloud services like Cloud Logging, Cloud Monitoring, Compute Engine and Cloud Run make it simple to develop tools to monitor these workloads, easily surface logs to SandboxAQ scientists, and comb through huge amounts of output data. As new features are tested or bugs show up, changes are made immediately available to the scientific team, without having to wrangle infrastructure. Then, as code becomes stable, they can incorporate it into downstream production applications, all in a centrally secured, unified way on Google Cloud.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In short, having a unified development, batch compute, and production environment on Google Cloud reduces the friction SandboxAQ faces to develop new workloads and run them at scale. With shared environments for scientific workload development and engineering, SandboxAQ makes it quick and easy for customers to move from experimentation to production, delivering the results customers want, fast.&lt;/span&gt;&lt;/p&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;SandboxAQ solution in the real world&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;SandboxAQ is already having a profound impact on drug discovery programs targeting a range of hard-to-treat diseases. For example, there are advanced collaborations with Professor Stanley Pruisner's lab at University of California San Francisco (&lt;/span&gt;&lt;a href="https://www.sandboxaq.com/press/sandboxaq-announces-bio-pharma-molecular-simulation-division-to-speed-life-saving-drugs-to-patients-through-ai-and-quantum-solutions" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;UCSF&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;), &lt;/span&gt;&lt;a href="https://www.sandboxaq.com/press/sandboxaq-announces-bio-pharma-molecular-simulation-division-to-speed-life-saving-drugs-to-patients-through-ai-and-quantum-solutions" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Riboscience&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://www.sandboxaq.com/press/sandboxaq-selected-by-sanofi-for-quantitative-ai-driven-biomarker-identification" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Sanofi&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and with the &lt;/span&gt;&lt;a href="https://www.sandboxaq.com/press/the-michael-j-fox-foundation-selects-sandboxaq-partner-for-25-million-initiative-to-develop-novel-parkinsons-disease-treatments" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Michael J Fox Foundation&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, to name a few. With this approach built on Google CloudSandboxAQ has achieved &lt;/span&gt;&lt;a href="https://www.sandboxaq.com/post/biopharmas-quantum-leap-2" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;a superior hit rate compared to other methods like high throughput screening&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, demonstrating the transformative potential of SandboxAQ on drug discovery and bringing cures to patients faster. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Visit the &lt;/span&gt;&lt;a href="https://cloud.google.com/solutions/ai-hypercomputer?hl=en"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud AI Hypercomputer web page&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; to learn about Google Cloud AI infrastructure.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Tue, 29 Apr 2025 15:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/infrastructure-modernization/sandboxaq-speeds-up-drug-discovery-with-the-cloud/</guid><category>AI &amp; Machine Learning</category><category>HPC</category><category>Infrastructure Modernization</category><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>SandboxAQ: Accelerating drug discovery through cloud integration</title><description></description><site_name>Google</site_name><url>https://cloud.google.com/blog/products/infrastructure-modernization/sandboxaq-speeds-up-drug-discovery-with-the-cloud/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Ruslan Mursalzade</name><title>Product Marketing Lead, Google Cloud AI Infrastructure</title><department></department><company></company></author></item><item><title>Google Cloud and Oracle accelerate enterprise modernization with new offerings, regions, and capabilities</title><link>https://cloud.google.com/blog/products/databases/google-cloud-and-oracle-accelerate-enterprise-modernization/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Supporting customers where they want to be is a core value at Google Cloud, and a big part of the reason that we have partnered with Oracle — so that you can innovate faster with the best of Google and the best of Oracle. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This week at Google Cloud Next, we announced significant expansions to our Oracle Database offerings, including the preview of &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Oracle Base Database Service&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; for a flexible and controllable way to run Oracle databases in the cloud; &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;general availability of Oracle Exadata X11M,&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt; &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;bringing the latest generation of the Oracle Exadata platform to Google Cloud; and additional &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;enterprise-ready capabilities&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; including &lt;/span&gt;&lt;a href="https://cloud.google.com/blog/products/identity-security/cloud-kms-autokey-can-help-you-encrypt-resources-quickly-and-efficiently"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;customer managed encryption keys&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; (CMEK). &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We are continuing to invest in global infrastructure for Oracle, with a total of 20 locations available in the coming months, adding Oracle Database@Google Cloud presence in Australia, Brazil, Canada, India, Italy, and Japan. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;These announcements follow our developments with Oracle since last July, when we launched &lt;/span&gt;&lt;a href="https://www.googlecloudpresscorner.com/2024-09-09-Oracle-and-Google-Cloud-Announce-the-General-Availability-of-Oracle-Database-Google-Cloud" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Oracle Database@Google Cloud&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. This partnership enables customers to migrate and modernize their Oracle workloads and start taking advantage of Google’s industry-leading data and AI capabilities such as &lt;/span&gt;&lt;a href="https://cloud.google.com/bigquery"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;BigQuery&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, &lt;/span&gt;&lt;a href="https://cloud.google.com/vertex-ai"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Vertex AI platform&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;, and &lt;/span&gt;&lt;a href="https://blog.google/technology/ai/google-gemini-ai/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini foundation models&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Additional features provide customers with even more options in their modernization journey, such as the fully managed &lt;/span&gt;&lt;a href="https://cloud.google.com/oracle/database/docs/overview"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Oracle Autonomous Database Serverless&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;. They can also benefit from increased reliability and resiliency features, such as cross-region disaster recovery and Oracle Maximum Availability Gold certification.&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="vertical-align: baseline;"&gt;"Banco Actinver is committed to providing innovative financial solutions to our clients. By combining the security and performance of Oracle Database with Google Cloud's data analytics and AI tools, we're gaining deeper insights into market trends, enhancing our services, and delivering personalized experiences to our customers," said Jorge Fernandez, CIO, Banco Actinver.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;$300 in free credit to try Google Cloud databases&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e26525ca0&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Start building for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;http://console.cloud.google.com/freetrial?redirectPath=/products?#databases&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Innovative new capabilities&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We're expanding our offerings to empower customers with the flexibility to manage a diverse set of database workloads cost effectively.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Oracle Base Database Service&lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; The new Base Database Service delivers a highly controllable and customizable foundational database platform, built on Oracle Cloud Infrastructure (OCI) virtual machines and general-purpose infrastructure. It can empower businesses with the flexibility to manage a diverse range of database workloads directly.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhanced Oracle Database Services:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; In addition to the availability of &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;Exadata Cloud Service, Autonomous Database Service, Oracle Linux, and Oracle on Google Compute Engine (GCE) and Google Kubernetes Engine (GKE), we are pleased to share general availability &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;of &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Oracle Exadata X11M&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;. Oracle Database@Google Cloud now offers the latest generation of Oracle Exadata machines, the X11M, with enhanced performance and scalability for demanding database workloads. These new machines provide significant performance gains and increased capacity, enabling customers to run even the most intensive Oracle applications with ease. X11M will be available in all new regions.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Customers are embracing Oracle Database@Google Cloud, and to support their global needs, we're expanding our footprint while maintaining the highest standards of application performance and reliability. &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Expanding to 20 Oracle Database@Google Cloud Locations &lt;/strong&gt;&lt;strong style="vertical-align: baseline;"&gt;in the coming months:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; To further support the growing demand for Oracle workloads on Google Cloud, we are launching in more locations, including U.S. Central 1 (Iowa), North America-Northeast 1 (Montreal), North America-Northeast 2 (Toronto), Asia-Northeast 1 (Tokyo), Asia-Northeast 2 (Osaka), Asia-South 1 (Mumbai), Asia-South 2 (Delhi), South America-East 1 (Sao Paulo), Europe-West (Italy), Australia-Southeast2 (Melbourne), and Australia-Southeast1 (Sydney) &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;—&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; and additional zones in Ashburn, Frankfurt, London, Melbourne, and Italy. The new regions and expanded capacity are in addition to Google Cloud regions across U.S. East (Ashburn), U.S. West (Salt Lake City), U.K. South (London), and Germany Central (Frankfurt) that are available today.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;New Partner Cross-Cloud Interconnect availability: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Partner Cross-Cloud Interconnect for OCI is pleased to expand our global network offerings with new multicloud connectivity between Google and Oracle Cloud Infrastructure in Toronto and Zurich. This further complements our existing 11 regions already served, ensuring the lowest possible latency between both clouds while keeping traffic private and secure.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Cross Region Disaster Recovery:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Cross Region Disaster Recovery support for Oracle workloads on Oracle Autonomous Database ensures high availability and resilience, protecting against potential outages and providing continuous operation for critical applications.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enterprise-grade networking upgrades:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Advanced networking upgrades enable enterprises to efficiently deploy their Oracle resources along with Google Cloud and share resources.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Industry-leading certifications and user experience&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud is committed to providing a seamless and efficient experience for Oracle customers, ensuring that managing and utilizing Oracle databases is straightforward and effective. We offer a combination of native Google Cloud tools and Oracle Cloud Infrastructure (OCI) interfaces, along with robust support for various applications and systems.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Enhanced user experience: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud is committed to providing an easy-to-use experience for Oracle customers, offering a Google Cloud integrated user experience for application developers and routine database operations, alongside an OCI-native experience for advanced database management. This includes support for Shared VPC, APIs, SDKs, and Terraform.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Application support:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Google Cloud is pleased to announce the support for Oracle applications running on Google Cloud, ensuring compatibility and optimal performance, including Oracle E-Business Suite, Peoplesoft Enterprise, JD Edwards Enterprise One, Hyperion Financial Management, and Retail Merchandising.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;SAP and Oracle Capability: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Oracle workloads on Google Compute Engine are now supported by SAP and Oracle, further validating Google Cloud as a trusted platform for running enterprise applications.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Integration with Google Cloud Monitoring&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Provides enterprises a unified monitoring and alerting mechanism across all their Google Cloud database services, now including Oracle Database.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;New support in Google Cloud Backup and DR:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Our backup service now provides central, policy-based management for backup of Oracle workloads along with other Google Cloud services using secure backup vaults for data protection — isolating and protecting data from threats like ransomware and accidental deletion.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Google Cloud's strengths make it the preferred hyperscaler for running mission-critical Oracle workloads. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Get started right away from your Google Cloud Console or &lt;/span&gt;&lt;a href="https://cloud.google.com/solutions/oracle"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;learn more here&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Thu, 10 Apr 2025 16:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/databases/google-cloud-and-oracle-accelerate-enterprise-modernization/</guid><category>Infrastructure Modernization</category><category>Partners</category><category>Google Cloud Next</category><category>Databases</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/images/Google_Cloud_x_Oracle.max-600x600.jpg" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Google Cloud and Oracle accelerate enterprise modernization with new offerings, regions, and capabilities</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/images/Google_Cloud_x_Oracle.max-600x600.jpg</image><site_name>Google</site_name><url>https://cloud.google.com/blog/products/databases/google-cloud-and-oracle-accelerate-enterprise-modernization/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Michelle Burtoft</name><title>Senior Product Manager, Google Cloud</title><department></department><company></company></author><author xmlns:author="http://www.w3.org/2005/Atom"><name>Rajesh Kasanagottu</name><title>Senior Engineering Manager, Google Cloud</title><department></department><company></company></author></item><item><title>Bringing Gemini and Google Agentspace to you on-premises</title><link>https://cloud.google.com/blog/products/ai-machine-learning/run-gemini-and-ai-on-prem-with-google-distributed-cloud/</link><description>&lt;div class="block-paragraph_advanced"&gt;&lt;p&gt;&lt;strong style="vertical-align: baseline;"&gt;Today we are announcing that Gemini will be available on Google Distributed Cloud (GDC), &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;bringing Google’s most capable models to on-premises environments, with public preview starting in Q3 2025. To do so, we’ve partnered with NVIDIA to bring our Gemini models to NVIDIA Blackwell systems that you can purchase through Google or your preferred channels. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="https://cloud.google.com/distributed-cloud"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;GDC&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; is a fully managed on-prem and edge cloud solution that is offered in both connected and air-gapped options, scaling from a single server to hundreds of racks. It offers infrastructure-as-a-service, security, data, and AI services, and is extensible with a rich ISV ecosystem. GDC takes care of infrastructure management, making it easy for your developers to focus on leveraging the best that AI has to offer and build applications, assistants, and agents. &lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 40px;"&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“NVIDIA and Google Distributed Cloud provide a secure AI platform, bringing Gemini models to enterprise datacenters and regulated industries. With NVIDIA Blackwell infrastructure and confidential computing, Google Distributed Cloud enhances privacy and security, and delivers industry-leading performance on DGX B200 and HGX B200 systems, available from Dell.&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;” – Justin Boitano, VP, Enterprise AI Software, NVIDIA.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Historically, organizations that face strict regulatory, sovereignty, latency, or data volume issues have been unable to access the latest AI technology since they must keep their data on-premises. Their only options have been open-source models and tools. And, in most cases, they have to put together the software and hardware themselves, which increases operational burden and complexity. With Gemini on GDC, you don’t have to compromise between the best of AI and the need to keep your data on-premises. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Our GDC air-gapped product, which is now authorized for US Government Secret and Top Secret missions, and on which Gemini is available, provides the highest levels of security and compliance.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;
&lt;div class="block-aside"&gt;&lt;dl&gt;
    &lt;dt&gt;aside_block&lt;/dt&gt;
    &lt;dd&gt;&amp;lt;ListValue: [StructValue([(&amp;#x27;title&amp;#x27;, &amp;#x27;$300 in free credit to try Google Cloud AI and ML&amp;#x27;), (&amp;#x27;body&amp;#x27;, &amp;lt;wagtail.rich_text.RichText object at 0x7f2e257d42e0&amp;gt;), (&amp;#x27;btn_text&amp;#x27;, &amp;#x27;Start building for free&amp;#x27;), (&amp;#x27;href&amp;#x27;, &amp;#x27;http://console.cloud.google.com/freetrial?redirectPath=/vertex-ai/&amp;#x27;), (&amp;#x27;image&amp;#x27;, None)])]&amp;gt;&lt;/dd&gt;
&lt;/dl&gt;&lt;/div&gt;
&lt;div class="block-paragraph_advanced"&gt;&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Gemini on GDC: unlocking generative AI anywhere&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://deepmind.google/technologies/gemini/pro/" rel="noopener" target="_blank"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Gemini&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; models deliver breakthrough AI performance: they can analyze million-token contexts; are multimodal, i.e., can process diverse data formats such as text, image, audio and video; and operate globally across 100+ languages. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Further, the Gemini API offers AI inferencing without having to worry about infrastructure, OS management, or model lifecycle management. This enables you to:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Add your own business context:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Use Retrieval Augment Generation (RAG) to personalize and augment the AI model’s output, eliminating the need for fine tuning or retraining the models. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Automate information processing and knowledge extraction: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Improve employee efficiency by using gen AI to quickly summarize long documents, analyze sentiment in reports or feedback, or add captions to image, audio, and video content. &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Create interactive conversational experiences: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Build deeper customer relationships by enabling Gemini-powered customer support agents, chatbots via natural language, and employee assistants.&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Tailor agents for your industry’s use case&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;: Unlock highly specialized capabilities and workflows by developing tailored agents for everyone from financial advisors, to security assistants, to robotics. &lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“Gemini on Google Distributed Cloud will empower ServiceNow to augment powerful agentic AI capabilities such as reasoning in our existing systems via robust APIs. This strategic deployment allows us to explore and implement cutting-edge advancements while upholding our commitment to customer trust and data protection.”&lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt; - Pat Casey, Chief Technology Officer &amp;amp; EVP of DevOps, ServiceNow&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Vertex AI: one platform for cloud and on-prem&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;In addition to bringing Gemini to Google Distributed Cloud, customers today already benefit from the Vertex AI platform on GDC, which lets them accelerate the development, deployment, and management of agentic applications.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;This complete AI platform offers:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Pre-trained APIs:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Ready-to-use, task-optimized, pre-trained APIs based on advanced Google models for translation, speech-to-text, and optical character recognition (OCR). These APIs offer advanced features such as customizable glossaries and in-place document translation&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Gen AI building tools:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Open-source and third-party models with optimized inferencing on GKE, delivering fast startup and auto-scaling &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Retrieval Augmented Generation (RAG): &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Grounding using Google Agentspace search and LLM API management and governance using Apigee on-prem &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Built-in embeddings API and AlloyDB vector database:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Powerful applications for personalization and recommendations, enabling improved user experiences&lt;br/&gt;&lt;br/&gt;&lt;/span&gt;&lt;span style="font-style: italic; vertical-align: baseline;"&gt;“With Google Distributed Cloud, Vertex AI, and Agentspace search, we will empower our Home Team innovators with a secure AI/ML platform and unified search, enabling the use of AI to enhance productivity and transform public safety for a safer and more secure future.” &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;- Chee Wee Ang, Chief AI Officer, HTX&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Agentspace: out-of-box access to on-prem data &lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Enterprises are eager to deploy gen AI, but they also struggle to connect large volumes of siloed information across various repositories and formats such as images, PDFs, and text. This hinders productivity and innovation. At the same time, building an in-house search solution is costly and requires access to scarce AI expertise. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We are excited to announce &lt;/span&gt;&lt;strong style="vertical-align: baseline;"&gt;Google Agentspace search will be available on GDC, &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;with public preview starting in Q3 2025. Google Agentspace search provides all enterprise knowledge workers with out-of-the-box capabilities that unify access to all your data in a secure, permissions-aware way.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;Agentspace gives you access to:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Company-branded, multimodal search agent: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;A conversational search interface that can answer complex questions based on your company’s unique information, acting as a central source of enterprise truth for your entire organization  &lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Pre-built enterprise data connectors: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Connectors to index data from the most common on-prem enterprise systems (such as Confluence, Jira, ServiceNow, and Sharepoint)&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Permissions-aware search results:&lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt; Robust access control list (ACL) enforcement that help ensure that search results are permission-aware, maintaining security and compliance for all your on-prem data&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li aria-level="1" style="list-style-type: disc; vertical-align: baseline;"&gt;
&lt;p role="presentation"&gt;&lt;strong style="vertical-align: baseline;"&gt;Agentspace agents: &lt;/strong&gt;&lt;span style="vertical-align: baseline;"&gt;Vertex AI is integrated out-of-the-box with Agentspace, starting with search agents, with more pre-built agents coming soon, and the ability to build your own&lt;/span&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;&lt;strong style="vertical-align: baseline;"&gt;Get started with gen AI on GDC&lt;/strong&gt;&lt;/h3&gt;
&lt;p&gt;&lt;span style="vertical-align: baseline;"&gt;We're constantly innovating on GDC to be the leading gen AI and modern application development that you can deploy anywhere. To bring Gemini and gen AI to your premises, &lt;/span&gt;&lt;span style="vertical-align: baseline;"&gt;please contact &lt;/span&gt;&lt;a href="https://cloud.google.com/contact"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;Google Cloud sales&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt; or reach out to any of our &lt;/span&gt;&lt;a href="https://cloud.google.com/distributed-cloud?e=48754805#partners-and-integration"&gt;&lt;span style="text-decoration: underline; vertical-align: baseline;"&gt;accredited global partners&lt;/span&gt;&lt;/a&gt;&lt;span style="vertical-align: baseline;"&gt;.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><pubDate>Wed, 09 Apr 2025 12:00:00 +0000</pubDate><guid>https://cloud.google.com/blog/products/ai-machine-learning/run-gemini-and-ai-on-prem-with-google-distributed-cloud/</guid><category>Infrastructure Modernization</category><category>Hybrid &amp; Multicloud</category><category>Google Cloud Next</category><category>AI &amp; Machine Learning</category><media:content height="540" url="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/Gemini_on_GDC.jpg" width="540"></media:content><og xmlns:og="http://ogp.me/ns#"><type>article</type><title>Bringing Gemini and Google Agentspace to you on-premises</title><description></description><image>https://storage.googleapis.com/gweb-cloudblog-publish/original_images/Gemini_on_GDC.jpg</image><site_name>Google</site_name><url>https://cloud.google.com/blog/products/ai-machine-learning/run-gemini-and-ai-on-prem-with-google-distributed-cloud/</url></og><author xmlns:author="http://www.w3.org/2005/Atom"><name>Vithal Shirodkar</name><title>VP/GM, Google Distributed Cloud</title><department></department><company></company></author></item></channel></rss>