Tag Archive

Below you'll find a list of all posts that have been tagged as “Google Cloud.”

Breaking the Text Barrier: Gemini API File Search Goes Multimodal

The Gemini API File Search tool now supports natively multimodal RAG with Embedding 2, inline citations, and custom metadata filters.

Valkey 9.0 Is Faster Than Redis. Now What?

Valkey 9.0 is now production-ready on Google Cloud. Snap, Juspay, and Fubo are already running it at scale. The result: 40% throughput gains, better developer tooling, and zero licensing drama. The Redis tax is officially optional.

TurboQuant Is Kind of a Big Deal.

Google Research just published a way to cut AI serving costs by 50% with zero accuracy loss. The interesting part is what happens to the ISVs who figure this out first.

Building Agents Is Easy. Infrastructure? Not So Much.

Building AI agents shouldn’t mean building agent infrastructure. Vertex AI Agent Builder handles the runtime, governance, and Google Search grounding so you can focus on what the agent actually does.

The Generative Media Suite ISVs Have Been Waiting For.

Veo 3 generates video with native audio. Imagen 4 renders at 2K. Lyria 3 Pro writes full songs. All on Vertex AI, all under one bill.

Beyond the Box Score: How MLB Built a Digital Scout

Delivering clever AI commentary to millions of baseball fans in under two seconds is a massive engineering challenge. Here’s how MLB used Gemini, pre-generation, and “surprisal” math to pull it off.

The Agentic Future Is a Governance Problem as Much as a Technology Problem

Most enterprise AI projects don’t fail because the technology doesn’t work. They fail because no one built the infrastructure to let it work at scale.

Most Enterprise AI Is Blind to 80% of Your Data

Your AI reads text just fine. It’s the contracts, recordings, and images it can’t touch that are going to cost you.

You Could Build Your Own Vector Search Stack. You Probably Shouldn’t.

Vertex AI Vector Search 2.0 went GA in March 2026. For ISVs building on Google Cloud, it collapses embedding pipelines, indexing, feature stores, and hybrid search into one managed service, so you can ship AI features instead of building infrastructure.

The Voice AI Fast Enough to Pass as Human. (Cartesia)

Cartesia built the world’s fastest voice AI on Google Cloud, hitting sub-90ms latency with its Sonic model. More than 50,000 companies are using it.

Gemini Code Assist Cut Their Dev Time by 37%. (ComplyAdvantage)

ComplyAdvantage cut development time 37% with Gemini Code Assist, enabling a 170-person engineering team to ship a major new offering on time. Here’s how they did it.

Their Data Was Everywhere. Now It Isn’t. (PayPal)

PayPal completed one of the largest data migrations in history, consolidating 400 petabytes of fragmented data into BigQuery. The goal was not just cleaner infrastructure; it was the foundation for everything AI-powered that comes next.

Google Built a TPU for the Age of Inference. Meet Ironwood.

TPU Ironwood is Google’s 7th-generation custom AI chip, and unlike its predecessors, it was built for inference first. Here’s what that means and why it matters.

MCP Is the New REST. Google Cloud Just Made It Enterprise-Ready.

The Model Context Protocol is becoming the standard for how AI agents call external tools. Google Cloud Managed MCP Servers handle the enterprise governance layer so you don’t have to.

Apigee Got a New Job: The Control Plane for Your AI.

Apigee has evolved from an API gateway into the control plane for LLM traffic, agent actions, and MCP tools. Here is why that matters for anyone building AI features at scale.

A2A Is How AI Agents Finally Learn to Play Nicely

Google’s Agent2Agent protocol, now under Linux Foundation stewardship, gives AI agents a standard way to find, authenticate, and collaborate with each other across any vendor or framework. For ISVs, it changes what a multi-agent product architecture can look like.