Tag: gcp

10 entries tagged "gcp" — 7 posts, 3 links.

Posts

Apr 30, 2026 — 8 min — Platform & AI

The Go and gRPC Version of the SaaS Stack

When a SaaS product should graduate from a flexible Python-first backend into Go, gRPC, Cloud Run, and Google Cloud service boundaries.

Outcome: Mapped a Go and gRPC adoption path for SaaS teams that need stronger service contracts, concurrency, latency discipline, and Google Cloud operations without premature rewrites.

gcp go grpc cloud run software architecture

Jan 28, 2026 — 6 min — Platform & AI

Dataform + BigQuery Governance Release Patterns

A Dataform and BigQuery case study for turning data contracts, release lanes, validation gates, rollback behavior, and cost checks into one governed promotion path.

Outcome: Reduced contract-break risk in the sanitized release pattern by making schema, freshness, cost, and downstream impact checks part of promotion instead of after-the-fact review.

dataform bigquery data contracts release engineering gcp

Jan 12, 2026 — 5 min — Platform & AI

Compliant GCP Platform Playbook for Analytics and ML

A sanitized GCP platform case study where compliance, analytics delivery, and ML feature access had to be designed as one release path instead of three disconnected workstreams.

Outcome: Reduced governed dataset onboarding from weeks to days in the sanitized pattern while preserving auditability, cost visibility, and promotion rules for analytics and ML use cases.

gcp bigquery governance analytics ml

Dec 30, 2025 — 12 min — Platform & AI

Vertex AI Feature Store Is the Production Loop

A production-focused Vertex AI post on turning raw data, BigQuery features, online feature serving, model endpoints, monitoring, and retraining into one governed ML loop instead of another platform checklist.

Outcome: Defined a concrete Vertex AI feature-serving loop with source contracts, BigQuery feature views, point-in-time training exports, endpoint serving rules, monitoring thresholds, and retraining triggers.

gcp vertex ai feature store mlops gemini

Dec 26, 2025 — 10 min — Platform & AI

Vertex AI Makes More Sense as an MLOps Map

A Vertex AI architecture map for teams that need to decide which Google Cloud AI services belong in the ML lifecycle, where ownership changes hands, and which older assumptions are now unsafe.

Outcome: Gave teams an operating contract for using Vertex AI across data, features, training, deployment, monitoring, and generative AI without confusing a product menu for a production ML system.

gcp vertex ai mlops feature store model monitoring

Oct 31, 2025 — 14 min — Platform & AI

Cloud Run GPU Sidecars Need Deployment Discipline

A practical deployment guide for running Ollama behind Open WebUI on Cloud Run GPUs without mixing service specs, model storage modes, sidecar startup order, or auth assumptions.

Outcome: Clarified Cloud Run GPU sidecar deployment choices so model storage, service YAML, startup ordering, authentication, and billing constraints are explicit before launch.

gcp cloud run gpu ollama open webui

Oct 23, 2025 — 14 min — Platform & AI

What a Data Strategist Actually Does

A practical view of data strategy as the operating discipline that connects business goals, governance, KPIs, platforms, analytics, ML, and AI delivery.

Outcome: Connected data roadmaps, governance, KPI design, platform delivery, and stakeholder alignment so analytics and AI initiatives produced measurable business decisions.

data strategy data governance analytics gcp decision intelligence

Links

Threaddiscuss.google.devApr 19, 2026Permalink

Vertex AI Agent Engine Networking Overview

Google Developer forums

This forum post is useful because Agent Engine networking is exactly where cloud AI demos turn into platform engineering. Connectivity, controls, private access, and service boundaries are not side quests.

Keeping it here because Google Cloud agent work needs operational references, not only model and orchestration references.

Articledevelopers.googleblog.comMar 24, 2026Permalink

Gemini Embedding: Powering RAG and context engineering

Google Developers Blog

This is a Google-side reference for the embedding layer behind retrieval and context engineering. It is worth keeping because the site is already leaning into Vertex AI, Gemini, and production-grade AI systems, where embeddings are infrastructure rather than a demo detail.

The useful question is not whether embeddings are good in isolation. It is whether the retrieval loop improves task success, preserves source traceability, and handles stale or missing context gracefully.

gemini rag embeddings gcp

Articlemedium.comMar 19, 2026Permalink

Scaling Inference To Billions of Users And Agents

Federico Iezzi, Google Cloud

This is useful because it connects agent adoption to inference architecture. Agents do not make one call; they fan out across planning, retrieval, tool use, retries, and evaluation, which changes the serving math quickly.

Worth keeping as a scale reference for Google Cloud AI work. The product question is where inference cost becomes a feature constraint rather than a backend detail.

model serving gcp agents inference

All tags