Writing

Long-form and short-form notes across data platforms, ML agents, evaluation, system design, and game development.

Jul 12, 2026 — 12 min — Systems Notes

G# Is the Surprise That Made .NET Feel New Again

G# is exciting not because it replaces C#, but because it puts Go, Kotlin, and Swift-flavored ergonomics on the .NET platform and reminds us that a runtime can support more than one great way to think.

Outcome: Reader can evaluate G# as an early .NET language experiment, understand why F# is the ecosystem's proof that distinct language models can thrive on one runtime, and choose a practical way to explore both without mistaking novelty for production readiness.

gsharp dotnet fsharp programming languages language design

Jun 30, 2026 — 7 min — Games & Sim

What I Owe the Games Pillar I Claimed

The archive retrospective made the Games & Sim gap visible: three public proof pieces in a 96-post archive. The fix is not a better tagline; it is an artifact-first devlog cadence.

Outcome: Reader can audit a public pillar claim against archive evidence, then choose a proof cadence or retire the claim.

editorial positioning game development simulation devlog publishing

Jun 29, 2026 — 8 min — Systems Notes

The Rewrite Sprint That Made My Archive Earn Its Labels

A retrospective on the rewrite sprint that started when a 40-post archive audit found essay and case_study labels attached to note-depth drafts.

Outcome: Reader can audit a technical-writing archive by separating tier labels from body evidence, then choose which posts to expand, retag, or hold.

editorial writing publishing personal blog content strategy

Jun 29, 2026 — 10 min — Games & Sim

Customer Experience Simulation Is Agent-Driven User Research

Why plausible synthetic customer behavior can turn an onboarding simulation into false evidence unless every journey claim has a trace and a real-world validation gate.

Outcome: Reader can design an agent-driven CX simulation as a hypothesis generator, with validation gates that keep synthetic behavior from being mistaken for customer evidence.

agents simulation customer experience user research product systems

Jun 20, 2026 — 8 min — Systems Notes

XState for Python Is a Shared Workflow Contract

A project note on JovaniPink/xstate-python: loading XState and Stately JSON in Python, executing with SCXML-style semantics, binding live Python handlers, and using actors without hiding workflow state in async glue.

Outcome: Reader understands what xstate-python is trying to make possible, where it fits against other Python state-machine libraries, and why SCXML run-to-completion semantics, actors, clocks, and context snapshots matter for production workflows.

xstate-python state machines python statecharts scxml

Jun 18, 2026 — 8 min — Platform & AI

Agent Frameworks Are Infrastructure Now

A practical 2026 map of AI agent frameworks by infrastructure primitive: orchestration, tools, state, multi-agent delegation, approvals, observability, evaluation, and deployment.

Outcome: Reframed agent-framework selection away from tier lists and toward an operating contract for which primitives the framework owns, which ones the team must own, and where bespoke orchestration is still justified.

agents agent frameworks mcp orchestration observability

Jun 16, 2026 — 6 min — Platform & AI

Free-Threaded Python Changes the Concurrency Question

A decision map for when Python teams should keep asyncio, use multiprocessing, try subinterpreters, or pilot free-threaded CPython without importing new race conditions.

Outcome: Separated Python concurrency choices by workload, state sharing, dependency readiness, and failure mode so free-threaded builds become a measured pilot instead of a default switch.

python free-threading concurrency performance systems architecture

Jun 16, 2026 — 7 min — Platform & AI

Python Architecture for AI and Data Systems in 2026

A Python architecture map for AI, data, and backend teams that need notebooks, prompts, evaluations, services, repositories, and infrastructure to stop collapsing into one folder.

Outcome: Defined a production Python layout for AI and data systems that separates experimentation, evaluation, domain logic, infrastructure adapters, and deployable service code.

python ai engineering data engineering software architecture evaluation

Jun 15, 2026 — 7 min — Platform & AI

Measure Python Performance Before You Change the Code

A Python performance playbook for choosing data structures, profiling tools, vectorized libraries, JIT experiments, and concurrency changes from evidence instead of taste.

Outcome: Provided a performance triage workflow that asks for a measured bottleneck before changing algorithms, data structures, runtime flags, concurrency models, or native libraries.

python performance profiling data engineering systems

Jun 15, 2026 — 6 min — Platform & AI

The Python Project Skeleton I Want Before the First Feature

A production Python project skeleton that prevents import confusion, dependency drift, and toolchain sprawl before the first API route or model workflow ships.

Outcome: Specified a repo baseline with src layout, uv locking, Ruff, typed checks, pytest, dependency groups, and CI gates so Python projects begin with executable architecture.

python project architecture uv ruff ci

Jun 14, 2026 — 7 min — Platform & AI

The 2026 Python Operating Standard Is Boring on Purpose

A practical Python 3.13+ operating standard for teams that need typed, readable, measurable systems without mistaking every new interpreter feature for a production default.

Outcome: Turned a pile of Python 3.13, 3.14, and 3.15-era advice into an adoption contract: stable defaults now, measured pilots for runtime changes, and automated gates before production.

python software architecture engineering standards typing quality gates

Jun 14, 2026 — 7 min — Platform & AI

Typing Turns Python Architecture Into a Contract

How to use modern Python typing, protocols, dataclasses, and payload types to stop raw dictionaries from becoming the hidden architecture of a production system.

Outcome: Gave Python teams a boundary pattern for converting untrusted payloads into typed domain objects before service logic, repositories, and agent tools can depend on them.

python typing domain modeling software architecture api design

Jun 13, 2026 — 6 min — Platform & AI

Pythonic Code in 2026 Is Explicit at the Boundaries

A modern Python style guide for choosing clear comprehensions, explicit None checks, pattern matching, t-strings, and domain exceptions where they improve system behavior.

Outcome: Converted Pythonic style advice into boundary rules for payload dispatch, template rendering, exception design, and readable transformations in production code.

python code quality pattern matching error handling software engineering

May 16, 2026 — 11 min — Systems Notes

Offline Claims PWA MVP for Field Adjusters

A pilot plan for the claims app failure that usually arrives late: photos captured offline, model scores nobody trusts, sync queues with no proof, and evidence bundles that cannot defend chain of custody.

Outcome: Reader can scope an adjuster-focused claims MVP around offline capture, on-device triage, sync recovery, audit receipts, and acceptance tests that produce pilot evidence instead of demo theater.

product engineering pwa offline-first machine learning insurance auditability

May 16, 2026 — 15 min — Systems Notes

State Machines in 2026: Durable Execution for Agents and Workflows

Why the May 2026 state-machine story is not finite automata becoming fashionable again, but durable execution becoming the reliability boundary for agents, workflow engines, and document-heavy product systems.

Outcome: Reader can distinguish statechart formalism from durable execution, pick the right runtime for agents and long-running workflows, and model document-heavy product lifecycles with explicit states, transitions, checkpoints, and ownership.

state machines durable execution agents workflow engines xstate temporal langgraph

May 15, 2026 — 16 min — Systems Notes

Modular Monolith vs Microservices: A Decision Memo for Real Products

Why teams get stuck between tangled monoliths and premature microservices, and how to choose the next boundary with delivery metrics, ownership, and blast radius.

Outcome: Reader can write a decision memo that chooses modular monolith, service extraction, or boundary repair based on team ownership, delivery metrics, scaling pressure, and observable blast radius.

software architecture microservices modular monolith product engineering engineering leadership

May 15, 2026 — 14 min — Systems Notes

System Design Papers: A Reading Map from GFS to AI Infrastructure

A de-duplicated taxonomy for system design papers: what to read first, what each paper teaches, and how to move from classic distributed systems into modern databases, observability, serverless, and AI infrastructure.

Outcome: Reader can turn scattered system-design paper lists into a practical reading path, identify duplicates and mixed source types, and choose papers by the design question they answer instead of by prestige.

system design distributed systems research papers software architecture databases ai infrastructure

May 14, 2026 — 17 min — Platform & AI

dbt on BigQuery Ingestion, Snapshots, and Cost Gates

A dbt on BigQuery starter kit for the parts that usually fail after the demo: raw loads without partition filters, snapshots with weak change detection, and CI that lets expensive SQL promote.

Outcome: Reader can scaffold a dbt and BigQuery project with manifest-backed incremental loads, timestamp-first snapshots, partitioned models, and a dry-run bytes gate before production promotion.

dbt bigquery data engineering analytics engineering ci cost controls

May 8, 2026 — 14 min — Platform & AI

Data Governance with AI in 2026: A Current Map for Operators

Half the 2025 AI-governance recipes still in production cite documents that were rescinded, delayed, or replaced in the last twelve months. The current map: what got retired, what's still authoritative, and what an operating governance program actually has to cover in 2026.

Outcome: Reader can audit their AI/data governance program against the actual 2026 regulatory and standards stack — including the federal rescissions, the EU AI Act timeline shift agreed May 7, 2026, the ISO/IEC 5259 Part 5 publication, and the OWASP Agentic Top 10 — and retire stale references with confidence.

data governance ai governance compliance iso 42001 nist ai rmf eu ai act owasp c2pa policy as code

May 3, 2026 — 7 min — Platform & AI

LLM and Agent Observability with OpenTelemetry GenAI Conventions

Why custom LLM logging leaves you flying blind in production, and how OpenTelemetry's GenAI semantic conventions turn every model call, tool invocation, and agent step into a traceable, cost-accountable span.

Outcome: Reader can instrument an LLM pipeline or agent workflow with OTEL GenAI conventions, export spans and cost metrics to any compatible backend, and build alerts on real token spend and latency instead of inferring from flat logs.

observability opentelemetry llms agents monitoring mlops ai engineering

May 2, 2026 — 12 min — Platform & AI

Agent Repo Trust Gates: Conftest Policies, SLSA Provenance, and SBOM in GitHub Actions

Why standard code review misses capability escalation in skill manifests, and how to wire a pre-merge conftest policy gate and post-merge SLSA provenance chain that actually work — correcting three common mistakes in the recipes that circulate online.

Outcome: Reader can wire a working pre-merge OPA/conftest gate on skill manifests, add a correct post-merge SLSA L2 provenance workflow using the SLSA GitHub Generator reusable workflow (not the nonexistent slsa CLI), and align OTel instrumentation with the GenAI semantic conventions.

supply chain github actions slsa conftest opa sbom agents ci/cd security

May 2, 2026 — 13 min — Platform & AI

Comprehension Debt: When Code Ships Without Theory

Why a two-day debug session on a one-month-old AI-generated bug is not a debugging problem but a theory-building problem you skipped, and the operating discipline that makes the missing theory recoverable.

Outcome: Reader has a working definition of comprehension debt distinct from technical debt, three questions to test whether a theory exists for an AI-generated component, a PR comprehension scoring rubric, and a deliberate-practice tactic set that prevents the doom loop.

ai coding assistants systems thinking technical debt comprehension developer workflow code review cognitive load

Apr 30, 2026 — 11 min — Systems Notes

The SaaS Stack I'd Use for LLM-Assisted Product Development

A pragmatic Python, TypeScript, Supabase, Stripe, and observability stack for shipping SaaS products with AI assistance without turning the architecture into tool soup.

Outcome: Defined a default SaaS stack decision brief that optimizes for typed boundaries, fast feedback loops, low operational drag, and clear graduation paths.

saas software architecture product engineering ai engineering full stack

Apr 30, 2026 — 8 min — Platform & AI

The Go and gRPC Version of the SaaS Stack

When a SaaS product should graduate from a flexible Python-first backend into Go, gRPC, Cloud Run, and Google Cloud service boundaries.

Outcome: Mapped a Go and gRPC adoption path for SaaS teams that need stronger service contracts, concurrency, latency discipline, and Google Cloud operations without premature rewrites.

gcp go grpc cloud run software architecture

Apr 29, 2026 — 10 min — Platform & AI

BigQuery Keys in dbt Are Optimizer Hints, Not Enforcement

How to use BigQuery primary and foreign key constraints from dbt without confusing optimizer metadata for enforced data integrity.

Outcome: Defined a BigQuery and dbt constraint playbook that keeps optimizer hints, dbt contracts, data tests, compiled SQL review, and INFORMATION_SCHEMA verification in the right order.

bigquery dbt data contracts analytics engineering query optimization data quality

Apr 29, 2026 — 15 min — Platform & AI

Your Repo Needs an Agent Harness, Not More Prompt Paste

A critical guide to README.md, AGENTS.md, CLAUDE.md, SKILL.md, .agents, and .claude patterns for teams that want coding agents to follow repo rules without stuffing every workflow into one giant prompt.

Outcome: Defined a repo documentation harness that separates human orientation, always-loaded agent rules, tool-specific compatibility files, on-demand skills, dynamic docs, and deterministic enforcement.

coding agents agent skills agents.md claude code developer workflow

Apr 28, 2026 — 9 min — Platform & AI

What ADK 2.0 Adds, and Where the Approval Path Still Breaks

Why an ADK 2.0 ToolConfirmation flow paired with VertexAiSessionService re-presented the same approval to a reviewer on Monday morning and ran the tool twice, and what the gap tells you about how to evaluate harness primitives at different maturity levels.

Outcome: Reader can map ADK 2.0 primitives onto a session-service backing store and decide which combinations are production-ready, which are beta-with-known-gaps, and which require waiting.

agents adk agent harness human in the loop memory

Apr 28, 2026 — 11 min — Platform & AI

Why I Reach for DuckDB When Reading Parquet from Swift or Zig

What an oversized iOS binary, a Linux linker error, and a SQL boundary teach about embedding DuckDB as the Parquet reader for languages without a mature native library.

Outcome: Reader can decide when DuckDB is the right Parquet path for a Swift or Zig project, configure the SPM and build.zig integrations correctly the first time, and avoid the binary-size and linker failures that the unconfigured path produces.

data engineering parquet duckdb swift zig

Apr 27, 2026 — 13 min — Systems Notes

State Machines in Go, Elixir, Swift, and Zig

Why a Go retry loop ran forever because the attempt counter lived on the loop instead of the state, and what the runtime guarantees of Elixir, Swift, and Zig change about which state-machine idioms are honest in each.

Outcome: Reader can pick the right state-machine idiom for their language by recognizing which runtime guarantees the language ships, distinguish a true finite-state machine from unidirectional data flow, and avoid the cross-language mistake of treating one language's idiom as the universal pattern.

state machines go elixir swift zig

Apr 26, 2026 — 10 min — Platform & AI

Minimal ML Examples Are Better as Review Maps Than Cheatsheets

How a compact Python ML cheatsheet becomes useful when synthetic demos, metrics, pipelines, and version drift are tied to the model-review decisions they can actually defend.

Outcome: Reader can use minimal scikit-learn examples as smoke tests for task framing, metric choice, pipeline boundaries, and environment drift instead of treating them as production recipes.

machine learning scikit-learn model evaluation mlops python

Apr 25, 2026 — 12 min — Platform & AI

Every Engineer Is a Manager Now

AI coding agents are turning software work into management work: engineers now have to manage intent, context, agent output, teammate coordination, stakeholder evidence, and long-term maintenance.

Outcome: Defined a public operating model for engineers and consultants who need to coordinate human teammates and AI agents without producing artifacts that create hidden technical debt.

ai agents engineering leadership software process technical communication consulting

Apr 24, 2026 — 9 min — Platform & AI

Reading Parquet from Elixir and Mojo Without Pretending the Runtime Is Native

Why a precompiled-NIF fall-through on a less-common Linux target adds quiet minutes to a deploy, and what the borrowed-runtime pattern actually looks like for Elixir and Mojo.

Outcome: Reader can ship Parquet-reading Elixir without surprise source compilation in CI, recognize where Mojo's Python interop boundary is the bottleneck rather than Mojo itself, and know which DataFrame guarantees leak at the BEAM and PyArrow boundaries.

data engineering parquet elixir mojo deployment

Apr 23, 2026 — 11 min — Systems Notes

State Machines in Python: from xstate-python to LangGraph

Why an agent harness re-fired a half-finished tool after a worker restart, the four Python libraries that solve different parts of the problem, and a concrete contribution roadmap for xstate-python.

Outcome: Reader can map a Python workflow to the right state-machine library, distinguish statechart formalism from durable execution, and know where to start contributing to xstate-python with file paths and named missing features.

state machines python xstate-python langgraph agentic ai

Apr 22, 2026 — 8 min — Platform & AI

Building an NPS Classifier You Can Actually Act On

A scikit-learn NPS ordinal classifier with SMOTE, probability calibration, utility-based thresholding, and PSI drift checks. The parts that make it useful to the retention team, not just accurate on a dashboard.

Outcome: Shipped a calibrated multiclass NPS model with a utility-driven operating threshold and a PSI-based drift loop, giving the retention team a per-customer detractor probability they can act on and a rule for when to retrain.

ml nps classification calibration drift evaluation

Apr 21, 2026 — 14 min — Platform & AI

Coding Assistants Work Best When the Blast Radius Is Small

An Android-first operating pattern for using GitHub Copilot, Amazon Q Developer, Android CLI, and Android skills without letting coding assistants rewrite Gradle, manifests, architecture, and security posture by accident.

Outcome: Defined a repeatable assistant workflow for Android teams that combines sliced prompts, repo instructions, Android skills, screenshots, atomic commits, tests, and GHAS gates into one controlled development loop.

coding assistants android github copilot amazon q mobile engineering

Apr 20, 2026 — 11 min — Platform & AI

How I Read Parquet in Rust and Go Without an OOM

Why a default Go parquet.Read[T] call slurped a 1.4 GB file into 11 GB of resident memory, and the column-native Rust and Go patterns that replaced it.

Outcome: Reader can pick the streaming Parquet read path in Rust and Go, configure the compression-codec features explicitly, and avoid the eager-load anti-patterns that look fine in benchmarks and break in production.

data engineering parquet rust go memory safety

Apr 19, 2026 — 11 min — Systems Notes

XState, Actors, and What the Stately Argument Actually Buys

Why a hand-rolled retry double-charged a Stripe customer because the cancel state was implicit, and what XState 5's setup-plus-actors model gives you that useReducer does not.

Outcome: Reader can write an XState 5 machine using the setup pattern, distinguish invoked from spawned actors, decide when to graduate from useReducer to a state machine library, and read XState code as a structured argument rather than a configuration object.

state machines xstate typescript react actor model

Apr 18, 2026 — 13 min — Platform & AI

Treat Agent Skills Like Supply-Chain Dependencies

A repo-ready operating contract for agent skills that prevents prompt bundles from drifting into unsigned, over-permissioned, unreviewed production dependencies.

Outcome: Defined a hardened-by-default skill contract covering version pins, manifest provenance, prompt review, IO tests, least-privilege tools, runtime isolation, observability, rotation, and decommissioning.

agent skills supply chain agents security developer workflow

Apr 17, 2026 — 15 min — Platform & AI

AI Coding Assistants Expose Process Debt

Why teams using Claude, GPT-style coding agents, Cursor, and Copilot often get unstable app work when requirements, versions, conventions, tests, and handoffs are implicit.

Outcome: Defined a docs-first assistant workflow that turns requirements, pinned stack choices, task slices, review loops, tests, and Git checkpoints into a repeatable way to ship with AI without surrendering architecture control.

coding assistants software process ai agents developer workflow technical leadership

Apr 15, 2026 — 8 min — Systems Notes

When the State Chart Pays Off

Why a React form with seven boolean flags shipped a flicker bug that statecharts would have surfaced before the first render, and the decision rule that says when this discipline earns its place.

Outcome: Reader can decide when a workflow is state-machine-shaped, replace boolean-flag explosion with a small statechart that names guards and transitions, and recognize statecharts as an architectural discipline rather than a UI utility.

state machines statecharts software architecture react engineering discipline

Apr 14, 2026 — 4 min — Platform & AI

What AI Researchers Do That I Do Not

A short, honest read on what AI researchers actually do day to day, written from outside the role by an applied engineer who reads papers when the work demands it.

Outcome: Reader can distinguish AI research work from applied AI engineering work, decide which research outputs change their quarter and which do not, and avoid hiring or being hired against the wrong role description.

ai engineering ai research engineering discipline career

Apr 13, 2026 — 11 min — Systems Notes

Product-Minded Architecture Lives Between Design, Business, and Engineering

Why design systems, component libraries, monorepos, and architecture boundaries become product tools when teams need to learn faster.

Outcome: Mapped a product-minded architecture that connects design system foundations, app surfaces, analytics, support workflows, and delivery packages.

product management software architecture design systems frontend architecture monorepos

Apr 9, 2026 — 10 min — Systems Notes

A Product Development System Should Remember Why the Roadmap Changed

Why roadmap churn becomes expensive when feedback, analytics, MVP evidence, and decision history are not connected.

Outcome: Defined a feedback-to-roadmap decision record that preserves the evidence behind task breakdowns, pivots, and next bets.

product management roadmaps analytics feedback loops software systems

Apr 5, 2026 — 12 min — Systems Notes

The Product Manager's Real Job Is Cutting Scope Until Learning Can Ship

Why product work fails when the team chases domain complexity before the MVP has created real learning from users.

Outcome: Defined an MVP decision brief that keeps product managers, engineers, and designers focused on the smallest slice that can test a market or workflow thesis.

product management mvp product discovery software delivery strategy

Apr 1, 2026 — 19 min — Platform & AI

A Software Architecture Reading Path for Working Engineers

A practical reading path through software design, architecture, system design interviews, data-intensive applications, and systems analysis for engineers who want to grow beyond implementation.

Outcome: Reviewed the architecture and system design books from the DEV Community list, corrected the list count, summarized each book, and arranged them into a practical learning path.

software architecture software engineering systems design reading list engineering growth

Mar 28, 2026 — 13 min — Systems Notes

Design Thinking Is Human Decision Work

Why design thinking fails when teams treat it as a workshop template instead of a human-centered way to make better product decisions under uncertainty.

Outcome: Defined a design-thinking decision loop and workshop brief that keep teams focused on user evidence, safe ideation, cheap prototypes, and explicit next decisions.

design thinking product discovery decision making human-centered design systems thinking

Mar 24, 2026 — 13 min — Systems Notes

A Software Developer Job Description Is an Operating Contract

Why generic software developer job descriptions over-index on writing code and under-specify the ownership, testing, maintenance, communication, and judgment that make software engineering work.

Outcome: Provided a practical role model and responsibility checklist that teams can use to write clearer software developer expectations and evaluate engineering work beyond code output.

software engineering engineering roles hiring technical leadership maintenance

Mar 20, 2026 — 8 min — Platform & AI

Fine-Tuning GPT-OSS 20B on a 64GB MacBook Pro

A practical MLX-first recipe for experimenting with openai/gpt-oss-20b on a 64GB Apple Silicon Mac without confusing local LoRA work for CUDA-scale training.

Outcome: Defined a local 64GB MacBook Pro fine-tuning path for GPT-OSS 20B that prioritizes Harmony formatting, MLX quantized LoRA, small evals, and a clear fallback to NVIDIA when scale is required.

gpt-oss mlx apple silicon llm fine-tuning local ai

Mar 16, 2026 — 7 min — Platform & AI

Fine-Tuning LLMs on a MacBook Pro With MPS and MLX

Why Apple Silicon is useful for local LLM prototyping and LoRA experiments, but still has sharp boundaries compared with CUDA-scale NeMo or Hugging Face training.

Outcome: Separated Mac-local MPS and MLX fine-tuning paths from NVIDIA-only training features so local experiments can start with realistic hardware expectations.

apple silicon mlx pytorch mps llm fine-tuning

Mar 12, 2026 — 9 min — Platform & AI

The Faster Transformers Stack Behind GPT-OSS

Why Hugging Face's faster Transformers work matters beyond GPT-OSS, and how kernels, MXFP4, parallelism, KV cache, batching, and model loading change practical LLM runtime decisions.

Outcome: Mapped the GPT-OSS-era Transformers runtime features into concrete decisions about memory, compute, cache behavior, batching, and serving boundaries.

transformers gpt-oss hugging face inference model performance

Mar 8, 2026 — 10 min — Platform & AI

Fine-Tuning LLMs Is an Operating Loop, Not a Training Command

Why LLM fine-tuning projects fail when teams jump to NeMo or Hugging Face training commands before deciding the model, data, evaluation, serving, and governance loop.

Outcome: Defined a fine-tuning operating loop that connects base-model choice, data curation, PEFT, evaluation, distributed training, serving, and governance into one repeatable release path.

llm fine-tuning nemo hugging face peft llmops

Mar 8, 2026 — 7 min — Systems Notes

Why Data Platforms Fail as Systems, Not Tools

A data platform failure pattern where tool replacement looked like the fix, but the real problem was ownership, release discipline, metric mismatch, and governance outside the workflow.

Outcome: Reframed platform recovery around ownership contracts, operating metrics, and release discipline so teams could fix the system instead of replacing another tool.

data platform systems design organizational design governance engineering

Mar 7, 2026 — 6 min — Systems Notes

What Complexity Science Teaches About AI Evaluation

A practical AI evaluation essay showing how locally strong retrieval, reasoning, and tool-use components can interact into globally weak product behavior.

Outcome: Improved evaluation strategy by testing full decision paths, interaction effects, feedback loops, and second-order behavior instead of isolated component scores.

complexity ai evaluation systems thinking product decision intelligence

Mar 5, 2026 — 6 min — Games & Sim

Building Abuela's Core Loop Across Unity and Web Surfaces

How Abuela's Unity gameplay loop and supporting web surfaces can share progression state, event contracts, reward rules, and iteration hooks without duplicating game logic.

Outcome: Promoted Abuela from a note to an essay by defining a loop architecture that keeps Unity runtime state, web companion flows, reward rules, and content iteration aligned.

unity next.js game loop systems architecture integration

Mar 4, 2026 — 11 min — Platform & AI

NVFP4 and the Infrastructure Meaning of Precision

A grounded read of NVIDIA's NVFP4 training post and why 4-bit pretraining matters for model quality, token throughput, cost, and AI infrastructure strategy.

Outcome: Explained NVIDIA's NVFP4 training recipe, separated the credible technical signal from the marketing surface, and connected low-precision training to practical AI infrastructure decisions.

llm training nvidia quantization model efficiency ai infrastructure

Mar 1, 2026 — 6 min — Games & Sim

Designing the Hippi Kingdom Economy as a Systems Problem

A game economy design essay for Hippi Kingdom covering currency loops, sinks and sources, telemetry, a rejected progression model, and the balancing mistake that made hoarding look like engagement.

Outcome: Created an economy balancing framework that separates progression health from currency hoarding, making pacing, reward pressure, and retention tradeoffs easier to test.

game dev economy design systems telemetry unity

Feb 28, 2026 — 15 min — Platform & AI

Context Engineering Keeps Long Context Useful

A practical synthesis of Drew Breunig, Simon Willison, and Anthropic on how long contexts fail, how to fix them, and why multi-agent systems need context discipline.

Outcome: Turned long-context failure modes into an engineering playbook for selecting, isolating, pruning, summarizing, offloading, and evaluating context in agent systems.

agents context engineering llm evaluation tool use multi-agent systems

Feb 24, 2026 — 20 min — Platform & AI

From Algorithms to AI Systems

A practical map from algorithmic complexity to software engineering, data pipelines, machine learning systems, and modern LLM architecture decisions.

Outcome: Connected classical algorithm analysis to production software, ML pipelines, RAG systems, model serving, and the trade-offs behind modern AI research.

algorithms machine learning llm systems data engineering systems design

Feb 24, 2026 — 6 min — Platform & AI

DSPy + RAG Evaluation Ops in Production

How to turn DSPy and RAG evaluation into a production release loop with golden sets, retrieval checks, generation rubrics, regression thresholds, and versioned prompt programs.

Outcome: Promoted the note into an essay by defining a repeatable RAG evaluation workflow that separates retrieval quality from generation quality and blocks prompt-program regressions before release.

dspy rag evaluation mlops agents

Feb 20, 2026 — 18 min — Systems Notes

An Enterprise Data Governance Glossary Operators Can Use

A practical enterprise data governance glossary that turns business intelligence, stewardship, metadata, security, privacy, quality, and lifecycle terms into usable review language.

Outcome: Created a shared vocabulary and term-entry contract that helps governance, data engineering, analytics, security, and business teams align definitions before certifying data products.

data governance data management metadata privacy business intelligence

Feb 16, 2026 — 15 min — Systems Notes

Data Governance Roles Need Decision Rights

A data governance operating model for assigning owners, stewards, custodians, and SMEs without leaving quality rules, access decisions, retention, source-of-truth choices, and incident closure ambiguous.

Outcome: Defined a role-and-cadence contract that lets governance teams assign decision rights, artifacts, escalation paths, and success measures before a data product is certified.

data governance data platform organizational design compliance operating model

Feb 12, 2026 — 15 min — Systems Notes

Principle Stacks Make Trade-offs Explicit

A practical look at Principle Stacks as a decision mechanism for teams, products, and personal priorities when important values collide.

Outcome: Defined Principle Stacks and Priority Stacks as ranked decision mechanisms, explained their failure modes, and provided a template for using them in product, engineering, and personal planning.

decision making principles systems thinking leadership product strategy

Feb 10, 2026 — 6 min — Platform & AI

Evaluating Multi-Agent Workflows for Enterprise Reliability

A practical evaluation loop for multi-agent workflows that catches demo-friendly failures in task handoff, tool use, permissions, latency, and completion criteria before release.

Outcome: Established a repeatable evaluation workflow that gates multi-agent releases on task completion, handoff quality, tool correctness, latency, and recoverability instead of demo impressions.

agents evaluation reliability enterprise ai observability

Feb 8, 2026 — 15 min — Systems Notes

Product Planning Is Shaping the Work

A practical view of product planning through Shape Up, user flow, business logic, mockups, architecture, discovery, timelines, and technical debt.

Outcome: Clarified product planning as the work of shaping user flow, business rules, implementation boundaries, discovery evidence, and technical constraints before committing delivery capacity.

product development systems design software architecture product discovery execution

Feb 4, 2026 — 18 min — Platform & AI

Machine Learning Terms That Make Model Reviews Better

A practical ML terminology guide for model reviews where feature definitions, data splits, task type, optimization behavior, overfitting risk, regularization, ensembles, and embeddings need to be discussed precisely.

Outcome: Gave peers a review-ready vocabulary for inspecting ML systems by connecting core terms to design choices, failure modes, and release questions.

machine learning model evaluation feature engineering neural networks mlops

Jan 31, 2026 — 10 min — Platform & AI

The Preprocessing Boundary Between scikit-learn and PyTorch

A production-friendly pattern for pairing scikit-learn preprocessing graphs with PyTorch models so training and inference use the same feature contract.

Outcome: Defined an artifact contract that keeps column preprocessing, feature order, model weights, metadata, and inference behavior synchronized across batch and serving environments.

machine learning pytorch scikit-learn mlops model serving

Jan 28, 2026 — 6 min — Platform & AI

Dataform + BigQuery Governance Release Patterns

A Dataform and BigQuery case study for turning data contracts, release lanes, validation gates, rollback behavior, and cost checks into one governed promotion path.

Outcome: Reduced contract-break risk in the sanitized release pattern by making schema, freshness, cost, and downstream impact checks part of promotion instead of after-the-fact review.

dataform bigquery data contracts release engineering gcp

Jan 27, 2026 — 18 min — Platform & AI

Local MCP and Private Open Model Infrastructure

A practical guide to running MCP servers locally, choosing affordable clients, and deploying private open models with Cloud Run, Ollama, and Open WebUI.

Outcome: Separated local agent tool access from private model serving, then defined a safer setup for MCP clients, local servers, and Cloud Run GPU sidecars.

mcp agents cloud run ollama open webui

Jan 23, 2026 — 17 min — Systems Notes

Lead Measures Make Dashboards Useful

A dashboard-before/dashboard-after operating pattern for turning lead measures into weekly commitments instead of passive reporting.

Outcome: Defined a lead-measure scoreboard and cadence that turns reporting into weekly action, with explicit checks for whether the metric is actually changing behavior.

analytics dashboards execution decision intelligence operating model

Jan 19, 2026 — 18 min — Platform & AI

API Design for MCP Server Boundaries

A Confluence-ready guide for designing durable HTTP APIs and wrapping them safely as Model Context Protocol servers.

Outcome: Turned general API design guidance into a practical standard for HTTP APIs that back MCP servers, with current protocol corrections, checklists, and source links.

api design mcp agent systems software architecture platform engineering

Jan 15, 2026 — 11 min — Platform & AI

When 0.3 Does Not Mean 30 Percent

How imbalanced classifiers can keep a strong AUC while producing probabilities that break thresholds, alerts, and cost-sensitive decisions in production.

Outcome: Defined a production calibration gate that logs Brier score, ECE, reliability diagrams, cost-sensitive thresholds, run metadata, and promotion criteria for imbalanced classifiers.

ml calibration classification evaluation probability reliability

Jan 12, 2026 — 5 min — Platform & AI

Compliant GCP Platform Playbook for Analytics and ML

A sanitized GCP platform case study where compliance, analytics delivery, and ML feature access had to be designed as one release path instead of three disconnected workstreams.

Outcome: Reduced governed dataset onboarding from weeks to days in the sanitized pattern while preserving auditability, cost visibility, and promotion rules for analytics and ML use cases.

gcp bigquery governance analytics ml

Jan 11, 2026 — 12 min — Platform & AI

scikit-learn Pipelines That Survive Tuning and Deployment

Why tabular models drift between notebooks and production when preprocessing, sample metadata, hyperparameter search, and persistence are not treated as one scikit-learn pipeline contract.

Outcome: Defined a scikit-learn pipeline contract that keeps column preprocessing, metadata routing, hyperparameter search, evaluation, and deployment artifacts reproducible across dev, stage, and production.

machine learning scikit-learn mlops model persistence tabular data

Jan 7, 2026 — 20 min — Platform & AI

Statistics for Data Science, Written for Software Developers

A software-developer guide to the statistics that actually change data-science decisions: samples, estimates, uncertainty, effect size, bias, probability, distributions, and model metrics.

Outcome: Defined a practical estimate-review workflow that helps software developers report effect size, confidence intervals, p-values, sampling bias, and classification metrics without treating statistics as glossary trivia.

statistics data science machine learning model evaluation experimentation

Dec 30, 2025 — 12 min — Platform & AI

Vertex AI Feature Store Is the Production Loop

A production-focused Vertex AI post on turning raw data, BigQuery features, online feature serving, model endpoints, monitoring, and retraining into one governed ML loop instead of another platform checklist.

Outcome: Defined a concrete Vertex AI feature-serving loop with source contracts, BigQuery feature views, point-in-time training exports, endpoint serving rules, monitoring thresholds, and retraining triggers.

gcp vertex ai feature store mlops gemini

Dec 26, 2025 — 10 min — Platform & AI

Vertex AI Makes More Sense as an MLOps Map

A Vertex AI architecture map for teams that need to decide which Google Cloud AI services belong in the ML lifecycle, where ownership changes hands, and which older assumptions are now unsafe.

Outcome: Gave teams an operating contract for using Vertex AI across data, features, training, deployment, monitoring, and generative AI without confusing a product menu for a production ML system.

gcp vertex ai mlops feature store model monitoring

Dec 22, 2025 — 15 min — Platform & AI

Correlation Is a Feature Screen, Not a Feature Strategy

A long-form feature-screening workflow that uses correlation for quick linear checks, then adds redundancy clustering, mutual information, chi-squared tests, L1 models, tree importances, permutation importance, and domain review.

Outcome: Defined a practical feature review loop that prevents teams from dropping useful nonlinear signals or keeping redundant features just because a correlation heatmap looked convincing.

machine learning feature selection correlation scikit-learn model evaluation

Dec 2, 2025 — 16 min — Systems Notes

TypeScript Concepts Make More Sense Inside React

A practical TypeScript and React guide to the event loop, hoisting, throttling, debouncing, timers, closures, callbacks, IIFEs, promises, async, and await through code patterns that show up in real components.

Outcome: Provided a React-centered runtime map and reusable TypeScript examples for debugging async UI behavior, timer cleanup, stale closures, callback flow, and promise-based rendering work.

typescript react javascript frontend software engineering

Nov 20, 2025 — 14 min — Platform & AI

Agent Memory Is an Operating Boundary

A practical look at Google ADK memory, Vertex AI Memory Bank, session state, retrieval, retention, access control, and why durable agent memory needs production discipline.

Outcome: Clarified the difference between short-term session state and durable agent memory, then mapped the operational risks around retrieval, security, retention, cost, and memory poisoning.

agents google cloud adk memory rag

Nov 16, 2025 — 8 min — Platform & AI

The Question About Your AI Agent Has Changed

Capability is no longer the hard question about AI agents. What the agent is permitted to do, and whether it will do it successfully, are. Here is why that distinction matters architecturally.

Outcome: Reframed agent deployment decisions around permission scope and blast radius rather than capability, reducing the risk of production failures from over-permissioned agentic systems.

agents ai governance enterprise ai authorization security

Nov 12, 2025 — 13 min — Platform & AI

Codex Plugins Extend Agents, Not Interfaces

Why Codex plugins point toward a different software design mindset: fewer UI extensions, more safe agent capabilities, system access points, and operational boundaries.

Outcome: Framed plugins as reusable agent capability bundles that require structured systems, permissions, predictable workflows, and safer operational surfaces.

codex agents plugins mcp software architecture

Nov 8, 2025 — 13 min — Platform & AI

Sandboxed Agents and the Production Automation Boundary

OpenAI's April 2026 Agents SDK update matters because sandboxed execution, manifests, resumable state, and memory move agents closer to real production automation.

Outcome: Framed sandboxed agent execution as an architecture boundary for safer, stateful, long-running automation instead of another demo-layer SDK feature.

agents openai sandboxing automation enterprise ai

Nov 4, 2025 — 15 min — Platform & AI

AI Strategy Starts Before the Model

A practical AI strategy framework with a worked example that connects business levers, data readiness, pilots, evaluation, governance, deployment, and operating metrics.

Outcome: Defined an end-to-end AI strategy playbook and worked example that ties data readiness, use-case selection, model development, governance, deployment, and operating ownership to measurable business outcomes.

ai strategy data strategy mlops llmops business outcomes

Oct 31, 2025 — 14 min — Platform & AI

Cloud Run GPU Sidecars Need Deployment Discipline

A practical deployment guide for running Ollama behind Open WebUI on Cloud Run GPUs without mixing service specs, model storage modes, sidecar startup order, or auth assumptions.

Outcome: Clarified Cloud Run GPU sidecar deployment choices so model storage, service YAML, startup ordering, authentication, and billing constraints are explicit before launch.

gcp cloud run gpu ollama open webui

Oct 27, 2025 — 10 min — Platform & AI

In-Warehouse Inference on Snowflake and BigQuery

A practical runbook for scoring changed rows close to the data using Snowflake Streams and Tasks or BigQuery scheduled queries and remote models.

Outcome: Compared Snowflake and BigQuery patterns for scheduled in-warehouse inference, corrected CDC assumptions, and defined monitoring, grants, and deployment checks.

snowflake bigquery mlops inference data engineering

Oct 23, 2025 — 14 min — Platform & AI

What a Data Strategist Actually Does

A practical view of data strategy as the operating discipline that connects business goals, governance, KPIs, platforms, analytics, ML, and AI delivery.

Outcome: Connected data roadmaps, governance, KPI design, platform delivery, and stakeholder alignment so analytics and AI initiatives produced measurable business decisions.

data strategy data governance analytics gcp decision intelligence

Oct 19, 2025 — 12 min — Platform & AI

When the Model Should Say It Doesn't Know: Conformal Prediction Sets with MAPIE

How to add coverage-guaranteed prediction sets, temperature scaling calibration, and risk-coverage curves to a classifier using MAPIE — the pieces that make uncertainty quantification operationally useful rather than decorative.

Outcome: Added coverage-guaranteed prediction sets and operational abstention gates to a classification pipeline, cutting acted-upon error rate without retraining the model.

ml conformal-prediction calibration uncertainty mapie selective-prediction

Oct 15, 2025 — 18 min — Platform & AI

Fine-Tuning Open Source LLMs With NVIDIA NeMo

A practical map of NVIDIA NeMo for teams that want to curate data, fine-tune open-source LLMs, evaluate them, and move from research checkpoints to production inference.

Outcome: Separated data curation, fine-tuning, alignment, evaluation, export, and serving concerns so open-source LLM customization could move from experiments to governed production workflows.

nemo llm fine-tuning mlops gpu training enterprise ai

Oct 11, 2025 — 16 min — Platform & AI

Plain-Language Machine Learning Metrics for Real Decisions

A practical explanation of ML metrics with decision tables for regression tolerance, rare-event classification, threshold tradeoffs, and the failure case where accuracy looked good but the decision failed.

Outcome: Clarified how metric choice, threshold design, tree-based pattern discovery, and logit interpretation affect whether ML outputs are useful for action.

machine learning model evaluation classification regression interpretability

Oct 7, 2025 — 7 min — Platform & AI

Probability Calibration Is an Operating Control

A practical playbook for turning classifier scores into reliable probabilities that can support ranking, thresholds, SLAs, and cost-sensitive decisions.

Outcome: Defined a calibration workflow that separates ranking from probability quality, uses scikit-learn calibration correctly, and carries thresholds and monitoring into production.

machine learning calibration mlops classification model evaluation

Oct 3, 2025 — 7 min — Platform & AI

The Three-Run Lab: How I Triage Slow PyTorch Training

A repeatable triage routine — the three-run baseline, DataLoader diagnosis, five profiler signatures, and a copy-paste scaffold — for finding where training time actually goes before touching the model.

Outcome: Identified and resolved training bottlenecks in under an hour by running the three-run baseline and reading profiler signatures before changing any model code.

pytorch ml training performance profiling debugging

Sep 29, 2025 — 7 min — Platform & AI

PyTorch Training Throughput: The Patterns That Actually Move the Number

torch.compile, mixed precision, gradient accumulation, DDP vs FSDP, and the profiler — the five levers I reach for before rethinking the model architecture.

Outcome: Cut training wall-clock time and GPU memory pressure by applying compile, AMP, and accumulation patterns in sequence before ever touching model architecture.

pytorch ml training performance gpu distributed-training

Sep 25, 2025 — 12 min — Platform & AI

A scikit-learn Pipeline for Calibrated Decisions

A production-friendly scikit-learn pattern for mixed tabular data, class imbalance, calibrated probabilities, threshold selection, and model persistence.

Outcome: Defined an end-to-end scikit-learn classification pipeline that keeps preprocessing, imbalance handling, probability calibration, evaluation, thresholding, and production artifacts aligned.

machine learning scikit-learn calibration classification mlops

Sep 21, 2025 — 14 min — Platform & AI

Algorithm Complexity as Engineering Judgment

A practical way to use algorithm complexity in product engineering, from choosing data structures to designing recommendation features that do not collapse as data grows.

Outcome: Explained how algorithm complexity shows up in everyday product work, then walked through an e-commerce recommendation feature from naive loops to indexed lookup.

software engineering algorithms systems design typescript performance

Sep 17, 2025 — 11 min — Systems Notes

Why Teams Miss Goals They Actually Care About

The four reasons goal execution breaks down, the 4DX framework that addresses them, and why the apparent tension between goal-thinking and systems-thinking resolves the moment you understand lead measures.

Outcome: Clearer framework for translating organizational goals into team-level execution through lead measures, visible scoreboards, and accountable weekly cadences.

leadership team performance execution systems thinking management

Sep 13, 2025 — 6 min — Systems Notes

The Many Paths Into Data Architecture

Data architecture is a function, not a credential. The paths into it are genuinely varied, and that variety reflects something real about what the role actually demands.

Outcome: Clearer picture of how different technical backgrounds map to the data architect role and what makes each one a legitimate — or limited — foundation.

data architecture data engineering career data modeling data governance

Sep 9, 2025 — 13 min — Systems Notes

Ten Ideas About Thinking in 2026

A practical set of reflections on thinking quality, decision-making, analogies, conflict, expertise, and the invisible assumptions that shape product and career outcomes.

Outcome: Reframed ten provocative ideas into practical decision checks for product, engineering, consulting, and career judgment.

critical thinking decision intelligence systems thinking product judgment career

Sep 5, 2025 — 12 min — Systems Notes

Thinking and Communication Are Engineering Work

A design-review scenario showing why communication, facilitation, visual thinking, feedback, and critical judgment are part of engineering delivery.

Outcome: Defined a practical decision artifact for surfacing assumptions, tradeoffs, evidence, and feedback before a technical plan hardens.

systems thinking decision intelligence product discovery facilitation engineering