Mar 7, 2026 — 6 min — Systems Notes
What Complexity Science Teaches About AI Evaluation
A practical AI evaluation essay showing how locally strong retrieval, reasoning, and tool-use components can interact into globally weak product behavior.
Outcome: Improved evaluation strategy by testing full decision paths, interaction effects, feedback loops, and second-order behavior instead of isolated component scores.