The Wire Blog

Page 4 of 8

Apr 29, 2026 · AI Agent

Tool-based agent memory: why 2026 benchmarks favor it

Tool-based agent memory exposes store, retrieve, and navigate as callable MCP tools. 2026 benchmarks from Mem0, Memanto, and Wire show why the pattern wins.

Apr 28, 2026 · Context Engineering

Why AI customer support replies sound generic

AI support replies sound generic because teams treat brand voice as a prompt problem. Context engineering fixes it by selecting the right exemplars.

Apr 27, 2026 · Structured Context

TOON vs JSON: why smaller doesn't mean cheaper for LLMs

TOON looks more compact than JSON, but a 9,649-test study found it cost LLMs 38% more tokens. The reason: model training distribution beats format size.

Apr 24, 2026 · AI Hallucination

GPT-5.5 didn't cut hallucinations 60%. Here's what it did.

OpenAI's GPT-5.5 system card reports 23% better claim-level accuracy, not the 60% hallucination reduction making press rounds. Here's what actually changed.

Apr 23, 2026 · Agent Drift

Agent drift: why long-running AI agents lose the plot

Agent drift is how AI agents silently deviate from goals over long-running tasks. Six mechanisms cause it, and most have nothing to do with the model.

Apr 22, 2026 · Context Engineering

Provenance is a context engineering primitive, not a trust score

Retrieval provenance for AI agents isn't an audit log or a trust verdict. It's structural metadata (source, position, time, edges) agents use to plan.

Apr 21, 2026 · Context Engineering

Why token cost doesn't scale with knowledge base size

AI token usage scales with knowledge base size only when the full corpus loads per query. The real variable is selective context delivery, not KB size.

Apr 20, 2026 · MCP (Model Context Protocol)

One job per tool: why adding wire_navigate cut agent calls 24%

We restructured Wire's MCP surface from 2 overloaded tools to 3 single-purpose ones. The counterintuitive result: adding a tool cut total calls 24%.

Apr 17, 2026 · AI Hallucination

GPT-5.4-pro hallucinates more than GPT-5.4-nano

Vectara's 2026 benchmark shows OpenAI's flagship GPT-5.4-pro hallucinates at 8.3% while its nano variant stays at 3.1%. The reasoning-model tradeoff, explained.

Apr 15, 2026 · AI Second Brain

How to build a private AI second brain

Native Notion and Obsidian MCP give every connected agent the same coarse scope. Build a private AI second brain with per-agent, revocable access across tools.