What 466 AGENTS.md files teach about context engineering
Key takeaway
AI notetakers default to producing a transcript because transcripts are the easy artifact to ship. The artifacts teams actually use downstream are different: decision logs, customer evidence files, sales handoff notes, blog drafts. The gap between what notetakers capture and what teams need is a context engineering problem, not a transcription one. The fix is to structure meeting context for the task that comes after the meeting, not to record more words.
AI notetakers are at near saturation in knowledge work. ICONIQ Capital’s 2026 GTM survey put AI adoption in go-to-market teams at 70%, with meeting transcription in the top three use cases. The AI meeting transcription market is set to grow from $3.86 billion this year to $29.45 billion by 2034. Everyone has Otter, Fathom, Granola, or Fireflies attending their calls now.
So why does every team I talk to have an inbox of transcripts they never reread?
Because notetakers ship the wrong artifact. They produce a transcript. The work downstream of a meeting needs something else: a decision log, a customer evidence file, a sales handoff note, a draft of the thing the meeting was supposed to produce. The gap between what gets captured and what gets used is the real problem with AI meeting tools, and it is a context engineering problem, not a transcription one.
AI notetakers produce a transcript and call the job done. The 21.5 hours per week the average knowledge worker spends in meetings (per Sonix’s 2026 adoption report) turns into roughly 170,000 words of transcript a week per person. The promise is that this becomes searchable, recoverable, useful. The reality is that almost nobody on the team rereads a 60-minute meeting.
The 62% time-savings figure that gets cited everywhere comes from one specific behavior: people no longer have to take notes during the meeting. That part is real. But the saved time gets clawed back the moment someone has to figure out what the meeting was actually about, what was agreed, and what needs to happen next. The transcript is technically searchable but practically inert.
What downstream work needs is not the verbatim words. It is the structured residue: the decision, the commitment, the customer’s exact language about a specific problem, the editorial-ready story. The notetaker captured the raw material. Nobody on the team turned it into the part you would actually consume.
Transcripts lose information in the wrong direction. They preserve every “um” and side conversation while losing the meeting’s structure: what mattered, what was decided, what is owed and by whom. A perfect transcript of a bad meeting still tells you nothing useful.
Fortune covered the HR-side consequence earlier this year: notetakers surfacing comments that should never have been on the record. People speak differently when a recorder is in the room. Notetakers do not flag the difference between the part of the call where someone was thinking out loud and the part where they were stating a position. The transcript treats both identically.
There is also a compression problem in the wrong direction. A 60-minute meeting produces around 8,000 words of transcript. To get value from those 8,000 words, somebody has to read them. The notetaker has just moved the attendance problem one step downstream: the meeting itself was an hour you sat through, and the transcript is an hour you now have to read. This is context offloading done badly. Offloading is supposed to relocate state to a destination cheaper to act on, not one identical in cost.
The problem is not the model used to transcribe. It is that “transcript” is the wrong target.
This is also a regression from the pre-AI baseline. A human notetaker filtered the meeting implicitly: they wrote down what mattered, skipped what did not, and translated “we should probably maybe look at that” into “parked, owned by Sam”. The output was lossy in the right direction. An AI notetaker is lossless in the wrong direction. The meeting becomes more recoverable in raw form and less useful in structured form, because the structuring step that used to happen during the meeting no longer happens anywhere.
Different downstream tasks need different artifacts from a meeting. Here are the four I see teams reach for in practice, and what each one requires.
| Artifact | Used by | What it needs from the meeting |
|---|---|---|
| Decision log | Eng leads, exec teams | What changed, why, who owns the follow-up |
| Customer evidence file | Marketing, product, research | Direct quotes tagged to themes and dates |
| Sales handoff note | AE to CSM, AE to AE | Account state, commitments made, open risks |
| Blog or content draft | Marketing, founders | Customer voice plus house voice and editorial rules |
None of these is a transcript. Each requires the meeting plus something the meeting did not contain: a decision template, a theme taxonomy, the account’s current state, the writer’s editorial preferences. Producing the artifact is the work of joining the meeting evidence to the task-specific context the meeting itself never held.
A notetaker that produces a transcript has done about 30% of the job, by volume. The remaining 70% is structuring and joining, and that is the part most workflows skip.
The fix is to structure meeting context for the task that comes after the meeting. Whisper-level transcription accuracy was solved years ago. The bottleneck moved upstream into structured context: turning the captured meeting into something the next step can consume.
That is what context engineering is for. The meeting evidence is one input. The task-specific context (a decision template, an account record, a voice corpus) is the second input. The artifact is the join of the two, shaped for the consumer.
The reason most notetakers do not do this is straightforward: the second input is not in the notetaker. Granola does not know your editorial preferences. Otter does not know which customer just churned. Fathom does not know your decision-log template. None of these tools were built to hold the other half of the join. They were built to capture the meeting and ship a transcript.
So the artifact problem stays unsolved as long as the only context available to the downstream agent is the transcript itself. The harder question is where the second input lives, and how the agent reaches both the meeting and the task context at once.
The pattern that works is two-source. The meeting tool holds the meeting evidence. A separate context layer holds the task-specific material. The agent reads both, joins them, and writes the artifact for review.
For a blog draft, the second layer holds a corpus of past posts (for voice), editorial rules (what to name and what to anonymize), and your topic backlog (so you do not write the same post twice). A Wire container alongside Granola is the concrete way to do this: past posts, editorial rules, and the topic backlog all sit as entries in one container reachable through the same five MCP tools, so the drafting agent runs a single search-and-write loop against a team-shared context layer instead of stitching across three separate connectors. The full walkthrough is in How to draft blog posts from your customer meetings.
Same shape for the other artifacts. A sales handoff agent needs the meeting plus the account record, the AE’s stated commitments, and the CSM’s required intake template. A decision log agent needs the meeting plus the team’s decision schema and the owner directory. A customer evidence file needs the meeting plus the active theme taxonomy and the rules about anonymization. None of these are transcription problems. All of them are structuring problems, and the structuring requires inputs the notetaker does not have on its own.
A useful diagnostic for whether the second source is in place is to ask whoever produces the artifact today whether they can do their job from the transcript alone. Marketing trying to write a customer story from a transcript will need the brand voice from somewhere. A CSM picking up an account from a sales handoff transcript will need the deal context from somewhere. If “somewhere” is a Slack thread, a private Notion doc, or in someone’s head, the structuring step is happening manually every time and the meeting tool has not actually reduced the work, it has just moved where the work happens. That recurring manual step is exactly what a queryable second context layer is for.
The same idea appears in customer support, where a related failure mode shows up as replies that sound generic because the voice corpus is missing from the agent’s context. The notetaker problem is the meeting-side version of the same gap.
The way to know if your meeting workflow has this problem is the inbox test. Open your AI notetaker’s archive. Count how many transcripts from the last month you have actually opened a second time. If the answer is zero, the notetaker is shipping an artifact nobody is consuming, and the time you thought you saved by not taking notes is being burned somewhere else.
The useful reframe is to pick the artifact before the meeting starts. “What does this meeting need to produce?” is a different question from “should we record this?” If the answer is “a transcript” the existing tools are fine. If the answer is “a decision the team will act on” or “a draft post” or “a handoff that doesn’t drop context”, the existing tools have done a third of the work and stopped.
That framing also tells you what the second source has to be. A decision-led artifact needs the team’s decision schema. An evidence-led artifact needs the theme taxonomy. A draft-led artifact needs a voice corpus. The notetaker can keep doing its one job. The structuring belongs in a different layer, and it is worth building before the next quarter of meetings piles up unread.
Sources: Meeting transcription adoption statistics 2026 - Sonix · AI notetakers are creating HR nightmares - Fortune · Top AI notetakers 2026 - AssemblyAI · Best AI Note Takers May 2026 - Jamie
Related
Wire transforms your documents into structured, AI-optimized context containers. Upload files, get MCP tools instantly.
Create Your First Container