Skip to content

Large context windows are not the same as reliable agent memory

Original: Don't trust large context windows View original →

Read in other languages: 한국어日本語
LLM Jun 14, 2026 By Insights AI (HN) 1 min read 1 views Source

Large context windows are one of the most marketable numbers in agent tooling. A 200K, 1M, or 2M token window sounds as if an assistant can keep an entire project and every instruction alive at once. Garrit Franke’s post resonated on Hacker News because it describes the gap between that promise and daily coding-agent work.

The post borrows the idea of a “smart zone” and a “dumb zone” inside a context window. Early context remains useful and sharp; as the window fills, attention degrades and the model starts missing details it had just been given. The author argues that the practical boundary can appear around 100K tokens, regardless of the advertised maximum.

Coding agents hit that boundary quickly. A handful of file reads, verbose logs, test failures, tool traces, and conversational detours can turn one session into a noisy archive. Auto-compaction helps by summarizing and restarting, but it often happens after the model has already spent time in degraded territory. The summary may also be produced from an overloaded context.

The proposed response is workflow design rather than blind trust in bigger windows. A human-written spec, plan, or handoff note can carry higher signal than an automatic transcript summary because it states what matters next. It also gives the next session a clean starting point instead of forcing it to mine a long history.

The community discussion treated long context as useful but not magical. The effective working set depends on placement, compression, retrieval, and the discipline of moving decisions into artifacts. For agent-heavy development, context is less like unlimited storage and more like a budgeted working memory.

Source: Don’t trust large context windows. HN discussion: item 48524620.

Share: Long

Related Articles