Context Window Discipline: What Fits in AI's Memory
Pasting a 50-page document plus your question often gets a worse answer than pasting just the relevant 2 pages.
17 min · Reviewed 2026
The big idea
Models have a 'context window' — how much text they can read at once. Modern Claude and GPT can hold 200k+ tokens, but performance degrades the more junk is in there. Curating the context (only what matters) beats dumping everything in.
Some examples
You paste a whole textbook to ask one question — Claude misses the answer that was on page 4.
You paste only chapters 3 and 4 — Claude nails it.
You include 10 PDFs of documentation — the model gets confused which one matters.
You include only the 1 PDF that's relevant + a clear question — the model is sharp.
Try it!
Take a long document and a question about it. First, ask with the whole doc. Then, ask with only the section you think is relevant. Compare quality.
End-of-lesson check
8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-builders-prompting-context-window-discipline-r8a8-teen
What is the main idea of "Context Window Discipline: What Fits in AI's Memory"?
Pasting a 50-page document plus your question often gets a worse answer than pasting just the relevant 2 pages.
Use AI as the final authority for the whole decision
Avoid checking the answer once it sounds polished
Focus only on speed instead of judgment
Which concept is most central to "Context Window Discipline: What Fits in AI's Memory"?
relevance
context window
context curation
needle in haystack
Which use of AI fits this topic best?
Let the AI decide what matters without your review
Use the answer before checking whether it fits the situation
You paste a whole textbook to ask one question — Claude misses the answer that was on page 4.
Use the first answer without checking it
What should a careful learner remember about "The rule"?