---
title: "What is a context window in AI?"
canonical_url: "https://tryiro.com/blog/what-is-a-context-window"
site: "Iro AI"
site_url: "https://tryiro.com"
app_store: "https://apps.apple.com/app/id6759628066"
language: en-US
keywords: ["what is a context window", "context window AI", "context window meaning", "AI context window explained", "context window tokens", "context length"]
date_published: "2026-06-22"
date_modified: "2026-06-22"
reading_time_minutes: 5
author: "Alex Furukawa"
license: "© 2026 Iro AI"
canonical_llm_reference: "https://tryiro.com/llms-full.txt"
pillar: "ai-fluency"
---

# What is a context window in AI?

> A context window is how much text an AI can “see” at once — your prompt plus everything earlier in the conversation. Here's what it means in plain English and why it matters.

**Canonical:** https://tryiro.com/blog/what-is-a-context-window
**Published:** 2026-06-22
**Reading time:** ~5 min
**Author:** Alex Furukawa — Founder of Iro AI

## Key takeaways

- A context window is the maximum amount of text (measured in tokens) a model can consider at one time — input plus its own response.
- Everything competes for that space: your instructions, pasted documents, and the running conversation history.
- When you exceed it, the model “forgets” the oldest parts — a common reason long chats start drifting or losing details.
- Bigger windows help, but clear, well-organized prompts matter more than raw size.

## What is a context window?

**A context window is the maximum amount of text an AI model can consider at once — your current prompt, any documents you've pasted, the earlier conversation, and the model's own reply, all together.** Think of it as the model's short-term memory or its "field of view." It's measured in [tokens](/blog/what-are-tokens-in-ai) (roughly, chunks of words), and every model has a limit.

## How the context window works

Everything you send shares one budget. Your instructions, a long PDF you pasted, and the whole back-and-forth so far all count against the window. The model reads all of it to generate a response, and its answer counts too. A larger window means the model can take in more — a long report, a big codebase, hours of conversation — without losing track. Smaller windows fill up faster.

## Why AI seems to “forget” things

When a conversation grows past the window, the oldest content falls out of view. That's why a long chat can start contradicting itself, dropping details you mentioned early on, or "forgetting" instructions from the top. It isn't being careless — that text is simply no longer in its field of view. This is also why pasting a giant document and then asking many follow-ups can degrade: the document is crowding the window. Related: how to [spot when AI is making things up](/blog/spot-ai-hallucinations).

## Practical tips for working within the window

- **Put the important stuff up front and restate it** if a chat gets long.
- **Start a fresh chat** for a new task instead of letting one thread sprawl.
- **Paste only the relevant excerpt**, not an entire document, when you can.
- **Summarize** a long conversation and carry the summary into a new one.

Managing context well is a quiet superpower — part of broader [AI fluency](/become-ai-fluent). You can [build these habits in 5 minutes a day](/learn-ai-in-5-minutes-a-day).

## FAQ

**What is a context window in simple terms?**

It's how much text an AI can pay attention to at once — your prompt, any pasted documents, the earlier conversation, and the model's reply, all together. Think of it as the model's short-term memory, measured in tokens.

**Why does AI forget what I said earlier?**

Because long conversations eventually exceed the context window, and the oldest content drops out of the model's view. It isn't being careless; that text is simply no longer something it can see, so details and early instructions get lost.

**Is a bigger context window always better?**

It helps for long documents and long chats, but it isn't everything. A clear, well-organized prompt that puts the important information up front often matters more than raw window size — and huge inputs can still dilute the model's focus.

**How is a context window related to tokens?**

Context windows are measured in tokens, the small chunks of text models process. The window is the maximum number of tokens — input plus output — the model can handle at once, so longer text uses more of it.

## Read next

- [What are tokens in AI?](https://tryiro.com/blog/what-are-tokens-in-ai)
- [What is an LLM?](https://tryiro.com/blog/what-is-an-llm)
- [How to become AI fluent](https://tryiro.com/become-ai-fluent)
- [Take the free AI IQ test](https://tryiro.com/quiz)

## About the author

Alex Furukawa — Founder of Iro AI. Alex Furukawa is the founder of Iro AI, the gamified app for learning to use AI well. He writes about practical AI fluency — prompting, AI tools, and the daily habits that turn AI from a novelty into real leverage.