Iro AI Blog

What are tokens in AI?

Tokens are the small chunks of text an AI breaks language into — usually pieces of words. They're how models read, generate, get priced, and hit their limits. Here's the plain-English version.

By Alex FurukawaPublished 2026-06-22~5 min readAI Fluency

In this postDefinition How tokenization works Why tokens matter Do you need to care?

What is a token in AI?

A token is a small chunk of text — often a whole word, but sometimes part of one — that an AI model uses as its basic unit of language. Models don't read letter by letter or sentence by sentence; they break text into tokens, then predict the next token over and over to produce a response. "Tokenization" is just the step of slicing text into those pieces.

How tokenization works

Common words are usually a single token, while longer or unusual words get split into several. "Cat" might be one token; "unbelievable" might be two or three. Spaces and punctuation count too. A handy rule of thumb for English: about 0.75 words per token, so roughly 1,000 tokens covers about 750 words. Both your input and the model's output are measured this way — they're the unit behind a model's context window.

Practice this, don't just read it.

Iro AI turns ideas like the ones in this post into 5-minute exercises with feedback. Free tier, Pro from $5/month ($59.99/year, 7-day free trial).

Download Iro AI Take the AI IQ test

Why tokens matter

Cost — AI APIs are usually priced per token, so longer prompts and answers cost more.
Limits — context windows and rate limits are defined in tokens, capping how much a model can take in at once.
Speed — models generate one token at a time, so longer outputs take longer.

This is why a concise, well-targeted prompt is often cheaper, faster, and better — see why your prompts aren't working.

Do you need to think about tokens?

For everyday use, no — you don't need to count them. But understanding tokens demystifies the things that confuse beginners: why a chat "forgets," why an API bill is what it is, and why pasting a whole book doesn't work. That kind of mental model is exactly what AI fluency is made of, and you can build it in about 5 minutes a day. See where you stand with the free AI IQ test.

Practice this, don't just read it.

Iro AI turns ideas like the ones in this post into 5-minute exercises with feedback. Free tier, Pro from $5/month ($59.99/year, 7-day free trial).

Download Iro AI Take the AI IQ test

FAQ

What is a token in AI in simple terms?

A token is a small chunk of text — often a word or part of a word — that an AI model uses as its basic unit. Models read and generate text one token at a time rather than by letters or whole sentences.

How many words is a token?

A rough rule of thumb for English is about 0.75 words per token, or roughly 1,000 tokens per 750 words. Common words are usually one token; longer or unusual words get split into several.

Why do tokens matter?

Tokens are the unit behind pricing, rate limits, and context windows. They determine how much an AI can process at once, how much an API call costs, and partly how long a response takes to generate.

What is the difference between tokens and words?

Words are how people count language; tokens are how models do. A token can be a whole word or just part of one, so token counts are usually a bit higher than word counts — about 1.3 tokens per word in English on average.

What are tokens in AI?

What is a token in AI?

How tokenization works

Practice this, don't just read it.

Why tokens matter

Do you need to think about tokens?

Practice this, don't just read it.

Read next

FAQ

About the author