The big idea
AI breaks every message into small chunks called tokens before reading. A token is often a piece of a word.
Some examples
- The word 'unhappy' might be 2 tokens: 'un' + 'happy'.
- A short word like 'cat' is usually 1 token.
- Spaces and punctuation can be tokens too.
- 100 words is roughly 130 tokens in English.
Try it!
Type 'pineapple' to AI and ask how many tokens that is. Most AIs split it into 2 or 3 chunks.
End-of-lesson check
15 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-explorers-foundations-AI-and-the-token-not-word-r10a5
Before AI can understand a sentence you type, it first breaks the sentence into what?
- Complete sentences
- Full dictionary words
- Tiny word chunks called tokens
- Pictures of letters
A token in AI is most similar to which of these?
- A whole paragraph
- An entire dictionary
- A syllable in a spoken word
- A complete book chapter
If you type the word 'cat' into an AI, how many tokens will it likely become?
- 10 tokens
- 5 tokens
- 1 token
- 3 tokens
The word 'unhappy' would most likely be broken into how many tokens by AI?
- 2 tokens: 'un' and 'happy'
- 1 token
- 5 tokens
- 10 tokens
When you count 100 words you typed, about how many tokens would an AI say it actually received?
- Around 130 tokens
- Around 20 tokens
- Exactly 50 tokens
- Exactly 100 tokens
Which of these is TRUE about how AI sees text?
- AI sees tokens, not full words
- AI sees pictures of each letter
- AI reads text exactly like humans do
- AI ignores spaces completely
You type 'pineapple' into an AI and ask how many tokens it is. What will the AI most likely tell you?
- 50 tokens
- 100 tokens
- Exactly 1 token
- 2 or 3 tokens
Why does AI split words into tokens instead of reading whole words?
- Tokens let AI draw pictures of words
- Tokens help AI process language more efficiently
- Tokens are easier for humans to read
- Tokens make the text longer
If someone says AI 'reads' words, what do they really mean?
- AI remembers every word perfectly
- AI sees the words as complete pictures
- AI speaks the words out loud
- AI breaks words into token chunks and processes those
The word 'building' might become which of these token combinations?
- 'building' (1 token)
- 'build' + 'ing' (2 tokens)
- 'b' + 'u' + 'i' + 'l' + 'd' + 'i' + 'n' + 'g'
- 'build' (1 token)
When you send a message to AI, how does it count the length of your message?
- By counting pictures
- By counting every letter
- By counting sentences only
- By counting tokens, not words
What is the MAIN thing you learned about how AI processes text?
- AI reads text as small chunks called tokens
- AI cannot read text at all
- AI only reads text written by humans
- AI reads text as complete sentences only
A token could be which of the following?
- A piece of a word like 'un' or 'ing'
- A sound recording
- An entire encyclopedia
- A video file
Two different AIs might split the word 'pineapple' differently. What numbers might they each give you?
- They would both always say 1
- They would both always say 100
- They would both always say 50
- One might say 2, another might say 3
If you ask an AI to count the 'words' in your sentence, what is it actually counting?
- Nothing - AI cannot count
- How many times you pressed keys
- Tokens that represent word pieces
- The actual letters