Build It: Terminal Quiz Bot Powered by Claude

A CLI quiz app: Claude generates questions on any topic, you answer, it grades. Teaches prompts, loops, and keeping state.

45 min · Reviewed 2026

What we're building

A terminal script that asks for a topic, calls Claude to get 5 multiple-choice questions, asks them one at a time, tracks your score, and shows the final result. Real code, ~80 lines.

# pyproject.toml: anthropic, pydantic import json from pydantic import BaseModel, Field from anthropic import Anthropic class Question(BaseModel): prompt: str options: list[str] = Field(min_length=4, max_length=4) correct_index: int = Field(ge=0, le=3) explanation: str class QuizSet(BaseModel): topic: str questions: list[Question]Pydantic models define the exact shape we expect from the LLM.

client = Anthropic() def generate_quiz(topic: str, n: int = 5) -> QuizSet: prompt = f"""Generate exactly {n} multiple-choice questions about: {topic} Return ONLY valid JSON, no markdown fence, matching this shape: {{ "topic": "", "questions": [ {{"prompt": "", "options": ["a","b","c","d"], "correct_index": 0, "explanation": ""}} ] }} Target middle-school difficulty. No trick questions.""" response = client.messages.create( model="claude-opus-4-7", max_tokens=2000, messages=[{"role": "user", "content": prompt}], ) raw = response.content[0].text.strip() if raw.startswith("```"): raw = raw.strip("`").split("\n", 1)[1].rsplit("\n", 1)[0] return QuizSet.model_validate_json(raw)One LLM call returns the whole quiz. Pydantic will reject malformed output.

def run_quiz(quiz: QuizSet) -> int: score = 0 for i, q in enumerate(quiz.questions, start=1): print(f"\n--- Question {i}/{len(quiz.questions)} ---") print(q.prompt) for idx, opt in enumerate(q.options): print(f" {idx+1}. {opt}") while True: raw = input("Your answer (1-4): ").strip() if raw in {"1","2","3","4"}: break print("Please enter 1, 2, 3, or 4.") chosen = int(raw) - 1 if chosen == q.correct_index: print("Correct!") score += 1 else: correct = q.options[q.correct_index] print(f"Nope — answer was: {correct}") print(f"Why: {q.explanation}") return score def main(): topic = input("Topic? ").strip() or "the solar system" print(f"Generating quiz about {topic}") try: quiz = generate_quiz(topic) except Exception as e: print(f"Quiz generation failed: {e}") return final = run_quiz(quiz) print(f"\nFinal score: {final}/{len(quiz.questions)}") if __name__ == "__main__": main()The main loop: ask, validate input, score, report.

Mini-exercise

Run the quiz on three different topics
Add a --difficulty flag (easy/medium/hard) and wire it into the prompt
After the quiz, ask Claude to explain one question in simpler terms
Save results to quiz_history.json

Ad-hoc JSON parse	Pydantic schema
Crashes on bad output	Raises ValidationError with field path
Easy to write	Slightly more setup
Good for: throwaway scripts	Good for: anything you run twice

Big idea: an LLM + a typed schema + a simple loop is a stunningly powerful base for any interactive tool. You've now built the skeleton every AI tutor app shares.

End-of-lesson check

8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-prog-python-quiz-bot-builders

What is the main idea of "Build It: Terminal Quiz Bot Powered by Claude"?
1. A CLI quiz app: Claude generates questions on any topic, you answer, it grades. Teaches prompts, loops, and keeping state.
2. Use AI as the final authority for the whole decision
3. Avoid checking the answer once it sounds polished
4. Focus only on speed instead of judgment
Which concept is most central to "Build It: Terminal Quiz Bot Powered by Claude"?
1. LLM calls
2. CLI
3. state
4. JSON parsing
Which use of AI fits this topic best?
1. Let the AI decide what matters without your review
2. Use the answer before checking whether it fits the situation
3. Run the quiz on three different topics
4. Use the first answer without checking it
What should a careful learner remember about "Why a Pydantic schema matters"?
1. Use AI to draft or organize ideas about CLI, then verify before acting.
2. Skip the context so the tool can guess faster
3. Treat the output as private even after sharing it online
4. Use the answer without checking the source
You want to use AI after this lesson. What is the safest next step?
1. Act immediately because the AI answer is written clearly
2. Use the AI answer as a draft, then check it against a reliable source.
3. Hide uncertainty so the final answer looks cleaner
4. Use private or sensitive details before checking permission
How should AI output about CLI be treated?
1. As proof that no other source is needed
2. As a replacement for context, consent, or expert review
3. As a draft or helper output that still needs human judgment and verification
4. As something that becomes correct when it sounds confident
Name one way to verify an AI answer about CLI.
Which action would help you apply "Build It: Terminal Quiz Bot Powered by Claude" responsibly?
1. Use the tool to avoid thinking through the tradeoff
2. Keep going even if the output conflicts with a trusted source
3. Use the first answer without checking it
4. Add a --difficulty flag (easy/medium/hard) and wire it into the prompt

← Back to interactive lesson

Tendril · Builders · AI-Assisted Coding

Build It: Terminal Quiz Bot Powered by Claude

A CLI quiz app: Claude generates questions on any topic, you answer, it grades. Teaches prompts, loops, and keeping state.

45 min · Reviewed 2026

What we're building

A terminal script that asks for a topic, calls Claude to get 5 multiple-choice questions, asks them one at a time, tracks your score, and shows the final result. Real code, ~80 lines.

# pyproject.toml: anthropic, pydantic import json from pydantic import BaseModel, Field from anthropic import Anthropic class Question(BaseModel): prompt: str options: list[str] = Field(min_length=4, max_length=4) correct_index: int = Field(ge=0, le=3) explanation: str class QuizSet(BaseModel): topic: str questions: list[Question]Pydantic models define the exact shape we expect from the LLM.

client = Anthropic() def generate_quiz(topic: str, n: int = 5) -> QuizSet: prompt = f"""Generate exactly {n} multiple-choice questions about: {topic} Return ONLY valid JSON, no markdown fence, matching this shape: {{ "topic": "", "questions": [ {{"prompt": "", "options": ["a","b","c","d"], "correct_index": 0, "explanation": ""}} ] }} Target middle-school difficulty. No trick questions.""" response = client.messages.create( model="claude-opus-4-7", max_tokens=2000, messages=[{"role": "user", "content": prompt}], ) raw = response.content[0].text.strip() if raw.startswith("```"): raw = raw.strip("`").split("\n", 1)[1].rsplit("\n", 1)[0] return QuizSet.model_validate_json(raw)One LLM call returns the whole quiz. Pydantic will reject malformed output.

def run_quiz(quiz: QuizSet) -> int: score = 0 for i, q in enumerate(quiz.questions, start=1): print(f"\n--- Question {i}/{len(quiz.questions)} ---") print(q.prompt) for idx, opt in enumerate(q.options): print(f" {idx+1}. {opt}") while True: raw = input("Your answer (1-4): ").strip() if raw in {"1","2","3","4"}: break print("Please enter 1, 2, 3, or 4.") chosen = int(raw) - 1 if chosen == q.correct_index: print("Correct!") score += 1 else: correct = q.options[q.correct_index] print(f"Nope — answer was: {correct}") print(f"Why: {q.explanation}") return score def main(): topic = input("Topic? ").strip() or "the solar system" print(f"Generating quiz about {topic}") try: quiz = generate_quiz(topic) except Exception as e: print(f"Quiz generation failed: {e}") return final = run_quiz(quiz) print(f"\nFinal score: {final}/{len(quiz.questions)}") if __name__ == "__main__": main()The main loop: ask, validate input, score, report.

Mini-exercise

Run the quiz on three different topics
Add a --difficulty flag (easy/medium/hard) and wire it into the prompt
After the quiz, ask Claude to explain one question in simpler terms
Save results to quiz_history.json

Ad-hoc JSON parse	Pydantic schema
Crashes on bad output	Raises ValidationError with field path
Easy to write	Slightly more setup
Good for: throwaway scripts	Good for: anything you run twice

Big idea: an LLM + a typed schema + a simple loop is a stunningly powerful base for any interactive tool. You've now built the skeleton every AI tutor app shares.

End-of-lesson check

8 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-prog-python-quiz-bot-builders

What is the main idea of "Build It: Terminal Quiz Bot Powered by Claude"?
1. A CLI quiz app: Claude generates questions on any topic, you answer, it grades. Teaches prompts, loops, and keeping state.
2. Use AI as the final authority for the whole decision
3. Avoid checking the answer once it sounds polished
4. Focus only on speed instead of judgment
Which concept is most central to "Build It: Terminal Quiz Bot Powered by Claude"?
1. LLM calls
2. CLI
3. state
4. JSON parsing
Which use of AI fits this topic best?
1. Let the AI decide what matters without your review
2. Use the answer before checking whether it fits the situation
3. Run the quiz on three different topics
4. Use the first answer without checking it
What should a careful learner remember about "Why a Pydantic schema matters"?
1. Use AI to draft or organize ideas about CLI, then verify before acting.
2. Skip the context so the tool can guess faster
3. Treat the output as private even after sharing it online
4. Use the answer without checking the source
You want to use AI after this lesson. What is the safest next step?
1. Act immediately because the AI answer is written clearly
2. Use the AI answer as a draft, then check it against a reliable source.
3. Hide uncertainty so the final answer looks cleaner
4. Use private or sensitive details before checking permission
How should AI output about CLI be treated?
1. As proof that no other source is needed
2. As a replacement for context, consent, or expert review
3. As a draft or helper output that still needs human judgment and verification
4. As something that becomes correct when it sounds confident
Name one way to verify an AI answer about CLI.
Which action would help you apply "Build It: Terminal Quiz Bot Powered by Claude" responsibly?
1. Use the tool to avoid thinking through the tradeoff
2. Keep going even if the output conflicts with a trusted source
3. Use the first answer without checking it
4. Add a --difficulty flag (easy/medium/hard) and wire it into the prompt

← Back to interactive lesson