Tendril

Lesson 350 of 2116

Qualitative Coding With AI: Inter-Rater Reliability Still Matters

AI can tag interview transcripts at 1000x human speed. That speed is worthless without validation. Here's the honest workflow.

CreatorsResearch & Analysis~24 min readInteractiveBI3 · LearningBI4 · Natural InteractionBI2 · Representation & ReasoningPrint / PDF

Lesson map

What this lesson covers

40 min47 blocks9 concepts

Learning path

The main moves in order

1The tempting shortcut and its problem
2AI for Qualitative Data Coding: Speed With Validity
3The premise
4AI Qualitative Codebook Iteration: Supporting Inductive-Code Refinement

Concept cluster

Terms to connect while reading

qualitative codingthematic analysisinter-rater reliabilitycodebookvalidationintercoder reliability

Sections15

Lists8

Notes16

Tools1

Terms2

Section 1

The tempting shortcut and its problem

Qualitative coding is the slow heart of interview research: read 40 transcripts, tag every meaningful passage, let themes emerge. LLMs can do a first pass in minutes. But 'a first pass' is not a finished analysis. Treating the first pass as the final product is how AI-assisted research gets rejected at peer review.

The defensible workflow

1Have the LLM propose an initial codebook from 3-5 transcripts
2Human researchers refine the codebook, collapsing redundant codes and naming them carefully
3LLM codes all transcripts using the refined codebook
4Humans independently re-code a random 15-20% sample
5Calculate inter-rater reliability (Cohen's kappa) between human and LLM
6If kappa < 0.7, revise the codebook and re-run

Check-in 1. Got it so far?

What to disclose in the methods section

Which model did the coding (including version and date)
The exact prompt template used for coding
The codebook (as a supplementary appendix)
The human-AI agreement statistics
Any cases where humans overrode the LLM's codes

Check-in 2. Got it so far?

Key terms in this lesson

The big idea: AI accelerates qualitative coding — it does not replace the validation work. Kappa statistics and disclosures are non-negotiable.

Check-in 3. Got it so far?

Section 2

AI for Qualitative Data Coding: Speed With Validity

Section 3

The premise

AI-assisted qualitative coding can multiply researcher capacity; validity requires explicit validation methodology.

What AI does well here

Use AI for initial open coding at scale
Maintain researcher review on a sample for validation
Calculate intercoder reliability between AI and human coders
Document the AI methodology in publications for transparency

Check-in 4. Got it so far?

What AI cannot do

Substitute AI for the interpretive insight researchers bring
Replace validation with pure trust in AI output
Generate genuine novel theme discovery

Check-in 5. Got it so far?

Section 4

AI Qualitative Codebook Iteration: Supporting Inductive-Code Refinement

Section 5

The premise

AI can suggest code-merge candidates and sub-theme groupings from coded transcripts to support iterative codebook refinement.

Check-in 6. Got it so far?

What AI does well here

Cluster similar codes by definition and exemplar overlap.
Generate intercoder-disagreement summaries to inform calibration.

What AI cannot do

Decide the final code structure or theoretical framing.
Replace researcher reflexivity in interpretive work.

Check-in 7. Got it so far?

Check-in 8. Got it so far?

Section 6

AI and Qualitative Coding Second Pass: Catching What You Missed

Section 7

The premise

Two human coders cost time you don't have; AI is a cheap second coder that catches different things than you do.

What AI does well here

Apply your existing codebook to new transcripts
Surface candidate themes you didn't include
Flag where the same passage could fit two codes
Suggest where a theme might need splitting

Check-in 9. Got it so far?

What AI cannot do

Replace a human collaborator's interpretive depth
Notice cultural cues without explicit framing

Check-in 10. Got it so far?

Key terms in this lesson

End-of-lesson quiz

Check what stuck

15 questions · Score saves to your progress.

Tutor

Curious about “Qualitative Coding With AI: Inter-Rater Reliability Still Matters”?

Ask anything about this lesson. I’ll answer using just what you’re reading — short, friendly, grounded.

Progress saved locally in this browser. Sign in to sync across devices.

Related lessons

Qualitative Coding With AI: Inter-Rater Reliability Still Matters

The tempting shortcut and its problem

The defensible workflow

What to disclose in the methods section

AI for Qualitative Data Coding: Speed With Validity

The premise

What AI does well here

What AI cannot do

AI Qualitative Codebook Iteration: Supporting Inductive-Code Refinement

The premise

What AI does well here

What AI cannot do

AI and Qualitative Coding Second Pass: Catching What You Missed

The premise

What AI does well here

What AI cannot do

Curious about “Qualitative Coding With AI: Inter-Rater Reliability Still Matters”?

Keep going

Qualitative Coding With AI: Inter-Rater Reliability Still Matters

The tempting shortcut and its problem

The defensible workflow

What to disclose in the methods section

AI for Qualitative Data Coding: Speed With Validity

The premise

What AI does well here

What AI cannot do

AI Qualitative Codebook Iteration: Supporting Inductive-Code Refinement

The premise

What AI does well here

What AI cannot do

AI and Qualitative Coding Second Pass: Catching What You Missed

The premise

What AI does well here

What AI cannot do

Curious about “Qualitative Coding With AI: Inter-Rater Reliability Still Matters”?

Keep going