play icon for videos

How to Analyze Qualitative Data from Interviews

Stop treating interview analysis as a standalone task. Learn why organizations must rethink their entire qualitative workflow

Updated
May 29, 2026
360 feedback training evaluation
Use Case
The six steps in one view

From transcript to reported finding in six steps.

Interview analysis is not one operation. It is a sequence — six discrete steps, each producing an artifact the next step depends on. Skipping any one of them is the most common reason a stack of finished interviews never makes it to a defensible finding.

01
Transcribe

Convert the audio recording into verbatim text. Preserve speaker turns, hesitations, and emphases — they are part of the meaning.

ProducesTime-stamped transcript per interview
02
Read

Read the full set of transcripts once without coding. The pass builds familiarity and surfaces themes the codebook may have missed.

ProducesFamiliarization memo; codebook revisions
03
Code

Apply the codebook to each turn of the transcript. Every segment of meaningful text receives one or more code tags.

ProducesCoded transcripts with tag overlay
04
Theme

Group the codes into themes anchored in the research framework. Themes are higher-order than codes and carry interpretive weight.

ProducesTheme map with code groupings
05
Pattern

Look across the whole set. Which themes recur? Which subgroups raise which themes? Where do the data contradict the program's assumptions?

ProducesTheme frequencies; cross-tabs; outliers
06
Report

Write the findings. Pattern statements above, verbatim quotes below, source citations underneath. Every claim traces back to a moment in a transcript.

ProducesFindings with citation chains

Each step takes a different cognitive mode. Transcription is mechanical. Reading is interpretive. Coding is systematic. Theming is conceptual. Pattern identification is analytic. Reporting is rhetorical. Teams that conflate the steps — code while reading, theme while coding, report while patterning — produce findings that are hard to defend because the artifacts of the earlier steps are never actually built.

Four established approaches to interview analysis

Thematic, content, framework, narrative — what each one does and when to use it.

The six-step sequence is shared across approaches. What changes is the analytic lens — what the analyst is looking for when coding, and what the final output looks like. Most studies declare an approach upfront in the methods section and stay consistent across the dataset.

Approach 01 · most widely used
Thematic analysis

Identifies recurring topics, ideas, and patterns across a set of interviews. Codes are grouped into themes anchored either in the data (inductive) or in a predefined framework (deductive). The output is a theme map with frequencies and illustrative quotes.

Use when: The research question is exploratory — surfacing what participants experienced and how they describe it.
Approach 02
Content analysis

Counts and categorizes. How often specific words, codes, or categories appear; how frequency varies across subgroups. More quantitative than thematic — produces tables of counts and proportions rather than interpretive theme statements.

Use when: The research question is comparative — testing for differences in topic prevalence across groups or time.
Approach 03
Framework analysis

Applies a predefined matrix to the transcripts. Rows are participants, columns are framework dimensions (the funder's outcome framework, a theory of change, a regulatory taxonomy). Each cell holds the relevant excerpts from that participant on that dimension.

Use when: Reporting requires structured comparison across participants on a fixed set of dimensions — typical for funder evaluations.
Approach 04
Narrative analysis

Treats each interview as a story to be analyzed as a whole rather than broken into coded fragments. Focus is on sequence, plot, turning points, and how the participant constructs meaning over the course of telling.

Use when: The research question is biographical or identity-focused — life-course studies, recovery narratives, career trajectories.

Studies sometimes use more than one approach in the same analysis. A foundation evaluation might run framework analysis to populate a structured matrix for the funder report, then thematic analysis on a subset to surface cross-cutting patterns the framework did not anticipate. The pairing is more common than purist single-approach studies, particularly in applied research where the report has both a structured deliverable and an exploratory section.

Each step, expanded

What to actually do at each of the six steps.

Each card below names the practical work that step requires, the specific decisions the analyst makes, and the tactical traps that derail the step most often. Read the card before doing the work — not after.

STEP 01
Transcribe
What you do Convert the audio recording into verbatim text. Mark speaker turns, timestamp each section, and preserve hesitations and emphases as they were spoken. Choose intelligent verbatim (cleaning filler words) or full verbatim (preserving everything) depending on the analytic approach.
Common trap Summarizing as you transcribe. Summary loses the participant's wording, which is the unit of analysis in the next four steps. Transcribe what was said, not what you understood them to mean.
A 60-minute interview produces roughly 8,000 to 10,000 words of transcript. Tools handle the first pass in minutes; the analyst's job is to clean errors and verify speaker attribution.
STEP 02
Read for familiarization
What you do Read every transcript in the set before you begin coding any of them. Take memos — not yet code tags. The memos capture impressions, surprises, recurring language, and questions that emerge across the dataset.
Common trap Coding the first transcript before reading the others. The codebook gets over-fit to whoever spoke first. By transcript twenty, the codes from transcript one feel arbitrary.
A familiarization memo for a workforce study might note: "Three participants mentioned the supervisor specifically. Two raised childcare. One reframed the program as a credential rather than a skill build." Memos surface emerging themes before they get fixed in code.
STEP 03
Code
What you do Apply codes — short labels — to segments of text. One turn can carry multiple codes when it speaks to multiple themes. Codes come from the codebook, which is written before collection and revised after the familiarization pass.
Common trap Inventing codes as you go without updating the codebook. By transcript fifteen, you have 120 codes, 60 of which are near-duplicates, and the coding is no longer comparable across the set.
A line like "the labs helped me apply what we covered" gets tagged APPLY (applied learning) and may also receive PEDAGOGY-PREFERENCE depending on the codebook's structure.
STEP 04
Theme
What you do Group codes into themes that operate at a higher conceptual level. APPLY, SELF-EFFICACY, and CONFIDENCE-GROWTH might all roll up to a theme of "active learning and growing autonomy." The theme is what the report will eventually claim.
Common trap Treating themes as bigger codes. A theme is a relational interpretation — it tells you how codes connect, not just that they recur. A theme statement is a sentence, not a label.
A theme statement: "Hands-on practice drives self-directed problem-solving by mid-program, with participants reporting a shift from rote recall to applied reasoning."
STEP 05
Pattern
What you do Look across the full set of interviews. Which themes recur and how often? Which subgroups raise which themes? Where do the data diverge — when do participants disagree, and what predicts that disagreement? Cross-tab by demographics, site, cohort, or any variable on the record.
Common trap Reporting theme frequency without checking subgroup variation. A theme that occurs in 60% of transcripts overall may occur in 90% of one subgroup and 30% of another. The aggregate hides the finding.
Across 25 intake interviews, APPLY recurs in 17 transcripts. Disaggregated by prior-experience subgroup, it recurs in 14 of 16 first-time trainees and only 3 of 9 returning participants. The aggregate hides the real pattern.
STEP 06
Report
What you do Write each finding as a pattern statement supported by a verbatim quote and a source citation. The quote illustrates; the frequency defends. The citation lets the reader trace any claim back to the participant and moment in the transcript.
Common trap Treating one vivid quote as the finding. A quote is illustration. The cohort-level pattern is the evidence. The two travel together in a defensible finding.
A finding paragraph: "Hands-on labs drive self-directed problem-solving by week five (n=137, 68% of cohort). 'By week five I was solving things on my own.' — P-024, mid-program survey."
A worked example · one transcript, coded turn by turn

What a coded interview transcript actually looks like.

Below is a real shape of interview analysis output. Four turns from an intake interview with a workforce-training participant, with the codes applied to each turn shown alongside. The highlights inside the text show the specific phrases that triggered each code — so every tag traces back to a moment in the transcript.

INTAKE INTERVIEW · P-024 Workforce training program · Cohort 3 · Site B Excerpt min 14:22 — 16:48
INTERVIEWERProgram researcher

Can you tell me what you hoped to get out of this program when you enrolled?

P-024Participant · age 28 · prior retail

Honestly? I want skills that get me into a trade. I worked retail for six years and the hours and the pay both stayed the same. I want something I can build on — something where year three pays better than year one.

GOAL · OCCUPATIONALTrade skills as the explicit aim
MOTIVATION · CAREER MOBILITYProgression over stasis
INTERVIEWERProgram researcher

What about challenges? What do you anticipate being hard?

P-024Participant · age 28 · prior retail

Getting here every day is the part I'm worried about. The bus from where I live takes almost an hour each way, and if there's weather it's longer. I work evenings still so I'm tired before the day starts. If I make it through the first month, I think I'll be okay.

BARRIER · TRANSPORTATIONCommute as primary friction
BARRIER · ENERGYEvening shift competing with morning program
SELF-EFFICACY · CONDITIONALConfidence tied to early-stage persistence
INTERVIEWERProgram researcher

Tell me about the learning style that has worked for you in the past.

P-024Participant · age 28 · prior retail

I'm pretty hands-on. I never liked sitting in a room while someone read slides. I want to try the thing and see what breaks. The places where I've actually learned were the places where someone handed me the work and let me figure it out, with someone nearby in case I got really stuck.

PEDAGOGY-PREFERENCE · APPLIEDHands-on learning explicitly preferred
PEDAGOGY-PREFERENCE · SCAFFOLDEDSupport nearby but not dominant

Four turns produce seven distinct codes across three thematic clusters — goals, barriers, and pedagogy. Each code retains the specific phrase that triggered it. When the same codes recur across the other 199 intake interviews, the cross-participant pattern emerges. When P-024 returns three months later for the exit interview, the analysis can compare their pedagogy preferences as stated at intake against the pedagogy they actually engaged with — because the participant ID travels with every coded fragment.

From coded transcript to reported finding

What the analysis produces at the cohort scale.

One coded transcript is the unit of work. The reported finding lives at the cohort scale. The three panels below trace what happens between them, using the same workforce-training study from the worked example.

SCALE 01 · ONE One participant's coded transcript.
PARTICIPANT P-024 INTAKE · COHORT 3 · SITE B
Codes assigned 7 codes across 4 turns 7
Goal theme Occupational trade skills, career mobility GOAL
Barrier theme Transportation and energy from competing shift BARRIER
Pedagogy theme Hands-on, scaffolded, applied PEDAGOGY
Self-efficacy Conditional on first-month persistence CONDITIONAL

One participant's interview produces a structured profile. The profile is more than the codes — it captures the relationship between them.

SCALE 02 · TWO HUNDRED The cohort. Theme frequencies and subgroup variation.

Intake interview themes — Cohort 3 · n = 200

Career mobility 68%
Hands-on pedagogy 61%
Transportation barrier 31%
Childcare barrier 22%
Conditional self-efficacy 40%
Credential framing 15%
Same chart redraws by site, by prior-experience subgroup, by gender, by age band — because every coded fragment sits on a record that also holds those variables.
SCALE 03 · REPORTED The finding statement, with verbatim quote and source citation.
FINDING · COHORT 3 INTAKE ANALYSIS

Career mobility and hands-on pedagogy are the dominant intake themes; transportation surfaces as the leading anticipated barrier.

Across 200 intake interviews, 68 percent of participants described their primary goal in terms of career mobility — progression rather than wage stasis. 61 percent named a hands-on learning preference. The leading anticipated barrier was transportation (31 percent), with childcare second (22 percent). The transportation barrier was substantially more prevalent at Site B (47 percent) than at Sites A or C.

"I want something I can build on — something where year three pays better than year one." — P-024, intake interview min 14:32
Source: intake-int-c3 · 200 participants · Cohort 3 · CAREER MOBILITY n=137, HANDS-ON n=122, TRANSPORTATION n=62

The finding is defensible because the verbatim quote, the cohort frequency, and the source citation all travel together. Any reader can trace any claim back to a specific moment in a specific interview.

Four ways to code interview transcripts

How coding actually gets done — and what each method costs.

The coding step is where most teams hit the volume wall. Twenty-five interviews of an hour each take fifty to one hundred hours to code manually. The four approaches below carry different costs and different traceability properties; most teams pick based on study size, traceability requirements, and what software the institution already licenses.

APPROACH 01
Manual coding in a document tool
How it works Transcripts live in Word or Google Docs. Analyst reads each one, applies codes via highlights and comments, maintains a separate codebook document, and rolls up themes manually in a spreadsheet.
Tools WordGoogle DocsExcelNotion
Costs roughly 2 to 4 analyst-hours per interview-hour. Codebook drift is the main risk — codes proliferate without a single source of truth.
Best for studies with fewer than ten interviews where the analyst can hold the codebook in working memory across the full set.
APPROACH 02
CAQDAS — dedicated qualitative software
How it works Transcripts are imported into a tool built for qualitative coding. The codebook is enforced as a structure. Codes attach to text segments, theme rollups happen in the tool, queries support disaggregation and code co-occurrence.
Tools NVivoMAXQDAATLAS.tiDedoose
Costs roughly 1 to 3 analyst-hours per interview-hour after the learning curve. License and training costs are the front-loaded expense.
Best for studies with 15 to 100 interviews where a single analyst or small team has the time to develop fluency in the chosen tool.
APPROACH 03
Spreadsheet-based template coding
How it works A spreadsheet holds one row per participant. Columns are codebook dimensions. The analyst pastes the relevant transcript excerpt into the appropriate cell as they read. The structure is the codebook.
Tools ExcelGoogle SheetsAirtable
Costs roughly 1 to 2 hours per interview-hour for framework analysis. Traceability is good if the cell preserves the verbatim excerpt with a transcript reference.
Best for framework analysis with a fixed matrix and small to mid-size studies — funder evaluations with structured deliverables work well here.
APPROACH 04
Codebook-on-arrival platforms
How it works The codebook is defined upfront. As each transcript arrives, the platform applies the codebook automatically — assigning codes, computing theme frequencies, and preserving the link from each code back to the source phrase in the transcript. Analyst reviews and refines rather than coding from scratch.
Tools Sopact SenseOther integrated platforms
Costs roughly 15 to 30 minutes of analyst review per interview-hour. The cost shifts from coding to codebook quality and review.
Best for studies above 25 interviews, longitudinal designs running multiple waves, and reporting workflows where the same codebook must be applied consistently across cohorts and over time.

The shift across the four approaches is not a shift in the analytic operations — the six steps run the same way in all of them. The shift is in where the analyst's time goes. Manual coding spends 80 percent of analyst time on segment-by-segment tagging and 20 percent on interpretation. Codebook-on-arrival inverts the ratio: most of the analyst's hours go to designing the codebook and reviewing the patterns the platform surfaces.

Six failure modes in interview analysis

The mistakes that produce reports nobody trusts — and what to do instead.

Most disputed findings in interview research are not analytical errors. They are process failures from the earlier steps that surface as analytical errors at the reporting stage. The six cards below name the failures that show up most often, with the substitute that prevents each one.

MISTAKE 01
"I read it and got a feel for it"

Familiarization is treated as the analysis. The report writes itself out of recent memory and the analyst's overall impression of the transcripts. The codebook is never applied systematically. Different readers would report different findings.

Run the codebook across every transcript. Familiarization is step two of six, not the whole process.

MISTAKE 02
Codebook drift across the set

The codebook starts at 20 codes. Twenty transcripts in, it has 80 codes — half of them near-duplicates. The first ten transcripts are coded against the old codebook; the last ten against the new one. Comparison across the set is no longer valid.

Pause after the first few transcripts, revise the codebook, and re-code the full set against the revised version. Lock the codebook before final coding.

MISTAKE 03
Cherry-picking the vivid quote

The most quotable line in the dataset goes into the report. It is unrepresentative — most of the cohort said something different — but it is the quote everyone remembers from the interviews. The quote is doing the work the pattern data should be doing.

Lead every finding with the cohort-level frequency, then choose the verbatim quote that illustrates the dominant pattern, not the most marketable outlier.

MISTAKE 04
Theme frequency without subgroup variation

A theme recurring in 60 percent of transcripts is reported as a cohort-level finding. The fact that it shows up in 90 percent of one subgroup and 30 percent of another never gets surfaced. The aggregate hides the real story.

Cross-tab every theme by the disaggregation variables on the record — site, cohort, gender, prior experience, plan tier. The subgroup variation often is the finding.

MISTAKE 05
Lost source attribution

A quote appears in the report without a participant ID and without a transcript reference. The reader cannot trace the claim. Reviewers cannot replicate the analysis. The finding sits on the analyst's authority alone.

Every quote carries a participant ID and a transcript-and-timestamp reference. Every coded fragment retains the link back to the source through the full analysis pipeline.

MISTAKE 06
Coding capacity not budgeted

The team recruits and interviews thirty participants. Analysis is allocated two weeks. Manual coding takes six. The deadline lands with ten transcripts coded thoroughly and twenty summarized in single paragraphs. The dataset is half-analyzed; the report draws on the well-coded half and quietly drops the rest.

Plan the analysis backward from the reporting deadline. Either reduce the interview count to what the team can code, or shift to a coding approach that scales with volume.

Frequently asked

Interview analysis questions, answered.

Q.01

What is qualitative interview analysis?

Qualitative interview analysis is the process of turning interview transcripts into reported findings. It runs in six steps: transcribe the recording, read for familiarization, code the text against a defined framework, group codes into themes, identify patterns across interviews, and report the findings with verbatim quotes that trace back to the source. The output is cross-participant pattern data, not a stack of read transcripts.

Q.02

How do you analyze qualitative interview data?

Six steps. First, transcribe every interview verbatim. Second, read the full set of transcripts once without coding, to build familiarity. Third, apply a defined codebook to each turn of the transcript. Fourth, group the codes into themes anchored in the research framework. Fifth, look for patterns across interviews — frequency, co-occurrence, divergence between subgroups. Sixth, write findings with the verbatim quotes that illustrate each pattern. Every quote retains a link back to the participant and the moment in the transcript.

Q.03

What are the steps in interview analysis?

The six widely used steps are transcribe, read for familiarization, code, theme, pattern, and report. Some researchers add a seventh step — member checking, where the analyst returns themes to participants for validation. Others compress the steps into four if working with shorter or fewer interviews. The principle is constant: from raw transcript to coded text to thematic patterns to reported findings, with traceability preserved at every stage.

Q.04

What is thematic analysis of interviews?

Thematic analysis is the most widely used approach for interview data. It identifies recurring topics, ideas, and patterns across a set of transcripts by coding each segment of text and then grouping codes into themes. Thematic analysis is flexible — it works for inductive studies that surface themes from the data and deductive studies that test pre-defined themes against the transcripts. Output is a theme map with frequency counts and verbatim illustrative quotes.

Q.05

How do you code an interview transcript?

Coding an interview transcript means assigning short labels — codes — to segments of text that share a common topic, idea, or sentiment. Start with the codebook: a defined set of codes anchored in the research framework or theory of change. Apply codes line by line or paragraph by paragraph. One segment can carry multiple codes when it speaks to more than one theme. After the first pass, review the codebook for gaps and apply revisions across the full set so all transcripts are coded against the same final version.

Q.06

What is the difference between thematic and content analysis?

Both code text against a framework, but they aim at different outputs. Thematic analysis surfaces patterns of meaning — what participants are saying, why they are saying it, and how positions relate across the cohort. Content analysis counts and categorizes — how often specific words, codes, or categories appear and how their frequency varies across groups. Thematic analysis is interpretive; content analysis is quantitative. Many studies use both: content analysis for the frequency view, thematic analysis for the meaning view.

Q.07

How do you analyze interview data in qualitative research?

Qualitative research analysis of interview data follows the six-step process: transcribe, read, code, theme, pattern, report. The choice of analytic approach — thematic, content, framework, or narrative — shapes how coding is done and what the final output looks like, but the underlying sequence is shared. Research designs typically name the approach upfront in the methods section, then describe how each step was executed and how trustworthiness was established.

Q.08

How long does qualitative interview analysis take?

Manual analysis runs at roughly two to four hours per interview hour. A study with 25 hour-long interviews can take 50 to 100 hours of analyst time to code thoroughly. The volume is the most common reason qualitative analysis lags collection: teams plan around the cost of recruiting and interviewing without planning around the cost of analyzing. Modern workflows apply a defined codebook automatically as transcripts arrive, reducing the per-interview cost while preserving traceability back to source text.

Q.09

What software is used to analyze interview transcripts?

Four categories of software show up in interview analysis workflows. CAQDAS tools (NVivo, MAXQDA, ATLAS.ti, Dedoose) hold manual coding and theme management on the analyst's machine. Transcription tools (Otter, Rev, Descript) produce the transcript from audio. Spreadsheet and document tools (Excel, Google Docs) handle small studies via comment-and-highlight workflows. Integrated platforms apply a defined codebook to each transcript on arrival, surface theme distributions, and preserve citation chains back to the source response — useful when collection and analysis share one record per participant.

Q.10

How do you analyze interview transcripts?

Analyzing interview transcripts is the practical work of qualitative interview analysis. Read each transcript once for familiarization. Then apply the codebook to each turn — every segment of meaningful text gets one or more code tags. Group the codes into themes. Look for patterns across the full set: which themes recur, which subgroups raise which themes, where the data contradicts the program's assumptions. Report the pattern with a verbatim quote underneath each claim and a citation back to the participant and the moment in the transcript.

Q.11

What is an example of interview analysis?

A workforce training intake interview with participant P-024 contains the line: "The hands-on labs helped me apply what we covered. By week five I was solving things on my own." Coding tags this turn with APPLY (applied learning) and SELF-EFFICACY (growth in confidence). Across 200 intake interviews, APPLY recurs in 137 transcripts and SELF-EFFICACY in 82. The report claim — that hands-on labs drive applied learning and self-efficacy — is illustrated by the verbatim quote and supported by the cohort-level theme frequency.

Working session
Bring one transcript. See it coded against your framework by Friday.

A twenty-minute working session takes one real transcript from your study — intake, exit, customer, employee, patient — and shows what coding on arrival looks like against the framework you bring. No procurement decision required. The point is to see how the analysis backlog changes when coding and reporting share one record per participant.

Format Live walk-through. Twenty minutes. One call, no slideware.
What to bring One interview transcript and the framework or theory of change the codes should anchor to.
What you leave with A coded version of your transcript against your codebook, with theme frequencies and citation chains visible from the first pass.