Survey Data Analysis Methods: A 5-Step Guide | Sopact

The step-by-step procedure for analyzing survey data with descriptive, inferential, and mixed-methods techniques — built for cohort, equity, and longitudinal reporting.

Updated
April 25, 2026
How to analyze survey data, step by step

This is the procedure for taking a survey from raw responses to evidence a funder, leadership team, or researcher can act on. Five steps, in order — each shaped by the choice made in the step before. The hub page covers what survey data analysis is at the topology level; this page is the practical sequence for doing it.

01 Define: What are we testing? (descriptive, comparative, explanatory)
02 Clean: Is the data ready? (duplicates, validation, persistent IDs)
03 Match: Which test fits? (descriptive stats, inferential tests, regression)
04 Cross-tab: Does it hold across groups? (cohort, demographic, program track)
05 Integrate: Numbers and voice together (theme correlation, subgroup themes, narrative report)

Define the research question

The work begins before the data does. Every choice that follows — which methods apply, which variables are needed, what the results can legitimately claim — descends from how the research question is framed.

Three question types map cleanly to three analytical approaches.

Descriptive questions ask what the distribution looks like. “What was the average satisfaction score across the cohort?” is descriptive. The answer is a frequency table, a mean, a median.

Comparative questions ask whether two or more groups differ in a way that isn’t explainable by chance. “Did confidence improve from intake to program completion?” is comparative. The answer is an inferential test paired with an effect-size measure.

Explanatory questions ask what drives an outcome. “Which program elements predicted the largest confidence gain?” is explanatory. The answer combines regression with qualitative analysis through linked participant records.

A poorly framed question makes everything downstream harder. “How did participants feel?” is not analyzable — there is no method that maps to it because there is no testable claim inside it. Tightening the same question to “Did participants in cohort B report higher confidence than cohort A at program completion?” turns it into a comparative question with a defined answer shape.

The question also determines what variables need to be collected. A comparative question by cohort requires cohort number captured at intake. An explanatory question about program-element drivers requires those elements to be captured as separate fields. The work of analysis is downstream of the work of design — and the design choices are downstream of the research question.

For the broader topology of survey data analysis, see the discipline page. For the catalogue of statistical tests inside each question type, see the methods page.

Clean before you analyze

Cleaning is the step that ruins schedules. It is also the step most analysts underbudget for, because dirty data looks fine in a row-by-row scan and only reveals itself when a join fails or a percentage looks wrong by orders of magnitude.

Three kinds of problems make a dataset dirty.

Duplicate submissions. When the same person submits a survey twice — accidentally, deliberately, or because the platform allowed it — every aggregate is biased and every cross-tab is wrong. The fix is unique participant identification at the point of collection. If the survey requires authentication, the duplicate problem disappears at intake.

Validation failures. A respondent who types “twenty-eight” into a numeric age field, a malformed email address, or a date in the wrong format creates a value the analysis tools will either skip or interpret as zero. The fix is field-level validation at entry — block invalid input before the response is accepted.

Identifier mismatches. When the participant who completed the baseline survey appears under a slightly different identifier in the follow-up survey (“Jose Garcia” → “Jose A. Garcia”), the longitudinal join fails for that respondent. The fix is a persistent identifier issued once and reused across every survey instance for that participant.

The traditional cleaning pass — exporting responses, opening them in a spreadsheet, scanning for problems, fixing them by hand — typically runs to weeks for a program of any meaningful size. Architectural cleaning at collection takes zero time downstream because nothing accumulates. The choice between the two is made before the first survey goes out.
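
The three fixes can be sketched in a few lines of pandas. The column names and sample values here are hypothetical; the point is the order of operations: dedupe on the persistent ID, coerce invalid entries to missing rather than zero, then join follow-up responses on the same ID.

```python
import pandas as pd

# Hypothetical raw export: one duplicate submission, one invalid age.
raw = pd.DataFrame({
    "participant_id": ["P001", "P002", "P002", "P003"],
    "age": ["28", "twenty-eight", "34", "41"],
    "score": [4, 5, 5, 3],
})

# 1. Duplicate submissions: keep the first response per participant.
deduped = raw.drop_duplicates(subset="participant_id", keep="first")

# 2. Validation failures: coerce the numeric field so invalid entries become
#    missing values flagged for follow-up, not silent zeros in the aggregates.
deduped = deduped.assign(age=pd.to_numeric(deduped["age"], errors="coerce"))
invalid = deduped[deduped["age"].isna()]

# 3. Identifier mismatches: the longitudinal join works only because both
#    surveys share the same persistent participant_id.
followup = pd.DataFrame({"participant_id": ["P001", "P003"], "score_t2": [5, 4]})
joined = deduped.merge(followup, on="participant_id", how="left")
```

The `errors="coerce"` choice matters: invalid entries surface as missing values to review, instead of silently entering the aggregates as zeros.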

For the broader topology, see the discipline page.

Match the method to the question

The mapping is direct: descriptive questions call for descriptive methods; comparative questions call for inferential methods; explanatory questions call for regression and qualitative integration. Mismatched method-to-question pairings are the most common source of unfaithful findings.

For descriptive questions, descriptive statistics are sufficient. Frequencies count how often each option was chosen. Means and medians summarize numeric responses. Standard deviations describe spread. Cross-tabs by one demographic dimension describe distribution within subgroups. None of these methods make claims about generalization beyond the sample.

For comparative questions, inferential tests determine whether observed differences are likely real. A t-test compares two group means. ANOVA extends the comparison to three or more groups. Chi-square tests relationships between categorical variables. Each test produces a p-value that estimates the chance the observed difference came from random variation.

P-values alone are not enough. An effect size — Cohen’s d for mean differences, eta-squared for ANOVA, phi (or Cramér’s V for tables larger than 2×2) for chi-square — answers the second question that always follows a significant result: is the difference practically meaningful, not just statistically detectable? Funder-grade reporting includes both.
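
As a sketch of the pairing, here is a two-cohort comparison with SciPy; the cohort scores are invented for illustration. The t-test answers whether the difference is likely real, and Cohen’s d answers whether it is large enough to matter.

```python
import numpy as np
from scipy import stats

# Hypothetical confidence scores for two cohorts (1-10 scale).
cohort_a = np.array([5, 6, 5, 7, 6, 5, 6, 7, 5, 6])
cohort_b = np.array([7, 8, 6, 8, 7, 9, 7, 8, 6, 8])

# Inferential test: does the mean difference exceed chance variation?
t_stat, p_value = stats.ttest_ind(cohort_a, cohort_b)

# Effect size: Cohen's d, the mean difference in pooled-SD units.
n_a, n_b = len(cohort_a), len(cohort_b)
pooled_sd = np.sqrt(((n_a - 1) * cohort_a.std(ddof=1) ** 2 +
                     (n_b - 1) * cohort_b.std(ddof=1) ** 2) / (n_a + n_b - 2))
cohens_d = (cohort_b.mean() - cohort_a.mean()) / pooled_sd
```

Reporting both values side by side is the funder-grade habit: a tiny p-value with a negligible d is a detectable but unimportant difference.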

For explanatory questions, regression estimates which variables predict an outcome and by how much. Linear regression for continuous outcomes, logistic regression for binary outcomes, ordinal regression for ranked outcomes. Regression assumptions — independence, linearity, distributional shape — need to be checked or the model output is misleading even when the math runs.
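
A minimal single-predictor sketch, with invented hours-of-practice and confidence-gain values: the slope is the estimated change in outcome per unit of predictor, and the residuals are the first thing to inspect when checking the linearity assumption.

```python
import numpy as np
from scipy import stats

# Hypothetical data: hours of hands-on practice vs. confidence gain.
hours = np.array([2, 4, 5, 7, 8, 10, 12, 3, 6, 9])
gain = np.array([1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 1.2, 2.2, 3.2])

# Linear regression for a continuous outcome.
result = stats.linregress(hours, gain)

# Slope: estimated gain per additional hour. R-squared: variance explained.
slope, r_squared = result.slope, result.rvalue ** 2

# Assumption check: residuals should show no pattern against the predictor.
residuals = gain - (result.intercept + result.slope * hours)
```

With multiple predictors the same logic extends to multiple regression, but each added predictor adds an assumption to check before the coefficients can be trusted.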

Most explanatory questions also require qualitative analysis through the same participant records, because the why behind a quantitative pattern usually doesn’t reduce to a regression coefficient. That last step is described in step 5.

For the full catalogue of tests inside each method family, see the methods page.

Cross-tabulate to test whether the aggregate holds

Cross-tabulation is the most underused step in the procedure, and the one that distinguishes routine survey reporting from defensible evidence.

The principle is simple. An aggregate finding — “most participants reported the program was valuable” — describes the sample as a whole. It says nothing about whether that finding held across the populations the program served. Cross-tabulation breaks the aggregate by subgroup: cohort, gender, age band, income bracket, program track, prior enrollment. The same percentage that summarized the whole sample, recomputed for each slice.

What cross-tabulation typically reveals is that aggregates obscure as much as they describe. A program with strong overall numbers may be underserving one subgroup substantially. A program with mediocre overall numbers may be producing exceptional outcomes in a population the funder cares about. Neither finding is visible from the aggregate alone.
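
A minimal sketch of the recomputation, with invented responses: the overall percentage looks healthy while one cohort lags badly.

```python
import pandas as pd

# Hypothetical responses: the aggregate hides a subgroup gap.
df = pd.DataFrame({
    "cohort":   ["A"] * 6 + ["B"] * 6,
    "valuable": ["yes", "yes", "yes", "yes", "yes", "no",
                 "yes", "no", "no", "no", "yes", "no"],
})

# Aggregate: 7 of 12 (58%) said the program was valuable.
overall = (df["valuable"] == "yes").mean()

# Cross-tab: the same percentage, recomputed per cohort.
by_cohort = pd.crosstab(df["cohort"], df["valuable"], normalize="index")
# Cohort A: 83% yes. Cohort B: 33% yes. The aggregate obscured the gap.
```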

Cross-tabulation is also how equity gaps surface. The funder question that comes after every aggregate — “did this hold for everyone, or only some?” — is a cross-tabulation question. Programs that produce subgroup breakdowns by default answer that question on first reading. Programs that don’t are asked to go back and run the analysis a second time.

The architectural requirement is unremarkable in description and unforgiving in practice: every demographic and program variable that matters has to be captured at intake, structured as a dedicated field, and available for cross-tabulation against every survey outcome. Reconstructing demographics from free-text fields after the fact is possible but expensive. Adding a demographic variable midway through a longitudinal program is worse — the variable is missing for everyone enrolled before the change.

Cross-tabulation should be the default analysis output, not an additional step run on request.

Integrate qualitative themes

The final step ties the open-ended responses to the quantitative scores from the same respondents — and that integration is what produces the most defensible analysis output.

The integration happens at the participant level. A mixed-methods finding ties a coded theme from an open-ended response to a numeric score from the same person, and ties both to demographic and longitudinal context across surveys. “Participants who reported confidence gains tended to describe the same one or two specific moments in the curriculum” is a sentence only mixed-methods analysis can produce, and only if the data supports it.

The architectural requirement is the persistent participant identifier introduced in step 2. Every response a participant gives across every survey must share an identifier that survives across exports, joins, and time. Without that identifier, the open-ended response can be coded for theme, but the theme cannot be linked to the score the same person gave on the closed-ended question. The integration breaks at the join.

With the identifier in place, three integration outputs become possible.

Theme correlation with quantitative outcomes. The frequency of a coded theme can be mapped against the distribution of a quantitative score, surfacing relationships like “participants who mention real-world application tend to score higher on confidence gain.”
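
A sketch of that mapping in pandas, with hypothetical IDs, scores, and one coded theme: the join on the persistent participant ID is what makes the comparison possible.

```python
import pandas as pd

# Hypothetical: coded themes joined to scores via the persistent participant ID.
scores = pd.DataFrame({
    "participant_id": ["P1", "P2", "P3", "P4", "P5", "P6"],
    "confidence_gain": [3, 1, 4, 1, 3, 4],
})
themes = pd.DataFrame({
    "participant_id": ["P1", "P3", "P5", "P6"],
    "theme": ["real-world application"] * 4,
})

# Flag who mentioned the theme, then compare mean gain by mention.
merged = scores.merge(themes, on="participant_id", how="left")
merged["mentions_theme"] = merged["theme"].notna()
gain_by_theme = merged.groupby("mentions_theme")["confidence_gain"].mean()
# Participants mentioning the theme average a higher gain than those who don't.
```

Without the shared identifier, the `merge` at the center of this sketch has nothing to join on, and the theme and the score stay in separate tables.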

Subgroup analysis of qualitative themes. Themes can be cross-tabulated by demographic, the same way scores are. Different subgroups often describe the same outcome in different language.

Narrative reports. Statistical findings, supporting participant voice, and recommendations assemble into a single readable document. The voice is real (taken from the same respondents whose scores are being reported) and the analysis is defensible (every claim is traceable to its source).

For the broader topology, see the discipline page.

Common pitfalls
Where the procedure breaks

Most survey analyses fail in one of four predictable ways. Each one corresponds to a step in the procedure that was skipped, rushed, or treated as optional.

01
Stopping at frequencies

Reporting that a percentage of respondents agreed without disaggregating by subgroup. The aggregate describes the sample; it does not answer the funder question that always follows: did this hold for everyone?

02
Cleaning after the fact

Letting duplicates, validation failures, and identifier mismatches accumulate, then trying to fix them by hand at analysis time. The cleaning either happens at collection or it shows up as bad evidence in the final report.

03
Mismatching method to question

Running a t-test on a question that isn’t comparative. Reporting a p-value without an effect size. Citing a regression coefficient when the assumptions weren’t checked. Each one breaks the link between the question and the answer.

04
Treating qualitative as exhaust

Leaving open-ended responses unread or summarized in one sentence at the end of the report. The qualitative side carries the explanation behind the numbers. Skipping it means the report describes what without ever explaining why.

A working principle

The quality of the analysis is set in step two. By step five, the procedure can only reveal what the data preserved at the start.

FAQ
Practical questions about the procedure
  • How long does survey data analysis take in practice?

    Traditional analysis cycles run weeks to months for any program with mixed-methods data. The cleaning step alone typically consumes weeks when handled post-hoc. Architectural automation — clean data at collection, processing as responses arrive, reporting from prompts — compresses the cycle from weeks to minutes by eliminating the handoffs between steps rather than speeding any single step.

  • Can I skip the cleaning step if my data looks fine?

    No. Dirty data rarely looks dirty in a row-by-row scan. Duplicates and identifier mismatches reveal themselves only when a join fails or a percentage looks wrong by orders of magnitude. The cleaning pass either happens before analysis or it shows up as bad evidence in the final report.

  • What if my sample is too small for inferential statistics?

    Below roughly thirty respondents per group, inferential tests lose power and effect-size estimates become unstable. The honest reporting move is to describe the sample with descriptive statistics, note the small-sample limitation, and lean more heavily on qualitative analysis through participant voice. A well-coded qualitative analysis from a small sample is more defensible than a statistically underpowered comparison.
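
The power loss is easy to demonstrate by simulation. This sketch assumes a true medium effect (Cohen’s d = 0.5) and counts how often a t-test detects it at two group sizes; 64 per group is the conventional size for roughly 80% power at that effect.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)

def simulated_power(n, effect=0.5, sims=2000, alpha=0.05):
    """Fraction of simulated two-group studies that detect a true effect."""
    hits = 0
    for _ in range(sims):
        a = rng.normal(0.0, 1.0, n)      # control group
        b = rng.normal(effect, 1.0, n)   # treatment group, true d = 0.5
        if stats.ttest_ind(a, b).pvalue < alpha:
            hits += 1
    return hits / sims

small = simulated_power(15)   # small groups: the effect is usually missed
large = simulated_power(64)   # conventional size for ~80% power at d = 0.5
```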

  • How do I handle a survey with both closed and open-ended questions?

    Treat them as integrated from the design phase forward, not as separate analyses joined at the end. Every open-ended response should share a participant identifier with the closed-ended responses on the same survey, so themes from the qualitative side can be cross-referenced with scores from the quantitative side. The integration is the most defensible analysis output and the most architecturally demanding to set up.

  • What is the most common mistake in survey analysis?

    Stopping at aggregate frequencies. Reporting that a percentage of respondents agreed without disaggregating by subgroup, or without pairing the result with effect size, is the most common gap between a survey report and defensible evidence. The aggregate describes the sample; the subgroup breakdown answers what funders actually ask.

  • Do I need a statistician to run survey analysis?

    Not for most programs. Descriptive statistics, cross-tabulation, and basic inferential tests with effect-size measures are accessible without a dedicated statistician when the analysis tools handle the math. A statistician becomes valuable when the analysis involves regression with multiple predictors, longitudinal models, or methods sensitive to assumption violations. For most program reporting, the architectural design of data collection matters more than the seniority of the analyst running the tests.

Related Guides
Where to go from here

The five steps describe the manual sequence. The discipline that runs them as a single connected workflow is upstream.

The hub page covers the topology of survey data analysis — the three approaches, the four outputs that frequency tables can’t produce, and how to choose between them.

See how survey data analysis ladders up