Overview

Sampling, Surveys, and Statistical Inference is a medium-to-hard conceptual topic in the Problem-Solving and Data Analysis domain on the digital SAT. Questions test the ability to evaluate whether a sample is representative of a population, identify sources of bias in data collection, interpret margin of error and confidence intervals, and distinguish between what can and cannot be concluded from observational studies versus experiments. These are calculator-active questions, but they primarily require logical reasoning rather than computation.

Key Points

1. Random Sampling Types

TypeDescriptionSAT relevance
Simple random sampleEvery member of the population has an equal chance of selectionGold standard for generalization
Stratified randomPopulation divided into subgroups (strata); random sample taken from eachEnsures representation of subgroups
ClusterPopulation divided into clusters; entire clusters selected randomlyEfficient for geographically dispersed populations
SystematicEvery nth member selected from a listPractical; can generalize if list has no pattern

Key principle: A random sample is required to generalize findings to the broader population. A non-random (biased) sample cannot support general conclusions.

2. Sources of Bias

Bias TypeDescriptionExample
Voluntary responsePeople choose to respond; tend to have extreme opinionsOnline polls, “call in” surveys
ConvenienceSample taken from whoever is easily accessibleSurveying only students in one classroom
Leading questionQuestion wording steers respondents toward a particular answer”Don’t you agree that…?”
UndercoverageSome population segments are systematically excludedPhone surveys missing those without phones
NonresponseSelected individuals do not respond; pattern differs from respondentsLow survey return rates

All biased samples produce estimates that do not accurately represent the population.

3. Margin of Error and Confidence Intervals

A confidence interval gives a range within which the true population parameter likely falls.

Where MOE = margin of error and p̂ = sample estimate.

Example: “42% ± 3%” → the interval is [39%, 45%].

The SAT always uses a 95% confidence level. This means: if the same survey were repeated many times, approximately 95% of the resulting intervals would contain the true parameter.

Effect of sample size on MOE:

Larger sample → smaller MOE → more precise estimate. Smaller variability + larger sample → most precise estimate.

4. Observational Studies vs. Experiments

FeatureObservational StudyExperiment
Researcher roleObserves without interveningAssigns subjects to conditions
Random assignmentNoYes (in a well-designed experiment)
Can establish causationNo — association onlyYes — if random assignment used
Can generalize to populationOnly if sample was randomOnly if sample was random

Confounding (lurking) variable: A third variable correlated with both the explanatory and response variables, potentially explaining the observed association without a causal relationship.

Control group: Receives no treatment (or a placebo). Comparison to the treatment group isolates the effect of the intervention.

5. Valid and Invalid Conclusions — SAT Decision Tree

Was the sample randomly selected?
├── YES → Can generalize to the population
└── NO  → Cannot generalize; results apply only to the sample

Was there random assignment to groups?
├── YES (experiment) → Can claim causation
└── NO (observational) → Association only; cannot claim causation

6. Common SAT Inference Scenarios

ScenarioValid conclusionInvalid conclusion
Random survey of 500 city residentsEstimate city-wide preferencesEstimate national preferences
Study with random assignment, treatment shows improvementTreatment causes improvementNothing if sample was non-random
Convenience sample shows trendTrend exists in the sampleTrend applies to the population
Two variables are correlated in an observational studyAssociation existsCausation

Pitfalls and Common Mistakes

Mistake 1: Concluding causation from an observational study. A study shows that people who eat breakfast have higher GPAs — students conclude breakfast causes higher GPA. Fix: Observational studies cannot establish causation. Look for confounding variables (e.g., students who eat breakfast may also have more structured home environments).

Mistake 2: Generalizing from a biased or non-random sample. A survey of website users is used to draw conclusions about “all adults.” Fix: Only random samples of the target population support generalization to that population.

Mistake 3: Misinterpreting the confidence interval. “95% confidence interval” means there is a 95% chance the true value is in this interval for a specific sample. Fix: The correct interpretation is: “We are 95% confident this procedure produces intervals that contain the true parameter.” The true parameter is fixed; the interval varies across samples.

Mistake 4: Thinking a larger margin of error is better. A larger MOE means less precision. Fix: Smaller MOE = better. To reduce MOE, increase sample size.

Mistake 5: Confusing the scope of a conclusion. A study on students at one school cannot be used to make inferences about all students nationally. Fix: Conclusions are only valid for the population from which the random sample was drawn, not a broader group.

Quick Reference Card

ConceptRule
Random sampleRequired to generalize to the population
Random assignmentRequired to claim causation
Observational studyAssociation only — never causation
ExperimentCan establish causation if random assignment used
MOE interpretationTrue value is within estimate ± MOE (95% confidence)
Larger nSmaller MOE — more precise
Voluntary response biasOverrepresents strong opinions
Convenience biasNot representative; easy-to-access sample
Confounding variableThird variable explaining apparent association
SAT confidence levelAlways 95%