Inference
Using sample data to make a justified claim about a population parameter, accounting for sampling variability (uncertainty).
Parameter
A numerical characteristic of a population (e.g., the population mean μ).
Statistic
A numerical characteristic computed from a sample (e.g., the sample mean x̄), used to estimate a parameter.
Population Mean (μ)
The true average of a quantitative variable for the entire population of interest.
Sample Mean (x̄)
The average of the sample data; a point estimate of the population mean μ.
Population Standard Deviation (σ)
The true standard deviation of the population; usually unknown in real applications.
Sample Standard Deviation (s)
The standard deviation computed from sample data; used to estimate the unknown population standard deviation σ.
Standard Error (SE)
An estimate of the standard deviation of a statistic; for a one-sample mean, SE = s/√n.
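To make the SE formula concrete, here is a minimal Python sketch using a made-up sample (the data values are purely illustrative):

```python
import math

# Hypothetical sample of n = 16 measurements (illustrative values only)
sample = [4.8, 5.1, 5.0, 4.9, 5.3, 5.2, 4.7, 5.0,
          5.1, 4.9, 5.2, 5.0, 4.8, 5.1, 5.0, 4.9]

n = len(sample)
xbar = sum(sample) / n                                          # sample mean x̄
s = math.sqrt(sum((x - xbar) ** 2 for x in sample) / (n - 1))   # sample SD s (divide by n − 1)
se = s / math.sqrt(n)                                           # standard error SE = s/√n
```

Note the n − 1 divisor in s, matching the sample standard deviation definition above.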
z Statistic
Standardized statistic for a mean when σ is known: z = (x̄ − μ)/(σ/√n), which follows the standard normal model under conditions.
t Statistic
Standardized statistic for a mean when σ is unknown: t = (x̄ − μ)/(s/√n).
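A quick sketch of the t statistic computed from summary statistics; the numbers here are hypothetical, chosen only to illustrate the formula:

```python
import math

# Hypothetical one-sample summaries (illustrative values)
xbar, s, n = 5.02, 0.18, 16   # sample mean, sample SD, sample size
mu0 = 5.00                    # hypothesized population mean under H0

t = (xbar - mu0) / (s / math.sqrt(n))   # t = (x̄ − μ0)/(s/√n)
```

Replacing s with a known σ in the denominator gives the z statistic instead.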
Student’s t Distribution
A family of symmetric, bell-shaped distributions centered at 0 with heavier tails than the normal; used for inference about means when σ is unknown.
Degrees of Freedom (df)
A parameter that determines the exact shape/spread of a t distribution; smaller df gives heavier tails.
One-Sample Degrees of Freedom (n − 1)
For one-sample t procedures, df = n − 1.
Heavier Tails
A distribution feature giving more probability to extreme values; t distributions have heavier tails than normal because s varies from sample to sample.
Critical Value (t*)
The cutoff from a t distribution used to create a confidence interval, based on confidence level and df.
Confidence Interval
An interval of plausible values for a population parameter, computed from sample data.
Confidence Level (C%)
The long-run success rate of the interval method: about C% of intervals from repeated random samples would capture the true parameter.
Margin of Error (ME)
How far the confidence interval extends from the point estimate; for a one-sample mean, ME = t*·(s/√n).
One-Sample t Confidence Interval
Interval for a population mean μ (σ unknown): x̄ ± t*·(s/√n), with df = n − 1.
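Putting the margin-of-error and interval formulas together, a minimal sketch with hypothetical summary statistics (the t* value 2.131 is the standard table entry for 95% confidence with df = 15):

```python
import math

# Hypothetical summaries (illustrative values); df = n − 1 = 15
xbar, s, n = 5.02, 0.18, 16
t_star = 2.131                      # t* for 95% confidence, df = 15 (t table)

me = t_star * (s / math.sqrt(n))    # margin of error ME = t*·(s/√n)
ci = (xbar - me, xbar + me)         # confidence interval x̄ ± ME
```

A higher confidence level or smaller n (smaller df) gives a larger t*, hence a wider interval.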
Significance Test
A procedure that uses sample data to evaluate evidence against a claim about a population parameter (e.g., a claim about μ).
Null Hypothesis (H0)
The default claim tested, stated with equality (e.g., H0: μ = μ0).
Alternative Hypothesis (Ha)
The competing claim you seek evidence for (e.g., Ha: μ ≠ μ0, μ > μ0, or μ < μ0).
Two-Sided Alternative (μ ≠ μ0)
An alternative hypothesis looking for a difference in either direction; leads to a two-sided p-value.
One-Sided Alternative (μ > μ0 or μ < μ0)
An alternative hypothesis specifying a direction; p-value is computed in the corresponding tail.
p-Value
Assuming H0 is true, the probability of getting a test statistic at least as extreme as the observed one (in the direction(s) of Ha).
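As an illustration of "area in the tail(s)," the sketch below computes p-values for a z statistic (σ known), since the standard normal CDF is available in Python's standard library; a t-based p-value works the same way but needs a t distribution CDF. The z value is hypothetical:

```python
from statistics import NormalDist

# Hypothetical observed z statistic (illustrative)
z = 2.05

p_upper = 1 - NormalDist().cdf(z)                  # one tail, for Ha: μ > μ0
p_two_sided = 2 * (1 - NormalDist().cdf(abs(z)))   # both tails, for Ha: μ ≠ μ0
```

The two-sided p-value doubles the tail area because "at least as extreme" includes both directions.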
Significance Level (α)
The cutoff probability for deciding statistical significance (commonly 0.05); compared to the p-value.
Reject H0
Decision when p ≤ α; conclude the data provide convincing evidence against H0 (supporting Ha).
Fail to Reject H0
Decision when p > α; conclude the data do not provide convincing evidence against H0 (not the same as “accept H0”).
Type I Error
Rejecting a true null hypothesis (a “false positive”).
Type II Error
Failing to reject a false null hypothesis (a “false negative”).
Power (1 − β)
The probability of rejecting H0 when H0 is false; power = 1 − (probability of Type II error).
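Power can be estimated by simulation: draw many samples from a specific alternative and count how often the test rejects H0. A sketch for a one-sided z test with made-up parameters (μ true = 0.5, σ = 1 known, n = 25, α = 0.05):

```python
import random
import statistics

random.seed(3)  # reproducible illustration

# Hypothetical setup: H0: μ = 0 vs Ha: μ > 0, with σ = 1 known
mu_true, sigma, n, alpha = 0.5, 1.0, 25, 0.05
z_crit = statistics.NormalDist().inv_cdf(1 - alpha)   # reject H0 when z > z_crit

reps, rejections = 2000, 0
for _ in range(reps):
    xbar = statistics.fmean(random.gauss(mu_true, sigma) for _ in range(n))
    z = (xbar - 0) / (sigma / n ** 0.5)
    if z > z_crit:
        rejections += 1

power_hat = rejections / reps   # estimated power ≈ P(reject H0 | μ = 0.5)
```

The estimate lands near 0.80 here; larger n, larger α, or a bigger true effect all raise power.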
Random Condition
Requirement that data come from a random sample or randomized experiment; supports generalization (random sampling) or cause-and-effect (random assignment).
Independence
Condition that observations do not influence each other; often supported by sampling design and the 10% condition when sampling without replacement.
10% Condition
When sampling without replacement, require n ≤ 0.10(population size) to justify independence.
Normal/Large Sample Condition
For t procedures, the sampling distribution is approximately normal if the population is roughly normal or n is large enough for CLT; watch for strong skewness/outliers.
Central Limit Theorem (CLT)
For sufficiently large n, the sampling distribution of x̄ is approximately normal even if the population is not (often summarized by n ≥ 30, but outliers/skewness can still matter).
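The CLT can be checked directly by simulation: sample repeatedly from a clearly non-normal population and look at the distribution of x̄. A sketch using an exponential population (mean 1, σ = 1), with parameters chosen for illustration:

```python
import random
import statistics

random.seed(1)  # reproducible illustration

# Non-normal population: exponential with mean 1 (so σ = 1)
n, reps = 40, 2000
means = [statistics.fmean(random.expovariate(1.0) for _ in range(n))
         for _ in range(reps)]

# CLT predicts the x̄'s center near μ = 1 with SD near σ/√n ≈ 0.158
center = statistics.fmean(means)
spread = statistics.stdev(means)
```

Even though each exponential draw is strongly right-skewed, the 2000 sample means are approximately normal with the predicted center and spread.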
Outlier
An extreme data value that can strongly affect x̄ and s, especially when n is small, potentially undermining t procedures.
Robustness
The idea that t procedures often work reasonably well even when normality is not perfect, especially for moderate/large n without extreme outliers.
Statistical Significance
A result considered unlikely under H0 (typically p ≤ α); indicates evidence against H0, not necessarily a large or important effect.
Practical Importance
Whether an effect size is meaningful in context; a result can be statistically significant but practically trivial (especially with large n).
Two-Sample t Procedures
Inference methods for comparing means of two independent groups using x̄1 − x̄2 and an unpooled (Welch) standard error when σ1 and σ2 are unknown.
Parameter (μ1 − μ2)
The true difference in population means between group 1 and group 2 (often defined as population 1 minus population 2).
Statistic (x̄1 − x̄2)
The observed difference between sample means, used to estimate μ1 − μ2.
Two-Sample Standard Error
Estimated SD of x̄1 − x̄2 when σ’s are unknown: SE = √(s1²/n1 + s2²/n2).
Welch–Satterthwaite Approximation
A formula, usually evaluated by technology, that approximates the df for two-sample t procedures; the df depends on s1, s2, n1, and n2 and is generally not a whole number.
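The two-sample SE and the Welch–Satterthwaite df are straightforward to compute once the summaries are known. A sketch with hypothetical group summaries:

```python
import math

# Hypothetical two-sample summaries (illustrative values)
s1, n1 = 2.4, 25
s2, n2 = 3.1, 30

v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2
se = math.sqrt(v1 + v2)   # SE = √(s1²/n1 + s2²/n2)

# Welch–Satterthwaite df approximation
df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
```

The resulting df always falls between min(n1 − 1, n2 − 1) and n1 + n2 − 2, which is why the conservative hand-calculation shortcut uses the smaller of n1 − 1 and n2 − 1.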
Paired (Matched-Pairs) Data
Data where observations come in natural pairs (e.g., before/after on the same person); the two measurements within a pair are not independent.
Difference Variable (d)
For paired data, compute one value per pair: d = (first measurement) − (second measurement); inference is then done on the d’s.
Mean of Differences (μd)
The population mean of the paired differences; the parameter for paired t procedures.
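A paired t analysis reduces to a one-sample t on the differences. A sketch with made-up before/after scores for eight subjects:

```python
import math

# Hypothetical before/after scores for 8 subjects (illustrative values)
before = [72, 68, 75, 80, 66, 71, 78, 69]
after = [75, 70, 74, 84, 70, 73, 82, 71]

d = [a - b for a, b in zip(after, before)]   # one difference per pair
n = len(d)
dbar = sum(d) / n                            # mean difference (estimates μd)
sd = math.sqrt(sum((x - dbar) ** 2 for x in d) / (n - 1))
t = dbar / (sd / math.sqrt(n))               # one-sample t on the d's, testing H0: μd = 0
```

Note that inference uses only the n = 8 differences, with df = n − 1 = 7, not the 16 raw measurements.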
Simulation-Based p-Value
An estimated p-value found by simulating the null model many times and counting the proportion of simulated statistics at least as extreme as the observed statistic.
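One common simulation scheme (a sketch, not the only valid null model): recenter the observed sample at μ0 so H0 holds, resample with replacement many times, and count how often the simulated mean is at least as far from μ0 as the observed mean. All data values here are hypothetical:

```python
import random
import statistics

random.seed(2)  # reproducible illustration

# Hypothetical observed sample; H0: population mean equals mu0
sample = [5.3, 5.1, 5.4, 5.0, 5.6, 5.2, 5.5, 5.1, 5.3, 5.4]
mu0 = 5.0
obs = statistics.fmean(sample) - mu0   # observed distance from mu0

# Null model: shift the sample so its mean is mu0, then resample
recentered = [x - statistics.fmean(sample) + mu0 for x in sample]
reps, count = 5000, 0
for _ in range(reps):
    sim = [random.choice(recentered) for _ in range(len(sample))]
    if abs(statistics.fmean(sim) - mu0) >= abs(obs):   # at least as extreme (two-sided)
        count += 1

p_hat = count / reps   # estimated p-value
```

Here the observed mean is so far from μ0 that essentially no simulated statistic matches it, giving an estimated p-value near 0.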
Median Absolute Deviation (MAD)
A variability measure: the median of the absolute deviations from the median; can be used with simulations to assess unusual variability.
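The MAD is two nested medians, which makes it resistant to outliers. A sketch on an illustrative data set containing one extreme value:

```python
import statistics

# Illustrative data with one outlier (42)
data = [4, 5, 5, 6, 7, 7, 8, 42]

med = statistics.median(data)                          # median of the data
mad = statistics.median(abs(x - med) for x in data)    # median of absolute deviations
```

The outlier barely moves the MAD, whereas it would inflate the standard deviation dramatically; that resistance is what makes the MAD useful as a simulation statistic for variability.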