Unit 9: Inference for Quantitative Data: Slopes

0.0(0)
Studied by 0 people
0%Unit 9 Mastery
0%Exam Mastery
Build your Mastery score
multiple choiceAP Practice
Supplemental Materials
call kaiCall Kai
Card Sorting

1/49

Last updated 2:11 AM on 3/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

50 Terms

1
New cards

Least-squares regression line

The line that minimizes the sum of squared residuals and summarizes the linear relationship between an explanatory variable x and a response variable y in a sample.

2
New cards

Explanatory variable (x)

The variable used to explain or predict changes in the response; typically placed on the horizontal axis in regression.

3
New cards

Response variable (y)

The outcome variable being predicted or explained by x; typically placed on the vertical axis in regression.

4
New cards

Sample slope (b1b_1)

The slope of the least-squares regression line from a sample; estimates how the predicted y changes for a 1-unit increase in x.

5
New cards

Population slope (β1\beta_1 or β\beta)

The true slope parameter in the population regression model; represents how the population mean response changes with x.

6
New cards

Sample intercept (b0b_0)

The intercept of the sample regression line; the predicted value of y when x = 0 (may not be meaningful if x=0 is outside the data’s context).

7
New cards

Population intercept (β0\beta_0 or α\alpha)

The true intercept parameter in the population regression line; the population mean response when x = 0.

8
New cards

Population regression line

The population model for the mean response: μy=β0+β1x\mu_y = \beta_0 + \beta_1 x (equivalently μy=α+βx\mu_y = \alpha + \beta x).

9
New cards

Sample regression line

The fitted line from sample data: y^=b0+b1x\hat{y} = b_0 + b_1 x.

10
New cards

Slope inference

Using sample regression results to draw conclusions about the population slope parameter β1\beta_1 (e.g., testing β1=0\beta_1=0 or estimating β1\beta_1 with a confidence interval).

11
New cards

Null hypothesis for slope

A statement about the population slope, most commonly H0:β1=0H_0: \beta_1 = 0 (no linear relationship in the population).

12
New cards

Alternative hypothesis for slope

The competing claim about the population slope, such as Ha:β10H_a: \beta_1 \ne 0, Ha:β1>0H_a: \beta_1 > 0, or Ha:β1<0H_a: \beta_1 < 0 (chosen based on context).

13
New cards

Slope of 0 (flat population line)

A population slope of 0 means the population mean response does not change as x changes; no linear relationship is supported.

14
New cards

Linear relationship (regression context)

A relationship where the mean of y changes approximately linearly with x; required for valid linear regression slope inference.

15
New cards

Correlation–slope test equivalence (simple linear regression)

With one explanatory variable, testing for a linear relationship via regression is equivalent to testing whether the population correlation is 0, but regression questions should be phrased in slope terms.

16
New cards

Association

A relationship between variables where changes in one are related to changes in the other; regression slope inference primarily supports association in a population.

17
New cards

Causation

A cause-and-effect relationship; can be concluded from a significant slope only when the study design is a randomized experiment (within its scope).

18
New cards

Randomized experiment

A study where treatments are randomly assigned; supports cause-and-effect conclusions when conditions are met and results are significant.

19
New cards

Observational study

A study where variables are observed without random assignment; a significant slope supports association only, not causation.

20
New cards

Lurking variable

An unmeasured variable that may influence both x and y, potentially explaining an observed association in an observational study.

21
New cards

Sampling distribution of the slope

The distribution of sample slopes b1b_1 that would be obtained from repeated samples (or repetitions of an experiment) from the same population.

22
New cards

Mean of the sampling distribution (μb\mu_b)

The average of all possible sample slopes; under conditions, this mean equals the true population slope β1\beta_1.

23
New cards

Standard deviation of the sampling distribution (σb\sigma_b)

The true spread of sample slopes b1b_1 across repeated samples; typically unknown in practice.

24
New cards

Standard error of the slope (SEb1SE_{b_1})

An estimate of the standard deviation of the sampling distribution of b1b_1, computed from sample data; used for t inference about the slope.

25
New cards

t distribution (for slope inference)

The distribution used for slope tests/intervals because the true variability is unknown and must be estimated from sample residuals.

26
New cards

Residual

The vertical difference between an observed y value and its predicted value: yy^y - \hat{y}.

27
New cards

Residual standard deviation (s)

A measure of typical prediction error around the fitted line: s=Σ(yy^)2/(n2)s = \sqrt{\Sigma(y-\hat{y})^2 / (n-2)}.

28
New cards

Degrees of freedom (df=n2df = n - 2)

The df used in regression slope t procedures; n2n-2 because both slope and intercept are estimated from the data.

29
New cards

t statistic for slope

Standardizes the difference between the sample slope and hypothesized slope: t=(b1β1)/SEb1t = (b_1 - \beta_1) / SE_{b_1} (often with β1=0\beta_1=0).

30
New cards

p-value (slope test)

The probability, assuming H0H_0 is true, of observing a sample slope (or t statistic) at least as extreme as the one obtained, in the direction(s) of HaH_a.

31
New cards

Significance level (α\alpha)

The cutoff probability for deciding whether evidence is strong enough to reject H0H_0 (e.g., α=0.05\alpha = 0.05).

32
New cards

Statistical significance

A result is statistically significant if the p-value is less than α\alpha, indicating evidence against H0H_0 beyond random sampling variation.

33
New cards

Practical importance

Whether an effect is large enough to matter in context; statistical significance does not guarantee practical importance.

34
New cards

Confidence interval for the population slope

A range of plausible values for β1\beta_1, typically computed as b1±tSEb1b_1 \pm t^* SE_{b_1}, and interpreted as change in the population mean response per 1 unit of x.

35
New cards

Critical value (tt^*)

The t multiplier from the t distribution (with df=n2df = n-2) that matches the desired confidence level for an interval.

36
New cards

Margin of error (ME)

The amount added/subtracted in a confidence interval: ME=t×SEb1ME = t^* \times SE_{b_1}.

37
New cards

Mean response (μy\mu_y)

The population average value of y at a given x; regression inference targets how μy\mu_y changes with x, not individual outcomes.

38
New cards

Predicted value (y^\hat{y})

The value of y predicted by the sample regression line for a given x.

39
New cards

Linearity condition

Condition that the relationship between x and the mean of y is approximately linear; checked with a scatterplot and/or residual plot (no curved pattern).

40
New cards

Independence condition

Condition that observations are independent; supported by random sampling or random assignment and by avoiding situations with dependence (e.g., related subjects).

41
New cards

10% condition

When sampling without replacement, independence is plausible if the sample size n is less than 10% of the population.

42
New cards

Time correlation

Dependence across observations collected over time (e.g., daily prices/temperatures) that can violate the independence assumption.

43
New cards

Normality of residuals condition

Condition that residuals are approximately normally distributed around the line; checked with a histogram or normal probability plot of residuals (not y itself).

44
New cards

Equal variance (constant spread) condition

Condition that the variability of residuals is roughly constant across x; checked by looking for no fanning/funneling in a residual plot.

45
New cards

Funnel (fan) pattern

A residual plot pattern where residual spread increases or decreases with x, indicating nonconstant variance (violating equal variance).

46
New cards

Influential point

A data point that strongly affects the fitted line (slope, SE, p-value) and can change conclusions; often associated with extreme x or large residuals.

47
New cards

High leverage point

A point with an extreme x-value compared to the rest of the data that can “pull” the regression line.

48
New cards

Outlier (large residual)

A point with an unusually large vertical deviation from the regression line; can distort regression results, especially if also high leverage.

49
New cards

Extrapolation

Using a regression model to predict for x-values far outside the observed range; predictions and inference are less trustworthy there.

50
New cards

r-squared (coefficient of determination)

The proportion of variation in y explained by the linear model with x; unitless and distinct from interpreting slope or establishing causation.