Unit 9: Inference for Quantitative Data: Slopes

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/49

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 2:11 AM on 3/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

50 Terms

1
New cards

Least-squares regression line

The line that minimizes the sum of squared residuals and summarizes the linear relationship between an explanatory variable x and a response variable y in a sample.

2
New cards

Explanatory variable (x)

The variable used to explain or predict changes in the response; typically placed on the horizontal axis in regression.

3
New cards

Response variable (y)

The outcome variable being predicted or explained by x; typically placed on the vertical axis in regression.

4
New cards

Sample slope (b1)

The slope of the least-squares regression line from a sample; estimates how the predicted y changes for a 1-unit increase in x.

5
New cards

Population slope (β1 or β)

The true slope parameter in the population regression model; represents how the population mean response changes with x.

6
New cards

Sample intercept (b0)

The intercept of the sample regression line; the predicted value of y when x = 0 (may not be meaningful if x=0 is outside the data’s context).

7
New cards

Population intercept (β0 or α)

The true intercept parameter in the population regression line; the population mean response when x = 0.

8
New cards

Population regression line

The population model for the mean response: μy = β0 + β1x (equivalently μy = α + βx).

9
New cards

Sample regression line

The fitted line from sample data: ŷ = b0 + b1x.

10
New cards

Slope inference

Using sample regression results to draw conclusions about the population slope parameter β1 (e.g., testing β1=0 or estimating β1 with a confidence interval).

11
New cards

Null hypothesis for slope

A statement about the population slope, most commonly H0: β1 = 0 (no linear relationship in the population).

12
New cards

Alternative hypothesis for slope

The competing claim about the population slope, such as Ha: β1 ≠ 0, Ha: β1 > 0, or Ha: β1 < 0 (chosen based on context).

13
New cards

Slope of 0 (flat population line)

A population slope of 0 means the population mean response does not change as x changes; no linear relationship is supported.

14
New cards

Linear relationship (regression context)

A relationship where the mean of y changes approximately linearly with x; required for valid linear regression slope inference.

15
New cards

Correlation–slope test equivalence (simple linear regression)

With one explanatory variable, testing for a linear relationship via regression is equivalent to testing whether the population correlation is 0, but regression questions should be phrased in slope terms.

16
New cards

Association

A relationship between variables where changes in one are related to changes in the other; regression slope inference primarily supports association in a population.

17
New cards

Causation

A cause-and-effect relationship; can be concluded from a significant slope only when the study design is a randomized experiment (within its scope).

18
New cards

Randomized experiment

A study where treatments are randomly assigned; supports cause-and-effect conclusions when conditions are met and results are significant.

19
New cards

Observational study

A study where variables are observed without random assignment; a significant slope supports association only, not causation.

20
New cards

Lurking variable

An unmeasured variable that may influence both x and y, potentially explaining an observed association in an observational study.

21
New cards

Sampling distribution of the slope

The distribution of sample slopes b1 that would be obtained from repeated samples (or repetitions of an experiment) from the same population.

22
New cards

Mean of the sampling distribution (μb)

The average of all possible sample slopes; under conditions, this mean equals the true population slope β1.

23
New cards

Standard deviation of the sampling distribution (σb)

The true spread of sample slopes b1 across repeated samples; typically unknown in practice.

24
New cards

Standard error of the slope (SEb1)

An estimate of the standard deviation of the sampling distribution of b1, computed from sample data; used for t inference about the slope.

25
New cards

t distribution (for slope inference)

The distribution used for slope tests/intervals because the true variability is unknown and must be estimated from sample residuals.

26
New cards

Residual

The vertical difference between an observed y value and its predicted value: y − ŷ.

27
New cards

Residual standard deviation (s)

A measure of typical prediction error around the fitted line: s = sqrt( Σ(y−ŷ)² / (n−2) ).

28
New cards

Degrees of freedom (df = n − 2)

The df used in regression slope t procedures; n−2 because both slope and intercept are estimated from the data.

29
New cards

t statistic for slope

Standardizes the difference between the sample slope and hypothesized slope: t = (b1 − β1) / SEb1 (often with β1=0).

30
New cards

p-value (slope test)

The probability, assuming H0 is true, of observing a sample slope (or t statistic) at least as extreme as the one obtained, in the direction(s) of Ha.

31
New cards

Significance level (α)

The cutoff probability for deciding whether evidence is strong enough to reject H0 (e.g., α = 0.05).

32
New cards

Statistical significance

A result is statistically significant if the p-value is less than α, indicating evidence against H0 beyond random sampling variation.

33
New cards

Practical importance

Whether an effect is large enough to matter in context; statistical significance does not guarantee practical importance.

34
New cards

Confidence interval for the population slope

A range of plausible values for β1, typically computed as b1 ± t*SEb1, and interpreted as change in the population mean response per 1 unit of x.

35
New cards

Critical value (t*)

The t multiplier from the t distribution (with df = n−2) that matches the desired confidence level for an interval.

36
New cards

Margin of error (ME)

The amount added/subtracted in a confidence interval: ME = t* × SEb1.

37
New cards

Mean response (μy)

The population average value of y at a given x; regression inference targets how μy changes with x, not individual outcomes.

38
New cards

Predicted value (ŷ)

The value of y predicted by the sample regression line for a given x.

39
New cards

Linearity condition

Condition that the relationship between x and the mean of y is approximately linear; checked with a scatterplot and/or residual plot (no curved pattern).

40
New cards

Independence condition

Condition that observations are independent; supported by random sampling or random assignment and by avoiding situations with dependence (e.g., related subjects).

41
New cards

10% condition

When sampling without replacement, independence is plausible if the sample size n is less than 10% of the population.

42
New cards

Time correlation

Dependence across observations collected over time (e.g., daily prices/temperatures) that can violate the independence assumption.

43
New cards

Normality of residuals condition

Condition that residuals are approximately normally distributed around the line; checked with a histogram or normal probability plot of residuals (not y itself).

44
New cards

Equal variance (constant spread) condition

Condition that the variability of residuals is roughly constant across x; checked by looking for no fanning/funneling in a residual plot.

45
New cards

Funnel (fan) pattern

A residual plot pattern where residual spread increases or decreases with x, indicating nonconstant variance (violating equal variance).

46
New cards

Influential point

A data point that strongly affects the fitted line (slope, SE, p-value) and can change conclusions; often associated with extreme x or large residuals.

47
New cards

High leverage point

A point with an extreme x-value compared to the rest of the data that can “pull” the regression line.

48
New cards

Outlier (large residual)

A point with an unusually large vertical deviation from the regression line; can distort regression results, especially if also high leverage.

49
New cards

Extrapolation

Using a regression model to predict for x-values far outside the observed range; predictions and inference are less trustworthy there.

50
New cards

r-squared (coefficient of determination)

The proportion of variation in y explained by the linear model with x; unitless and distinct from interpreting slope or establishing causation.

Explore top notes

note
Notes
Updated 1187d ago
0.0(0)
note
Photons
Updated 900d ago
0.0(0)
note
Biology - Evolution
Updated 1476d ago
0.0(0)
note
RIse of Democracy Vocab Pt. 3
Updated 1499d ago
0.0(0)
note
Indirect Values
Updated 1499d ago
0.0(0)
note
Notes
Updated 1187d ago
0.0(0)
note
Photons
Updated 900d ago
0.0(0)
note
Biology - Evolution
Updated 1476d ago
0.0(0)
note
RIse of Democracy Vocab Pt. 3
Updated 1499d ago
0.0(0)
note
Indirect Values
Updated 1499d ago
0.0(0)

Explore top flashcards

flashcards
faf
40
Updated 957d ago
0.0(0)
flashcards
faf
40
Updated 957d ago
0.0(0)