1/44
Name | Mastery | Learn | Test | Matching | Spaced | Call with Kai |
|---|
No analytics yet
Send a link to your students to track their progress
Categorical Variables
Variables that place individuals into groups or categories.
Quantitative Variables
Variables that take numerical values for which arithmetic operations make sense.
Discrete Variables
Quantitative variables that take a fixed set of countable values with gaps between them.
Continuous Variables
Quantitative variables that take any value in an interval with no gaps.
Frequency Table
A table displaying the count of individuals in each category.
Relative Frequency Table
A table displaying the percent or proportion of individuals in each category.
Relative Frequency Formula
Relative Frequency = Frequency / Total Sample Size.
Bar Chart
A graph representing categorical data where bars represent categories and their height represents frequency.
Pie Chart
A circular graph showing proportions of the whole for categorical data.
Dotplot
A graph where each data value is shown as a dot above its location on a number line.
Stemplot
A graphical display of data where the stem represents the first digit(s) and the leaf represents the final digit.
Histogram
A graph showing the frequency distribution of quantitative data in bins.
SOCS Method
A method for describing the distribution: Shape, Outliers, Center, Spread.
Symmetric Distribution
A distribution where the left and right sides are approximate mirror images.
Skewed Right Distribution
A distribution where the tail extends to the right; mean > median.
Skewed Left Distribution
A distribution where the tail extends to the left; mean < median.
Unimodal Distribution
A distribution with one distinct peak.
Bimodal Distribution
A distribution with two distinct peaks suggesting two different populations.
Median
The midpoint of a distribution where half the observations are smaller and half are larger.
Mean
The arithmetic average of a data set.
Range
The difference between the maximum and minimum values in a data set.
Standard Deviation
A measure of the typical distance of the values from the mean.
Interquartile Range (IQR)
The range of the middle 50% of the data.
Five-Number Summary
A set of descriptive statistics that includes the minimum, Q1, median, Q3, and maximum.
Outliers
Values that fall far away from the rest of the data.
1.5 x IQR Rule for Outliers
An observation is an outlier if it falls outside the fences: Lower Fence = Q1 - 1.5(IQR), Upper Fence = Q3 + 1.5(IQR).
Boxplot
A graphical representation of the five-number summary, showing minimum, Q1, median, Q3, and maximum.
Normal Distribution
A symmetric, bell-shaped density curve defined by mean (μ) and standard deviation (σ).
Empirical Rule
In a Normal distribution: 68% within μ ± 1σ, 95% within μ ± 2σ, 99.7% within μ ± 3σ.
Z-score
A measure of how many standard deviations a value is from the mean, calculated as z = (value - mean) / standard deviation.
Cumulative Relative Frequency Graph
A graph that illustrates the percentile of data in a distribution.
Quantitative Data
Data that can be expressed as numbers, often used for statistical analysis.
Categorical Data
Data that can be grouped into categories or labels, not quantitative.
Clustering
Distinct groups of data points within a distribution.
Visualization Techniques
Methods such as bar charts, pie charts, dotplots, stemplots, and histograms to graphically represent data.
Comparative Language
Using terms like greater than, less than, and similar to when comparing distributions.
Resistance in Statistics
Certain measures, like median and IQR, are not significantly affected by outliers.
Distribution Characteristics
Includes shape, center, spread, and outliers to describe statistical data.
Frequency Analysis
The use of frequency distribution tables to summarize data.
Statistical Graphs
Visual representations of data used to observe patterns, trends, and distributions.
Data Summarization
The process of presenting data in a simplified manner using tables and graphs.
Skewness
The measure of asymmetry in the distribution of data.
Resistant vs. Non-Resistant Measures
Resistant measures, like median, are less affected by extreme values; non-resistant measures, like mean, can be significantly affected.
Data Visualization Importance
Graphical representations help to quickly convey complex data insights.
Potential Pitfalls in Data Analysis
Common mistakes in understanding variables, describing data, and visualizing distributions.