AP CSP Big Idea 2 (Data): Analysis, Metadata, and Preparing Data for Use

0.0(0)
Studied by 0 people
0%Big Idea 2 Mastery
0%Exam Mastery
Build your Mastery score
multiple choiceMultiple Choice
call kaiCall Kai
Supplemental Materials
Card Sorting

1/24

Last updated 3:08 PM on 3/12/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

25 Terms

1
New cards

Data (AP CSP sense)

Any information stored in a form a computer can process (e.g., numbers, text, images, sounds, locations, clicks, sensor readings).

2
New cards

Program (in data analysis)

A set of precise steps that takes data as input and can transform, summarize, and extract patterns from it.

3
New cards

Transform (data)

Change data’s form or representation (e.g., convert units, create new fields, recode categories).

4
New cards

Summarize (data)

Compute compact descriptions of a dataset such as counts, totals, averages, minimum/maximum, or distributions.

5
New cards

Pattern extraction

Using computation to find relationships, trends, clusters, or unusual values (anomalies) in data.

6
New cards

Iteration (through a dataset)

Looping through many records/values to compute results (e.g., checking every row in a table).

7
New cards

Filtering

Keeping only records that match a condition (e.g., only rows where grade = 12 or value > threshold).

8
New cards

Aggregation

Grouping and combining data to produce summaries (e.g., totals per category).

9
New cards

Visualization

Presenting results in charts/graphs/maps so humans can interpret patterns and summaries.

10
New cards

Data-processing pipeline

Common sequence of steps: input → parse → clean/validate → transform → analyze → output.

11
New cards

Parse

Interpret a data format into usable parts (e.g., split a CSV row into columns).

12
New cards

Clean/Validate

Detect and handle missing/invalid values, formatting issues, duplicates, and inconsistencies so analysis is reliable.

13
New cards

Selection (rows)

A process that keeps specific records based on a rule (e.g., appending only rows where grade = 12).

14
New cards

Counter (variable)

A variable that increases by 1 for each item that matches a condition (used for counting).

15
New cards

Sum (accumulator)

A running total that adds the data values themselves (used before computing totals/averages).

16
New cards

Average (mean)

A summary statistic computed as total sum divided by number of items; requires both sum and count (or LENGTH).

17
New cards

Missing values

Data entries not recorded or absent (e.g., blank, NA, null, ?), which can skew or break computations if not handled.

18
New cards

Duplicate records

The same person/event recorded multiple times; can distort counts and totals unless duplicates are appropriately handled.

19
New cards

Inconsistent categories

Same category represented in different text forms (e.g., "NY", "New York", "newyork"), preventing correct grouping/counting without standardization.

20
New cards

Outlier / impossible value

An unusually extreme or invalid entry (e.g., negative age, temperature of 999) that may indicate error or a rare real event.

21
New cards

Biased sample

Data that does not represent the target population; programs cannot fix this, so results may be misleading.

22
New cards

Correlation vs. causation

A correlation means two values vary together; it does not prove one causes the other.

23
New cards

Metadata

“Data about data”: context that explains meaning, units, quality, and constraints so data can be interpreted correctly.

24
New cards

Data dictionary

A metadata document describing each field/column (name, meaning, data type, allowed values, units, missing-value rules).

25
New cards

Imputation

Filling in missing data with an estimated/default value (e.g., group average), which can hide uncertainty and distort results if unjustified.

Explore top notes

note
RIse of Democracy Vocab Pt. 3
Updated 1507d ago
0.0(0)
note
KOREAN - IMPORTANT VOCABULARY
Updated 1262d ago
0.0(0)
note
Indirect Values
Updated 1507d ago
0.0(0)
note
biology
Updated 1941d ago
0.0(0)
note
Factorisation (copy)
Updated 1081d ago
0.0(0)
note
Chapter 9 - Jacksonian Era
Updated 1444d ago
0.0(0)
note
Hello
Updated 1195d ago
0.0(0)
note
RIse of Democracy Vocab Pt. 3
Updated 1507d ago
0.0(0)
note
KOREAN - IMPORTANT VOCABULARY
Updated 1262d ago
0.0(0)
note
Indirect Values
Updated 1507d ago
0.0(0)
note
biology
Updated 1941d ago
0.0(0)
note
Factorisation (copy)
Updated 1081d ago
0.0(0)
note
Chapter 9 - Jacksonian Era
Updated 1444d ago
0.0(0)
note
Hello
Updated 1195d ago
0.0(0)

Explore top flashcards

flashcards
faf
40
Updated 965d ago
0.0(0)
flashcards
faf
40
Updated 965d ago
0.0(0)