Data and Statistics
Master statistical analysis and data interpretation for the ParaPro test
What You'll Learn
Understanding Data
The foundation of statistics
What is Data?
Data is information collected through observation, measurement, or research. It can be numbers, words, measurements, observations, or descriptions.
Quantitative Data
Numerical information that can be measured
- • Test scores: 85, 92, 78, 96
- • Heights: 5'2", 5'8", 6'1"
- • Temperature: 72°F, 68°F, 75°F
- • Age: 12, 13, 14, 15 years
Qualitative Data
Descriptive information about qualities
- • Colors: red, blue, green
- • Preferences: like, dislike
- • Categories: small, medium, large
- • Types: fiction, non-fiction
Organizing Data
Frequency Tables
Shows how often each value appears in a data set
| Score | Tally | Frequency |
|---|---|---|
| 90-100 | |||| | 4 |
| 80-89 | |||| || | 7 |
| 70-79 | ||| | 3 |
Ordered Lists
Data arranged from least to greatest (or vice versa)
Original: 85, 92, 78, 88, 95, 82, 90
Ordered: 78, 82, 85, 88, 90, 92, 95
Measures of Central Tendency
Finding the "middle" of data
Mean (Average)
The mean is the sum of all values divided by the number of values. It's what most people think of as the "average."
Formula
Mean = Sum of all values ÷ Number of values
Example: Test Scores
Find the mean of: 85, 92, 78, 88, 95, 82, 90
Step 1: Add all scores: 85 + 92 + 78 + 88 + 95 + 82 + 90 = 610
Step 2: Count values: 7 scores
Step 3: Divide: 610 ÷ 7 = 87.14
Mean = 87.14
Median (Middle Value)
The median is the middle value when data is arranged in order. It divides the data set in half.
Odd Number of Values
Data: 78, 82, 85, 88, 90, 92, 95
7 values (odd), so median is the 4th value
78, 82, 85, 88, 90, 92, 95
Median = 88
Even Number of Values
Data: 78, 82, 85, 88, 90, 92
6 values (even), average of 3rd and 4th
78, 82, 85, 88, 90, 92
Median = (85 + 88) ÷ 2 = 86.5
Mode (Most Common Value)
The mode is the value that appears most often in a data set. A data set can have one mode, multiple modes, or no mode.
One Mode
Data: 85, 92, 85, 88, 85, 90
85 appears 3 times (most frequent)
Mode = 85
Multiple Modes
Data: 85, 92, 85, 88, 92, 90
85 and 92 each appear twice
Modes = 85 and 92
No Mode
Data: 85, 92, 78, 88, 95, 82, 90
Each value appears only once
No mode
Measures of Spread
How spread out is the data?
Range
The range shows how spread out the data is by finding the difference between the highest and lowest values.
Formula
Range = Highest value - Lowest value
Example
Data: 78, 82, 85, 88, 90, 92, 95
Highest value: 95
Lowest value: 78
Range = 95 - 78 = 17
Understanding Outliers
Outliers are values that are much higher or lower than most of the data. They can significantly affect the mean.
Impact of Outliers
Without outlier:
Data: 80, 82, 85, 88, 90
Mean = 85
Median = 85
With outlier:
Data: 80, 82, 85, 88, 90, 20
Mean = 74.2 (affected!)
Median = 83.5 (less affected)
Data Collection Methods
Ways to gather data
Surveys
Asking questions to collect information
- • Written questionnaires
- • Interviews
- • Online forms
- • Phone surveys
Observations
Watching and recording what happens
- • Counting occurrences
- • Recording behaviors
- • Measuring results
- • Tracking patterns
Experiments
Testing under controlled conditions
- • Scientific tests
- • Before/after comparisons
- • Control groups
- • Repeated trials
Existing Records
Using already collected data
- • School records
- • Government data
- • Historical information
- • Previous studies
Good Sampling Practices
- ✓ Random selection: Everyone has equal chance of being chosen
- ✓ Large enough size: More data = more reliable results
- ✓ Representative: Sample reflects the whole population
- ✗ Avoid bias: Don't favor certain groups or outcomes
Analyzing and Interpreting Data
Drawing conclusions from data
Drawing Conclusions
Example: Class Test Scores
Two classes took the same test:
Class A
Mean: 82, Median: 83, Mode: 85
Range: 15 (75-90)
Class B
Mean: 82, Median: 82, Mode: 80
Range: 35 (60-95)
Conclusion: Both classes have the same mean, but Class B has more variation in scores (larger range). Class A's scores are more consistent.
Making Predictions
We can use data patterns to make reasonable predictions about future events or unknown data.
Example: Book Reading
A student's reading over 5 days: 20, 22, 19, 21, 23 pages
Mean = 21 pages per day
Prediction: In 10 days, the student will likely read about 210 pages (21 × 10)
Real-World Applications
Statistics in daily life
Sports Statistics
A basketball player's points in 5 games: 18, 22, 15, 25, 20
Mean: 20 points per game
Median: 20 points
Range: 10 points
Use: Evaluate player performance and consistency
Weather Data
Daily high temperatures for a week: 72°, 75°, 78°, 74°, 73°, 76°, 74°F
Mean: 74.6°F
Mode: 74°F (appears twice)
Range: 6°F
Use: Plan activities and understand weather patterns
Classroom Assessment
Quiz scores: 8, 9, 7, 10, 6, 9, 8, 9, 10, 7 (out of 10)
Mean: 8.3
Median: 8.5
Mode: 9
Use: Identify areas needing review and student progress
Common Mistakes to Avoid
Confusing mean, median, and mode
Using the wrong measure of central tendency
✓ Remember: Mean = average, Median = middle, Mode = most common
Not ordering data for median
Finding median without arranging data from least to greatest
✓ Always order data first when finding median
Including outliers without consideration
Not recognizing when outliers skew the mean
✓ Consider using median when outliers are present
Miscounting when finding mode
Not carefully tracking frequency of each value
✓ Use tally marks or frequency table to track occurrences
Practice Problems
Test your understanding
Basic Statistics
Given the data set: 12, 15, 18, 12, 20, 16, 12, 19, 17. Find the mean, median, mode, and range.
Comparing Data Sets
Group A scores: 85, 88, 92, 86, 90. Group B scores: 78, 95, 82, 91, 94. Calculate the mean for each group and determine which has the larger range.
Real-World Application
A student reads the following pages each day: 25, 30, 22, 28, 35, 20, 26. What is the average number of pages read per day?
Show Answers
1. Mean = 15.67, Median = 16, Mode = 12, Range = 8
Sum: 141, Count: 9, Ordered: 12,12,12,15,16,17,18,19,20
2. Group A mean = 88.2, Group B mean = 88. Group B has larger range (17 vs 7)
3. 26.57 pages per day (186 ÷ 7 = 26.57)
Key Takeaways
Mean: The average of all values (add all, divide by count)
Median: The middle value when data is ordered
Mode: The value that appears most often
Range: The spread of data (highest - lowest)
Analysis: Use statistics to understand patterns and make predictions
Continue Learning
Ready to Practice?
Test your data and statistics knowledge with practice questions