This hands-on exercise will guide you through calculating the mean, median, mode, range, variance, and standard deviation for a small dataset. Performing these calculations manually, at least once, helps solidify understanding of what these values represent before relying on software tools.Our Sample Dataset: Exam ScoresImagine a small class took a short quiz, and their scores (out of 100) are as follows:[85, 90, 78, 92, 85, 88, 76, 95, 85, 90]This list represents our dataset. Let's analyze these scores using the statistics we've learned.Calculating Measures of Central TendencyThese statistics help us understand the "center" or typical value of the data.1. Mean (Average)The mean is the sum of all values divided by the number of values. The formula is: $$ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} $$ Where $x_i$ represents each score, and $n$ is the total number of scores.Sum of scores: $85 + 90 + 78 + 92 + 85 + 88 + 76 + 95 + 85 + 90 = 864$Number of scores (n): $10$Mean: $\bar{x} = \frac{864}{10} = 86.4$The average score for this quiz is 86.4.2. Median (Middle Value)The median is the middle value when the data is sorted. If there's an even number of data points, it's the average of the two middle values.First, sort the scores: [76, 78, 85, 85, 85, 88, 90, 90, 92, 95]Number of scores (n): $10$ (an even number)Middle positions: For $n=10$, the middle positions are the 5th and 6th values.Values at middle positions: The 5th score is 85, and the 6th score is 88.Median: $M_e = \frac{85 + 88}{2} = \frac{173}{2} = 86.5$The median score is 86.5. Half the students scored below 86.5, and half scored above.3. Mode (Most Frequent Value)The mode is the value that appears most often in the dataset.Sorted scores: [76, 78, 85, 85, 85, 88, 90, 90, 92, 95]Frequencies:76: 1 time78: 1 time85: 3 times88: 1 time90: 2 times92: 1 time95: 1 timeMode: The score 85 appears most frequently (3 times).The mode score is 85.Calculating Measures of Spread (Variability)These statistics tell us how spread out or dispersed the data points are.1. RangeThe range is the difference between the highest and lowest values.Highest score (Max): 95Lowest score (Min): 76Range: $Max - Min = 95 - 76 = 19$The scores span a range of 19 points.2. Variance (Sample Variance, $s^2$)Variance measures the average squared difference of each score from the mean. We use the sample variance formula (dividing by $n-1$) because our scores represent a sample of potential student performance.The formula is: $$ s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1} $$Let's break this down:Mean ($\bar{x}$): We calculated this as 86.4.Calculate deviations from the mean ($x_i - \bar{x}$):$85 - 86.4 = -1.4$$90 - 86.4 = 3.6$$78 - 86.4 = -8.4$$92 - 86.4 = 5.6$$85 - 86.4 = -1.4$$88 - 86.4 = 1.6$$76 - 86.4 = -10.4$$95 - 86.4 = 8.6$$85 - 86.4 = -1.4$$90 - 86.4 = 3.6$Square the deviations ($(x_i - \bar{x})^2$):$(-1.4)^2 = 1.96$$(3.6)^2 = 12.96$$(-8.4)^2 = 70.56$$(5.6)^2 = 31.36$$(-1.4)^2 = 1.96$$(1.6)^2 = 2.56$$(-10.4)^2 = 108.16$$(8.6)^2 = 73.96$$(-1.4)^2 = 1.96$$(3.6)^2 = 12.96$Sum the squared deviations ($\sum (x_i - \bar{x})^2$): $1.96 + 12.96 + 70.56 + 31.36 + 1.96 + 2.56 + 108.16 + 73.96 + 1.96 + 12.96 = 318.4$Divide by $n-1$: Here, $n=10$, so $n-1=9$.Variance ($s^2$): $\frac{318.4}{9} \approx 35.38$The sample variance is approximately 35.38. This value is in "squared points," which isn't very intuitive.3. Standard Deviation (Sample Standard Deviation, $s$)Standard deviation is the square root of the variance. It gives us a measure of spread in the original units (quiz points).The formula is: $$ s = \sqrt{s^2} $$Standard Deviation ($s$): $\sqrt{35.38} \approx 5.95$The sample standard deviation is approximately 5.95 points. This suggests that, on average, scores tend to deviate from the mean score of 86.4 by about 5.95 points.Summary of FindingsFor our dataset [85, 90, 78, 92, 85, 88, 76, 95, 85, 90]:Mean: 86.4Median: 86.5Mode: 85Range: 19Variance ($s^2$): approx 35.38Standard Deviation ($s$): approx 5.95The mean and median are very close, suggesting the distribution of scores is relatively symmetric around the center. The mode is slightly lower. The standard deviation gives us a sense of the typical spread around the average score.Frequency Distribution and VisualizationWe can also look at the frequency of scores within certain ranges (bins). Let's group scores into bins of width 5:75-79: 2 scores (76, 78)80-84: 0 scores85-89: 4 scores (85, 85, 85, 88)90-94: 3 scores (90, 90, 92)95-99: 1 score (95)This frequency distribution can be visualized using a histogram:{"layout":{"title":"Distribution of Quiz Scores","xaxis":{"title":"Score Range"},"yaxis":{"title":"Number of Students"},"bargap":0.1,"template":"simple_white"},"data":[{"type":"histogram","x":[85,90,78,92,85,88,76,95,85,90],"marker":{"color":"#228be6"},"xbins":{"start":75,"end":100,"size":5}}]}Histogram showing the frequency of student scores within 5-point intervals. The tallest bar corresponds to the 85-89 range, reflecting the mode (85) falling within this bin.Using ToolsWhile manual calculation is useful for learning, in practice, you'll use software tools. Spreadsheet programs like Google Sheets or Microsoft Excel have functions like AVERAGE(), MEDIAN(), MODE.SNGL(), MAX(), MIN(), VAR.S(), and STDEV.S(). Programming languages like Python, with libraries such as NumPy or Pandas, provide similar functions (e.g., mean(), median(), mode(), var(), std()) that make these calculations effortless, especially for large datasets.This practical exercise demonstrated how to compute basic descriptive statistics. These numbers provide a first important summary of your data's characteristics, forming a foundation for more detailed analysis.