DigitHelm

Average Calculator | Mean, Median & Mode

Calculate the mean, median, mode, standard deviation, IQR, box plot, and more for any dataset.

Accepts commas, semicolons, spaces, or newlines. Negative numbers and decimals supported.

What Is the Average Calculator | Mean, Median & Mode?

The mean, median, and mode are the three pillars of descriptive statistics. Each answers a slightly different version of the same question: what is the typical value in this dataset? The arithmetic mean (sum ÷ count) is the most mathematically convenient and works beautifully for symmetric, well-behaved data, but a single extreme outlier can drag it far from where most values actually sit.

The median sidesteps that problem entirely. By picking the literal middle value of a sorted list it ignores the magnitude of extremes and focuses on rank. This is why economists report median household income rather than mean income, a handful of billionaires would inflate the mean while the median stays close to the experience of the typical household. The mode adds a third lens: it finds the most frequently occurring value, which is the only average that works on non-numeric (categorical) data such as favourite colours or product choices.

Beyond these three, this calculator also computes dispersion measures, range, IQR, standard deviation, variance, and coefficient of variation, that describe how spread out the data is, plus a five-number summary with an interactive box plot to visualise distribution shape and flag outliers automatically using Tukey's IQR fences.

Formula

Central Tendency Measures
Mean (Arithmetic)x̄ = Σx / n
MedianMiddle value of sorted data
ModeValue(s) with highest frequency
Midrange(Min + Max) / 2
Geometric Meanⁿ√(x₁ × x₂ × … × xₙ)
Harmonic Meann / Σ(1/xᵢ)
Dispersion & Spread
RangeMax − Min
IQRQ3 − Q1
Population Varianceσ² = Σ(xᵢ − μ)² / n
Sample Variances² = Σ(xᵢ − x̄)² / (n − 1)
Std Dev (Pop.)σ = √(σ²)
Std Dev (Sample)s = √(s²)
Coeff. of VariationCV = (s / |x̄|) × 100%
MADΣ|xᵢ − x̄| / n
x̄ / μ, sample mean / population mean
n, count of values
Σ, sum over all values
Q1–Q3, quartiles (25th, 50th, 75th percentile)

How to Use

Entering your data

  1. 1Type or paste numbers: Enter your dataset in the text area. Separate values with commas, spaces, semicolons, or newlines, any combination works.
  2. 2Use quick examples: Click one of the example buttons (Student Grades, Monthly Revenue, With Outliers, Bimodal) to load a sample dataset instantly.
  3. 3Click Calculate: Press the Calculate button or use Ctrl + Enter to run all statistics at once. Negative numbers and decimals are fully supported.
  4. 4Read the results: Central tendency, dispersion, and the five-number summary appear in labelled cards. Scroll down to see the box plot and summary details.

Exploring the Details panel

  1. 1Frequency Table: Switch to the Frequency Table tab to see every unique value with its count, relative percentage, cumulative percentage, and a visual frequency bar.
  2. 2Step-by-Step: Choose Step-by-Step to see the full working for mean, median, and mode laid out in monospace equations, great for checking homework.
  3. 3Sorted Data: The Sorted Data tab shows all values in ascending order as coloured chips: orange for the median, red for outliers detected by the Tukey IQR method.

Example Calculation

Example 1, Symmetric dataset (student grades)

Dataset: 78, 85, 92, 65, 88, 73, 95, 82, 69, 77  (n = 10)

Sum = 78+85+92+65+88+73+95+82+69+77 = 804

Mean = 804 ÷ 10 = 80.4

Sorted: 65, 69, 73, 77, 78, 82, 85, 88, 92, 95

Median = (78 + 82) ÷ 2 = 80 (average of 5th and 6th values)

Mode = No mode (all values appear once)

Mean and median are almost identical, which confirms the data is roughly symmetric.

Example 2, Skewed dataset with an outlier

Dataset: 10, 12, 11, 13, 12, 14, 11, 12, 85, 13  (n = 10)

Mean = 193 ÷ 10 = 19.3 ← pulled up by outlier 85

Sorted: 10, 11, 11, 12, 12, 12, 13, 13, 14, 85

Median = (12 + 12) ÷ 2 = 12 ← robust to the outlier

Mode = 12 (appears 3 times)

IQR = Q3 − Q1 = 13 − 11 = 2

Upper fence = 13 + 1.5 × 2 = 16 → 85 is an outlier

The outlier inflates the mean by 7 points while the median and mode stay unaffected, a classic case where median is the more representative average.

Example 3, Standard deviation worked out

Dataset: 4, 7, 13, 2  (n = 4)

Mean x̄ = (4+7+13+2) ÷ 4 = 26 ÷ 4 = 6.5

Deviations²: (4−6.5)² + (7−6.5)² + (13−6.5)² + (2−6.5)²

= 6.25 + 0.25 + 42.25 + 20.25 = 69

Pop. Variance σ² = 69 ÷ 4 = 17.25

Samp. Variance s² = 69 ÷ 3 = 23

Samp. Std Dev s = √23 ≈ 4.796

Example 4, Bimodal distribution

Dataset: 2, 2, 4, 5, 5, 6, 7, 8, 8, 8, 9, 10  (n = 12)

StatisticValueNote
Mean6.17Sum 74 ÷ 12
Median6.5(6 + 7) ÷ 2, average of 6th and 7th values
Mode2, 8Each appears twice, bimodal distribution
Std Dev2.59Sample standard deviation

Two separate peaks (2 and 8) suggest the data may come from two different groups, something a single average cannot capture alone.

Understanding Average | Mean, Median & Mode

What Is an Average?

In everyday language, “average” almost always means the arithmetic mean, add everything up and divide by how many items you have. But in statistics, “average” is an umbrella term for any measure of central tendency: a single number that represents the typical value in a dataset. The three most important are the mean, the median, and the mode, each illuminating a different facet of the data's centre.

Knowing which average to use, and understanding what the others are telling you, separates surface-level data reading from genuine statistical insight. A salary survey might report that the mean income is $90,000 while the median is $58,000. Both numbers are technically correct, but they paint very different pictures of reality. This calculator computes all three simultaneously, so you can compare them and draw the most accurate conclusion.

The Arithmetic Mean, When It Works Best

The arithmetic mean is calculated by dividing the sum of all values by the count of values (x̄ = Σx / n). It is the most mathematically tractable average and is the foundation of many statistical methods, variance, standard deviation, z-scores, and regression all build on it directly.

  • Best for: Symmetric, roughly bell-shaped distributions where extreme values are rare (height, weight, test scores in large groups).
  • Sensitive to outliers: One very large or very small value shifts the mean noticeably. In a dataset of salaries $40k, $45k, $50k, $55k, $1M, the mean exceeds $230k, far above what any typical employee earns.
  • Use in further calculations: Variance, standard deviation, and most inferential statistics are built around the mean. If you need to do further analysis, mean-based statistics are usually the right starting point.

Quick rule: if your histogram is roughly bell-shaped with no obvious outlier spikes, the mean is the best single summary. If it is skewed or has a heavy tail, lead with the median.

The Median, The Outlier-Resistant Average

The median is the value that sits exactly in the middle of a sorted dataset, with half the values below and half above. For an odd number of values, it is the central value; for an even count, it is the average of the two central values. No matter how large the highest value grows or how small the lowest shrinks, the median does not change, making it the preferred average whenever outliers or skew are present.

  • Income and wealth: National statistics agencies use median household income because a small number of very high earners would inflate the mean above what most families actually earn.
  • Real estate: Median home price is the standard industry metric for the same reason, a handful of luxury properties would skew the mean upward.
  • Medical data: Survival times, recovery periods, and biomarker distributions are often right-skewed (a few very long times), so median is the honest summary.
  • Response time / latency: Web engineers track median page load time alongside the 95th and 99th percentile to understand typical and worst-case user experience.

The Mode, Patterns and Peaks

The mode identifies the most frequently occurring value or values in a dataset. It is the only central tendency measure that works with categorical (non-numeric) data, you can find the modal shoe size, the most popular product category, or the most common survey response. For numeric data, the mode is useful for spotting clustering and natural breaks.

  • Unimodal: One clear peak, most test scores cluster around 75. Typical of a single homogenous group.
  • Bimodal: Two peaks, heights in a mixed male/female sample often form two humps. A signal that the dataset may contain two distinct subpopulations.
  • No mode: All values occur equally often. This happens with truly uniform data or very small, unique datasets.
  • Retail and inventory: The mode of size purchases tells a clothing retailer which size to stock most.

Understanding the Five-Number Summary and Box Plot

The five-number summary, minimum, Q1, median (Q2), Q3, and maximum, gives a complete skeletal picture of any distribution in just five values. This calculator renders it as an interactive box plot with whiskers extending to the Tukey fences, the IQR box shaded between Q1 and Q3, the median marked as a solid line, and outlier values plotted as individual red dots outside the whiskers.

ComponentCalculationWhat it tells you
Q1 (25th percentile)25% of values fall below this pointLower boundary of the middle half
Q2 / Median (50th)Middle value of the sorted datasetTrue center of the distribution
Q3 (75th percentile)75% of values fall below this pointUpper boundary of the middle half
IQRQ3 − Q1Spread of the central 50%, key to outlier detection
Lower fenceQ1 − 1.5 × IQRValues below this are flagged as outliers
Upper fenceQ3 + 1.5 × IQRValues above this are flagged as outliers

For a deeper dive into quartiles, see our Standard Deviation Calculator which explains spread in detail, or use the Z-Score Calculator to see how many standard deviations a particular value sits from the mean.

Mean vs Median vs Mode, Which One Should You Use?

SituationBest measureWhy
Symmetric, no outliersMeanMathematically efficient; used by inferential stats
Skewed or contains outliersMedianResistant to extreme values
Categorical dataModeOnly average defined for non-numeric data
Reporting typical salary/priceMedianHigh earners/prices would inflate the mean
Averaging growth ratesGeometric meanRates multiply, not add
Averaging speeds over distanceHarmonic meanSpeed is a rate: distance/time
Comparing spread across unitsCV %Standard deviation relative to the mean

Standard Deviation, Variance, and the Coefficient of Variation

Knowing the average is only half the story. Two datasets can share an identical mean but have radically different spread, one tightly clustered around the centre, the other scattered across a wide range. Standard deviation quantifies that spread in the same units as the original data, making it intuitively interpretable.

  • Sample vs population: Use sample standard deviation (divides by n − 1) when your dataset is a sample drawn from a larger population. Use population standard deviation (divides by n) only when you have every single member of the group.
  • Coefficient of variation (CV): Divides standard deviation by the absolute mean, expressed as a percentage. This makes it possible to compare variability across datasets with different scales, for example, comparing price variability of $5 coffee versus $50,000 cars.
  • Mean Absolute Deviation (MAD): The average of the absolute differences from the mean. More intuitive than variance because it stays in the same units as the data without squaring.

Our Standard Deviation Calculator provides an extended step-by-step breakdown of the variance and standard deviation calculation if you need it for coursework or verification.

Real-World Applications of Average Calculations

  • Education: Teachers compute class averages (mean) to gauge overall performance. Students track their GPA using a weighted average where each course's credit hours determine its contribution.
  • Finance and investing: Portfolio managers use geometric mean to compute compound annual growth rates (CAGR). Analysts use mean and standard deviation to quantify risk. The coefficient of variation compares risk across assets with different average returns.
  • Quality control (manufacturing): Six Sigma teams track the mean and standard deviation of measurements to ensure products stay within tolerance. Control charts are built directly from these statistics.
  • Sports analytics: Batting averages, points-per-game, and fantasy sports rankings are all forms of mean. Median performance over a season is used to filter out exceptional peak games from typical contribution.
  • Healthcare and research: Clinical trials report mean and median outcomes. Skewed biological measurements (blood pressure, reaction times, virus load) often use median to avoid misleading results from extreme cases.
  • Machine learning and data science: Imputing missing values with the mean (for symmetric features) or median (for skewed features) is a standard data-cleaning step. Feature scaling often uses mean and standard deviation (standardisation).

Frequently Asked Questions

What is the difference between mean, median, and mode?

The mean is the arithmetic average, sum of all values divided by the count. The median is the middle value when data is arranged in order; it splits the dataset into two equal halves. The mode is the value that appears most frequently. They each capture the "centre" of a dataset differently, and choosing the right one depends on the shape of your data and whether outliers are present.

When should I use the median instead of the mean?

Use the median when your data is skewed or contains outliers. Common examples include income data, house prices, and response times, where a few very large values would inflate the mean far above the typical experience. The median stays anchored to the true middle regardless of how extreme the highest or lowest values are.

What does "No mode" mean?

A dataset has no mode when every value appears exactly once (or when all values appear the same number of times). In that case, no single value is more frequent than the others, so a mode cannot be meaningfully defined. The calculator reports "No mode" in both situations.

Can a dataset have more than one mode?

Yes. A dataset with two equally frequent peaks is called bimodal, and one with three or more is multimodal. For example, in [2, 2, 5, 5, 9] both 2 and 5 each appear twice, making them co-modes. This often indicates the data comes from two distinct subgroups, which is worth investigating separately.

What is the difference between population and sample standard deviation?

Population standard deviation (σ) divides the sum of squared deviations by n and is used when your dataset contains every member of the group you care about. Sample standard deviation (s) divides by n − 1 (Bessel's correction) and is used when your data is a random sample drawn from a larger population. The correction makes the sample estimate unbiased. In most real-world analysis, use the sample version.

What is the IQR and why does it matter?

The interquartile range (IQR) is the difference between the 75th percentile (Q3) and the 25th percentile (Q1). It represents the spread of the middle 50% of the data, making it far more resistant to outliers than the full range. It is also the foundation of Tukey's outlier detection method: values below Q1 − 1.5×IQR or above Q3 + 1.5×IQR are flagged as outliers.

What is skewness and what does it tell me?

Skewness measures how asymmetric a distribution is. A skewness near 0 indicates a roughly symmetric distribution (mean ≈ median). Positive skewness means the right tail is longer, the mean is pulled above the median by high outliers (common in income or wealth data). Negative skewness means the left tail is longer, the mean falls below the median. This calculator uses Pearson's second skewness coefficient: 3(mean − median) / standard deviation.

When should I use geometric mean instead of arithmetic mean?

Use geometric mean when you are averaging quantities that multiply together rather than add, such as compound growth rates, investment returns over multiple periods, or ratios. For example, if a stock grew by 10%, 20%, and 5% in three consecutive years, the geometric mean gives the equivalent constant annual growth rate. The geometric mean is only defined for positive values.

What is the coefficient of variation (CV)?

The coefficient of variation expresses the standard deviation as a percentage of the absolute mean: CV = (s / |x̄|) × 100%. It lets you compare variability across datasets that have different units or very different average magnitudes. A CV below 15% generally indicates low relative spread; above 30% suggests high variability. It is undefined when the mean is zero.

How does the outlier detection work?

This calculator uses the Tukey IQR fence method. Lower fence = Q1 − 1.5 × IQR; Upper fence = Q3 + 1.5 × IQR. Any value that falls outside these fences is classified as an outlier and highlighted in red on the sorted data view and box plot. This is the same method used by most statistical software including R, Python's matplotlib, and Excel.

Related Calculators