Box Plot Generator
Paste a list of numbers and get a publication-ready box plot: the interquartile box, the median line, whiskers that stop at the last data points inside the 1.5 × IQR fences, and every outlier drawn as its own labeled dot. Download the chart as an SVG file or copy the five-number summary with one click.
The Conventions This Generator Uses
Quartiles — median-split (Moore & McCabe, exclusive): the data is sorted and split at the median, leaving the median itself out when the count is odd; Q1 and Q3 are the medians of the two halves.
Whiskers — Tukey convention: each whisker extends to the most extreme data point still within 1.5 × IQR of the box, not to a computed fence value and not necessarily to the minimum or maximum.
Outliers: values beyond either fence are plotted as individual dots rather than absorbed into a whisker.
Entering Your Data
- Separate values with commas, spaces, or line breaks; a column pasted from a spreadsheet works directly.
- At least two values are required, and the plot becomes genuinely informative from about five values upward.
- Keep everything in one unit — a box plot of mixed scales flags spurious outliers.
Enter numbers separated by commas, spaces, or line breaks
Related Calculators
Quartile Calculator
Calculate Q1, Q2, Q3, IQR, and Tukey fences with automatic outlier detection.
Outlier Calculator
Detect outliers with Tukey's 1.5 × IQR fences or a z-score threshold, plus a cleaned data preview.
Standard Deviation Calculator
Calculate standard deviation, variance, and spread with clear statistical outputs.
Anatomy of a Box Plot
A box plot compresses an entire distribution into one horizontal figure. The box spans the middle 50% of the data, from Q1 to Q3, and the line cutting through it marks the median. The whiskers reach outward to the most extreme observations that still count as ordinary, and anything past them appears as an individual dot demanding an explanation. Reading left to right you see, in order: extreme low values, the ordinary low tail, the crowded middle, the ordinary high tail, and extreme high values.
The boundaries behind the picture:
IQR = Q3 − Q1
Lower fence = Q1 − 1.5 × IQR
Upper fence = Q3 + 1.5 × IQR
Whisker ends = most extreme data points inside the fences
Note that the whiskers stop at actual data points, not at the fences themselves. The fences are invisible thresholds; the whiskers snap inward to the last real observation on each side. That is why a whisker often ends at an oddly specific value rather than a round number.
Why the Method Statement Matters
Two perfectly correct box plot tools can draw different boxes from the same data, because quartiles have several published definitions. This generator uses the exclusive median-split method of Moore and McCabe, the convention taught in most introductory statistics courses and used consistently across this site — the quartile calculator and the five number summary calculator will always agree with the chart drawn here.
Spreadsheets typically interpolate instead: Excel's QUARTILE.INC can place Q1 and Q3 a fraction away from the values shown here on small data sets. Neither answer is wrong, but a homework grader or a journal reviewer will expect you to name the convention — which is exactly why this page states both the quartile method and the whisker rule up front.
Whiskers, Fences, and the 1.5 × IQR Rule
The 1.5 multiplier comes from John Tukey, who invented the box plot in the 1970s. Because the interquartile range measures the spread of the middle half of the data, it cannot be inflated by the very outliers it is screening for — the weakness that makes standard-deviation-based rules unreliable. For roughly bell-shaped data, the zone between the fences covers about 99.3% of observations, so a point outside it is rare enough to plot individually.
A dot beyond the fence is a flag, not a conviction. It may be a typo, a measurement glitch, or a genuine extreme event worth its own paragraph in your report. The outlier calculator applies the same fences numerically when you need the flagged values as a list rather than a picture.
What a Box Plot Shows That an Average Hides
A mean reduces a data set to one number; a box plot keeps its shape. Skew shows up instantly: if the median line sits close to Q1 and the upper whisker stretches far to the right, the data leans toward high values. Spread is the width of the box, robust to whatever the extremes are doing. And unusual observations are physically separated from the body of the data instead of being silently averaged in.
The format really pays off in comparisons. Stacking box plots of the same measurement across groups — test scores by class, response times by server, yields by field — reveals differences in center, spread, and outliers in a single glance, with no distributional assumptions required.
Worked Example: Support Tickets with Outliers on Both Sides
A help desk logs eight ticket resolution times, in minutes: 1, 45, 50, 52, 55, 58, 60, 120 — one auto-closed duplicate, six routine tickets, and one escalation. The generator proceeds like this:
- Sort: 1, 45, 50, 52, 55, 58, 60, 120.
- Median: with 8 values, average the 4th and 5th: (52 + 55) ÷ 2 = 53.5.
- Split into halves: lower half 1, 45, 50, 52 and upper half 55, 58, 60, 120.
- Quartiles: Q1 = (45 + 50) ÷ 2 = 47.5; Q3 = (58 + 60) ÷ 2 = 59; IQR = 59 − 47.5 = 11.5.
- Fences: 47.5 − 1.5 × 11.5 = 30.25 and 59 + 1.5 × 11.5 = 76.25.
- Outliers: 1 and 120 fall outside the fences, so both are drawn as dots.
- Whiskers: the most extreme values inside the fences are 45 and 60, so the whiskers stop there — not at 1 or 120, and not at the fence values 30.25 and 76.25.
The resulting picture tells the honest story: a tight box from 47.5 to 59 says routine tickets take about an hour, while the two dots expose the auto-closed record and the escalation. A bare average of 55.1 minutes would have hidden both.
Frequently Asked Questions
Why don't the whiskers reach my minimum and maximum?
Because this generator follows Tukey's convention: whiskers extend only to the most extreme data points within 1.5 x IQR of the box. If your minimum or maximum lies beyond a fence, it is drawn as an outlier dot instead of being absorbed into a whisker. Simpler tools that always stretch whiskers to the extremes hide exactly the values most worth noticing.
Can I use the downloaded SVG in Word, Google Docs, or a paper?
Yes. The download serializes the chart with all theme colors baked in, so it renders identically outside this site. SVG is a vector format that scales without pixelation; Word, Google Docs, PowerPoint, and LaTeX (via conversion to PDF or EPS) all accept it, and browsers open it directly.
Why does my box look different from the one Excel draws?
Excel's box and whisker chart uses interpolated quartiles (the QUARTILE.INC convention) by default, while this generator uses the median-split method taught in most statistics courses. On small data sets the two conventions can place Q1 and Q3 at slightly different values, which shifts the box edges and occasionally changes which points count as outliers.
What does it mean if the median line is not centered in the box?
It signals skew in the middle half of your data. A median hugging the left edge of the box means the values between Q1 and the median are packed tightly while those between the median and Q3 are spread out - a right-leaning distribution. The whisker lengths tell the same story for the tails.
How many data points do I need for a useful box plot?
The generator draws a plot from two values, but the five-number summary only becomes meaningful from around five observations, and outlier detection stabilizes with ten or more. For very small samples, consider showing the raw data points themselves; a box plot of four values can suggest more structure than the data contains.