Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The mean and variance are not features of a box plot. Box plots show the quartiles, which are about the cumulative distribution.


Which is why I find the article so compelling because I'd always read box plots as being about variance. To me the plot implied a quite normal distribution.


Note that "not knowing how to correctly interpret a boxplot" is not equivalent to "boxplots are useless".


If people like me are in the audience, they might be worse than useless.


Sure. But if someone is using, for example, a notched boxplot to quickly evaluate differences in medians (i.e., they know how to correctly interpret a boxplot), it can still be a useful plot that conveys specific information that you would otherwise not get when looking at a violin plot, histogram, kernel density estimate or a strip plot.

My point, again, was: just because a boxplot is not useful to some people, doesn't mean that it is not a useful plot (particularly when augmented with a rugplot or a strip plot). Plots are not just used to convey information to others: they are also a useful tool in exploratory data analysis.

Notice that you can also apply the same critique to almost any plot: some people don't know how to interpret a violin plot (or kernel density estimate plot) correctly... does that make them useless?

The main advantage of a boxplot is that it is parameter-free (unlike histograms, violin plots and kernel density plots) and quickly conveys very specific information (median, range, quantiles, confidence interval for the median) that other types of plot usually don't.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: