What about violin plots. https://en.m.wikipedia.org/wiki/Violin_plot

317070 · on June 23, 2024

I was thinking the same thing while reading, but the author does mention them at the end (together with the bee swarm plot or sina plot, which I think is the better version of a violin plot)

https://www.rhoworld.com/i-swarm-you-swarm-we-all-swarm-for-...

Scea91 · on June 23, 2024

I use violin plots but a complication is that the shape depends upon the bandwidth hyperparameter of the kernel density estimator that is used inside. The plot can differ a lot for different bandwidth values.

Selection of the 'proper' bandwidth is a classic bias-variance tradeoff problem.

IshKebab · on June 23, 2024

While true, that's not an additional problem compared to box plots which effectively just set the bandwidth to maximum. So IMO they are strictly better.

IanCal · on June 23, 2024

I find violin plots suggest far smoother results than actually exist so you need to be careful with the amount of data.

IshKebab · on June 23, 2024

I agree but so do box plots. I think probably the best thing is violin plots when there's lots of data and bee swarm plots when there isn't. But either are better than box plots.

karmakaze · on June 23, 2024

What about using rotated, symmetric histograms--like a quantized violin plot?

mjfisher · on June 23, 2024

The author mentions those at the bottom of the article, but two problems highlighted still remain:

* There's another intermediary concept (kernel density estimation) between the audience and the data

* They're still likely to misrepresent tight groupings and discontinuities, which will be smoothed out

adammarples · on June 23, 2024

Histograms and box plots are just clunky kernels density estimates too