
7 Feb 2013, 4:44 a.m.
On 06/02/13 22:26, Andy Georges wrote:
> Quantifying performance changes with effect size confidence intervals - Tomas Kalibera and Richard Jones, 2012 (tech report)
This is a good one - it was actually a talk by Richard Jones that highlighted to me the problems with averaging over benchmarks (aside from the problem with GM, which he didn't mention). This paper mentions Criterion, incidentally.
> [1] J. E. Smith. Characterizing computer performance with a single number. CACM 31(10), 1988.
And I wish I'd read this a long time ago :) Thanks. No more geometric means for me!

Cheers,
Simon