I don't really disagree with you other than you seem to see the glass as three quarters empty rather than three quarters full.
Well, please don't get me wrong - this series of papers from Olive's group is still novel and groundbreaking and added a ton of understanding about loudspeaker measurements, what aspects of those measurements matter more, and how predictive measurements can be.
What bothers me is that there are some people who, without fully understanding the research, believe that loudspeaker listener preferences can be entirely predicted by measurements, or worse, that eyeballing a series of measurements can even come close. I get that people really really want to believe this to be true (because it's true for solid state devices like DACS and amplifiers), but wishing something to be true does not make it true.
To me it shows that three quarters of what determines preference in most cases is frequency response either on axis or the balance of the on axis to off axis and the smoothness of the response. This is pretty much Floyd Toole's mantra of flat on axis with smooth well controlled directivity and a lack of resonances.
Smoothness and directivity (on/off-axis response) account for less than 3/4, since
30% of listener preferences is associated with
bass extension. While Harman's design philosophy involves smooth directivity, and Revel/JBL make really good loudspeakers based on these design goals, it doesn't mean that you can't have a highly preferred speaker that doesn't follow this philosophy. And you can clearly see this by looking at the chart I posted above from Olive's paper, by looking horizontally and seeing how many loudspeakers with the same subjective score had widely differing preferences scores predicted by measurement.
To me the difference in confidence level between the 13 speakers and 70 speakers shows that when the speakers are all very similar in size and construction, really the only thing that separates them is the frequency response in one form or another.
Perhaps. But it also give us an idea of how the final regression would perform when further extrapolating outside the sample. R went from 0.99 to 0.70 when the original model based on the 13 loudspeakers was applied to the larger sample of 70. R^2 of 0.70 is 49%, which means that the extrapolated formula could only explain HALF the variability in listener preferences. So we talk about 74% for the 70 speakers...how much lower is it when we look at speakers that weren't part of that sample of 70?