I don't understand enough about multiple regression models to give a confident answer, but it's all in the papers if you're so inclined.
I'd hazard a guess that the Blade's dip at 300Hz may be the culprit:
View attachment 487253
What I can tell you with confidence though is that it is ill-advised (even according to the authors) to read much in to the preference scores.
A 1.0 difference in preference score does not say much and differences smaller than that are basically meaningless.
Keep in mind as well that the preference score only describes frequency response and no other aspect of loudspeaker performance.
It is feasible for a speaker to get a perfect preference score, but only at 50dB SPL and in practice that speaker just breaks apart at higher SPLs, resulting in an awful listening experience.