BTW, I think the biggest problem with subjectivist reviewers is lack of skill, not because the test is sighted. Lack of skill is clearly documented in research by Harman as I showed earlier:
If these were ratings of doctors, would you select anyone from "Audio Reviewers" category? I assume the answer is no. Why are you making a dispensation for audio?
The last time I was at Harman taking this test it was with a group of top dealers that Harman had brought in to quality for their room optimization technology. Sean Olive ran the test and this group could not get past level 2 or 3 which is what the general public does. In other words, they had no special skills, as the "Audio Retailers" bucket shows. I got up to 5 or 6 at the time (this was 10 years ago) but Sean sailed way past me. This is the power of training and knowing what you are doing. These subjectivist simply don't know what they are doing.
In contrast, sighted vs not has a soft impact on listener preference with speakers due to large difference between them. When testing their own Harman employees with implicit bias for their own brands, the difference was not that large between sighted and blind:
Whether sighted or blind, speakers G and D both got better scores than Speakers S and T. The role reversal only happened for speaker S and there, the difference was between 5.8 and 6.4. Not 1 vs 10. Speaker T maintained its score either way.
So this is not like testing electronics where frequency response is flat and only difference is distortion. There, bias can easily overwhelm the tiny differences between electronics. Here, the bias factor needs to be quite large. Having Harman's own employees in this test provided that to some extent. But nothing like what we find in sighted tests of electronics vs blind.
Net, net, I think the major issue in subjective speaker reviews is that the job is very hard and next to impossible. You are judging a speaker with no reference to what is correct. You will be relying on memory of many other speakers which is faulty especially when combined with inability to reliably detect differences in speakers.
Keep in mind that blind testing of speakers as I have explained is NOT done with one speaker. As Dr. Toole elegantly put it, it will produce nothing useful. They test 3 or more speakers against each other. Only then the results are found to be useful (post statistical analysis). You can see that in place in that graph above.
There is no research to back listening tests of one speaker by itself, blind or otherwise. You all need to internalize this as it is not often understood and attention just put on "blind vs sighted." A multi-way sighted speaker test is far more revealing in my book than any blind single speaker test.
This is such a hairy problem that one needs to give up than attempting with a straight face to say they can solve it. This is why I have resorted to EQ testing. This way I have a comparison. I can do it blind. And I can subsegment the problem and solve it. Please think through this before saying again that you want blind tests of speakers. And that testing of a single speaker by itself done by joe reviewer is good if he doesn't look at measurements. There is nothing of the sort in the cards.
If these were ratings of doctors, would you select anyone from "Audio Reviewers" category? I assume the answer is no. Why are you making a dispensation for audio?
The last time I was at Harman taking this test it was with a group of top dealers that Harman had brought in to quality for their room optimization technology. Sean Olive ran the test and this group could not get past level 2 or 3 which is what the general public does. In other words, they had no special skills, as the "Audio Retailers" bucket shows. I got up to 5 or 6 at the time (this was 10 years ago) but Sean sailed way past me. This is the power of training and knowing what you are doing. These subjectivist simply don't know what they are doing.
In contrast, sighted vs not has a soft impact on listener preference with speakers due to large difference between them. When testing their own Harman employees with implicit bias for their own brands, the difference was not that large between sighted and blind:
Whether sighted or blind, speakers G and D both got better scores than Speakers S and T. The role reversal only happened for speaker S and there, the difference was between 5.8 and 6.4. Not 1 vs 10. Speaker T maintained its score either way.
So this is not like testing electronics where frequency response is flat and only difference is distortion. There, bias can easily overwhelm the tiny differences between electronics. Here, the bias factor needs to be quite large. Having Harman's own employees in this test provided that to some extent. But nothing like what we find in sighted tests of electronics vs blind.
Net, net, I think the major issue in subjective speaker reviews is that the job is very hard and next to impossible. You are judging a speaker with no reference to what is correct. You will be relying on memory of many other speakers which is faulty especially when combined with inability to reliably detect differences in speakers.
Keep in mind that blind testing of speakers as I have explained is NOT done with one speaker. As Dr. Toole elegantly put it, it will produce nothing useful. They test 3 or more speakers against each other. Only then the results are found to be useful (post statistical analysis). You can see that in place in that graph above.
There is no research to back listening tests of one speaker by itself, blind or otherwise. You all need to internalize this as it is not often understood and attention just put on "blind vs sighted." A multi-way sighted speaker test is far more revealing in my book than any blind single speaker test.
This is such a hairy problem that one needs to give up than attempting with a straight face to say they can solve it. This is why I have resorted to EQ testing. This way I have a comparison. I can do it blind. And I can subsegment the problem and solve it. Please think through this before saying again that you want blind tests of speakers. And that testing of a single speaker by itself done by joe reviewer is good if he doesn't look at measurements. There is nothing of the sort in the cards.