Grrc
I appreciate the thought you put into this, along with the references you cite (although not too many references - don't want to be part of the reference-creep documented in that study!
).
There is one main takeaway I get from your post, which I wholeheartedly agree with: Many hobbyist forums tend to focus on certain components or aspects not because those aspects are decisive, but rather
because those aspects are part of the forum's founding interest area. Put crudely, the forum is about hammers so all they want to talk about is nails. It's why the Computer Audiophile (now AudiophileStyle) forum has a contingent that wants to talk about things like how soundstage depth is impacted by SSDs vs spinning hard drives or USB vs optical vs coax. It's why high-end vinyl forums include a lot of talk about tonearm wiring and why vintage audio forums feature lots of passionate opinions about the relative sound quality of different receivers in the same manufacturer's model line from the same year. Those forums all draw people who are mainly interested in that stuff.
Even a place like Gearslutz, which IMHO has a very scientifically "healthy" focus on room treatments since the room is arguably the only thing more influential than the speakers, has that focus because it's a forum for DIYers; and acoustic treatments and studio-type room considerations are a major focus of the folks who are drawn to that forum.
So yes, this forum - Audio SCIENCE Review - is drawn to measurements. And for reasons probably as much to do with cost and convenience (size and ease of shipping, for one), it has tended to measure mostly source components, DACs, and amps. And as it has become known for that, it's drawn more members who are interested in those things.
However, where I part ways with your analysis is in your hypothesis that ASR focuses on "easy" measurements of DACs, amps, and so on because of larger cultural shifts in science and/or because a lot of members here are comforted by the certainty of "easy" (aka positivistic) measurements.
I think that hypothesis is weak. I think it's weak because for every comment here about how a DAC that has SINAD of 118 is better than one with SINAD of 112, you see a comment about how it's important to find amps with better SINAD because DAC measurements become pointless if there are no affordable amps with SINAD above the low 90s - in other words, folks here are very conscious of the importance of system balancing.
And that awareness has in recent months led to increasing requests and interest for Amir to test speaker amps, including larger (and more expensive, and more of a hassle ship back and forth) AB amps, multichannel receivers, and so on. And more recently, Amir himself has talked about speaker measurements. And again, even in that conversation the founding issue is that complexity and expense of doing proper measurements on speakers
precisely because of what you cite as the difficulty of properly measuring speakers.
So can we all as a group and community here benefit from being aware that not all established measurements fully describe human perception when it comes to the least linear and most distorted part of a sound system (the speakers)? Yes - transducers are, by their nature, the "lowest-fi" and hardest to measure in meaningful ways (that would include mics and headphones too).
But I think this community clearly has demonstrated a willingness and desire to learn, and so I would disagree with your claim that we are fiddling away with DAC SINAD and ignoring the sonic elephant in the room.
Finally, with respect, I think there's a serious logical fallacy in proposing a "middle way" between science and listening based on all that you have written here. Apologies if I'm misunderstanding, but a "middle way" appears to presume that measurements are definitionally insufficient and so we must combine measurements and listening in some 50-50 (middle) way.
I would argue against that - not because I don't value listening (it's the whole point!), but rather because IMHO measurements and listening are qualitatively different and so one cannot simply split the difference between them. If Speaker A measures better than Speaker B but I like Speaker B better, I don't say, "Can you show me a Speaker C that measures better than Speaker B but not as good as Speaker A," and hope that Speaker C will conversely sound better than Speaker A but not as good as Speaker B. That would be a middle way between measurement and listening, but it would end up with me buying a speaker that I didn't like as much as Speaker B.
Instead, I would either buy Speaker B, or I would double-check my source material and the listening space where I heard the speakers. After making a reasonable effort to rule out factors that might be making Speaker B falsely seem better, if I still liked Speaker B, I'd buy it. There is no middle way there.
@tmtomh , thanks for your long reply. I read it a couple of times to add a few comments.
I will try and explain where I think we differ.
And sorry to all who like short posts. I am not a big fan of twitter. I am not a fan either of posts that are short just to shout repeatedly “four legs good, two legs bad”, as if science were a choir session. It takes longer and more space to develop a minority, diverging argument.
You wrote:
«So yes, this forum - Audio SCIENCE Review - is drawn to measurements. And for reasons probably as much to do with cost and convenience (size and ease of shipping, for one), it has tended to measure mostly source components, DACs, and amps. And as it has become known for that, it's drawn more members who are interested in those things».
The main point of my article is to open one’s eyes for good and bad science.
Akerlof (2019) wrote:
«Before describing the implications of our analysis, it is important to emphasize what has—and what has not been—said. The theoretical and empirical accomplishments of modern economics, obtained with Hard standards for the conduct of research, should be rightly celebrated. But such standards should not be uniformly applied to all economic problems; especially, they should not be applied to those problems for which those standards are too restrictive: for lack of evidence or because motivation significantly differs from standard economic assumptions. Different terrains call for different vehicles. A sailboat is useless in crossing a (riverless) desert; a camel is useless in crossing a sea».
So Akerlof is not criticizing Hard per se. Hard is good science. It’s the narrowness in methods he points the finger at, lack of alternative methods that may create sins of omission; too much of a good thing. When you seem to liken science with measurements, you narrow - dumb down actually - science to an exercise for Measurement Man.
In the same spirit I am not criticizing measurements of DACs and other transparent electronics as such; I use these observations of an overweight of Low Importance to make the point that Akerlof’s figure 1 is relevant for ASR as well. See my applying observations from ASR to Akerlof’s figur 1 below.
Please note that Ellison (2002) does not predict only one outcome (like over-emphasis on Hard/Low Importance) as the outcome can go either way depending on social processes. I am making the case that all discussions on ASR gravitate towards south east in the figure above. Any attempt to try and lift the discussion northward are arrested as if by a thought police. I think this narrow focus on Hard measurements of low Importance on ASR carries over into other fields as well, like speaker discussions and other discussions of general interest. Let me provide an example.
Previously, I quoted a Danish JAES article from 2017, co-written by well-known professor Søren Bech, that opined:
«This disconnect between specifications and perception have made it challenging for acousticians and engineers (and consumers) to predict how a loudspeaker will sound on the basis of these specifications».
https://www.researchgate.net/public..._sound_quality_-_a_review_of_existing_studies
Now, let me quote AES Fellow John Watkinson:
«Traditional loudspeaker measurements are disreputable. It is widely known that several speakers having the same measurements can sound quite different. To a scientist this can only mean that the measurements are either insufficiently accurate or incomplete. Little wonder that extensive subjective assessment is traditionally thought necessary in the development of a loudspeaker».
Source:
http://cyrille.pinton.free.fr/elect...nson-Putting_science_back_in_loudspeakers.pdf
So there seems to be a gap here between people like Watkinson and Bech - who speak about differences that were not apparent in readily available measurements - and people on ASR who act as if measurements yield a complete picture of reproduced sound. Is the ASR community interested in discussing inputs like those of Watkinson and Bech. If not, why is that? Please note that
@Floyd Toole possibly gives an answer to the limits or obstacle of measurements when he says that a measurement device is not brain and a set of of ears. In other words, if audio measurements yielded a complete picture of sound, audio scientists would need to know more about the brain and ears than our best scientists from medical schools.
In your post, you wrote:
“Finally, with respect, I think there's a serious logical fallacy in proposing a "middle way" between science and listening based on all that you have written here. Apologies if I'm misunderstanding, but a "middle way" appears to presume that measurements are definitionally insufficient and so we must combine measurements and listening in some 50-50 (middle) way”.
A gorilla in the room is the fact that for example Harman listen too! A lot. Possibly more than anyone else. They have listening panels that give rise to measurements procedures based on statistical analyses. What came first; listening or science, listening or Idea? What do you think gave rise to Harman choosing listening to decide product development in the first place?
Having said that, I don’t know of any speaker producer who uses measurements only, having discarded all listening sessions. If you or somebody else know of a speaker producer who has skipped all listening in the R&D phase, please inform me.
And let me add that the gold standard in modern audio research, controlled listening, has a potentially conservative effect. Let me use an example to illustrate my point, in the spirit of Akerlof’s note on the sailboat and the camel. If we go back to the 1800s, and develop a measurement regime for testing taxis, that testing regime would ask very different questions than a measurement regime for taxis 100 years later. The vox populi based measurement regime for horse drawn taxis of the 1800 would probably conclude that the new motorcar taxis were of inferior quality (lack of space, repairs needed, few gas stations etc.) It’s just the more philosophical observer who would conclude that the testing regime for taxis of the 1800s were a limiting factor for New, not a factor that drove innovations and New. In other words, a certain testing regime - like vox populi - may be a conserving force for Old and an impediment for New. Is it coincidence that many people remark that modern speakers look very much the same as they used to?
Say a new speaker arrived in 2020, a true point source speaker. If one tested this version 1 of the new point source, one could conclude that the point source speaker is a bad design because it scores low on certain parameters where the Old got a better score. However, there may be ways to measure New, measurements that are not readily available, that would reveal that New is a better design than Old.
My point is, the controlled listening test method - vox populi - is a middle way and has always been so. Vox populi has many strong arguments in favor of it, and I am a big fan of it. But I also realize that vox populi has its limits. I am uncertain if vox populi is a practical driver of New in speaker design.
The most famous remark on the limits of vox populi a.k.a. democratic elections is associated with Churchill:
“Many forms of Government have been tried, and will be tried in this world of sin and woe. No one pretends that democracy is perfect or all-wise. Indeed it has been said that democracy is the worst form of Government except for all those other forms that have been tried from time to time.…”