In 2018,
Amir examined a peer-reviewed conference paper published in the 2011 proceedings of the Audio Engineering Society. The author is a faculty member at a well-regarded audio engineering program at McGill University in Canada. This study demonstrated, through a double-blind AB preference test, that a statistically significant number of listeners were able to express preference for one DAC over another:
- "While the differences were slight, the op-amps in this test were capable of producing levels of distortion within their normal, unclipped operational range that were detectable by listeners. It is possible for listeners to differentiate between op-amps based on distortion characteristics alone. The ability to distinguish between the distortion characteristics of op-amps appears to be dependent on a complex interaction of the THD vs. Frequency, THD vs. Amplitude, and the harmonic spectrum of the distortion. This low-level distortion is enough to alter the character of the sound but does not necessarily reduce listener preference."
View attachment 432161
As you can see, 9 of the 24 listener-observations (some listeners tested more than once) were able to express a statistically significant (column 1 p<.05) preference for one DAC over another even though both the A and B DACs had THD+N ratios well below 0.1% (columns 2 & 3).
In a follow-up 2014 study, the author finds that
- "Due to the gain-bandwidth characteristics of operational amplifiers, their nonlinearities are frequency dependent, showing a rise in distortion at higher frequencies. Depending on the circuit and system implementations, this distortion can be significant to listener perception of sonic character and quality and is therefore relevant to models of op amp-based analog equipment."
Finally,
a 2019 masters thesis out of the University of Huddersfield (UK) cites these McGill studies and was able to reproduce their general findings.
- "This thesis presents and discusses research undertaken into the detection and measurement of colouration produced by audio transformers. It is common for transformers to be subjectively described by audio professionals as ‘warm’, ‘fat’, ‘smooth’, etc. however there is little evidence to show if there is an audible difference and if there is a correlation between these levels of perceptual attributes and the performance of the device. Therefore, the research question was defined as: is there an audible difference with a transformer?
- "A review of the objective and subjective elements of the study was conducted. First the history, application and operation of the audio transformer with an aim to understand the objective measures and performance of the device with focus on the nonlinear distortion response. This also includes the design and testing of a suitable test circuit used in the measurement method. The subjective review concerns perception of distortion and testing methodologies for investigating audible differences. The conclusion of the review is that few pieces of research exist showing the relationship between device, distortion and perception
- "The testing of each transformer involved the use of a specially designed test circuit using a variety of measures including THD+N and frequency domain analysis, to provide the most information about the operation of the device. Using the same setup, the device responses to a variety of samples were recorded and implemented in a double-blind triple-stimulus with hidden reference test using trained listeners in accordance with the ITU-R BS.1116-3 recommendation. The test results were then analysed for a random distribution using a 1-tailed binomial test. The results of the analysis show a high likelihood that the bass samples were audibly different shown by the significant p-values of all 3 samples at less than 0.001. A slight correlation seems to exist with THD+N, 3rd harmonic distortion and level however with no other obvious trends, it was concluded that the distortion and therefore the audibility is programme dependent.
- "It was concluded that transformers are likely to produce a level of distortion deemed audible although the effect is considered to be programme dependent."
None of these tests "prove" that I (or anyone) can tell the difference between two operational amplifiers with otherwise similar measurements of THD. But the statistical significance of these seemingly well thought out experiments should increase our confidence in the plausibility that I--and myriad other audio professionals and enthusiasts--aren't suffering from some mass delusion when we hear qualitative differences between opamps.
Audio enthusiasts and technicians should be less-dismissive of claims that individuals can hear such qualitative differences between DACs. There may be quantifiable differences in the way low-level operational amplifier distortion generates audible even- or odd-order harmonics.