Mr. Haelscheir
Active Member
- Joined
- Mar 16, 2023
- Messages
- 139
- Likes
- 106
I would like to solicit where we are for the "state of the art" of transducer measurements.
tl;dr: What methods could we explore for measuring the dynamic performance of transducers with actual music as opposed to steady or swept tones? Am I missing literature finding such analysis to be ineffectual?
Room EQ Wizard already presents an excellent suite for my own uses in assessing magnitude and phase response, impulse and step response (whether or not this can actually be correlated with the subjective perception of transients and their attack, sharpness, or decay), CSD (residuals probably only audible with extreme, isolated transients or "illegal signals" where a transducer's nonlinearities are rendered audible) (see https://www.head-fi.org/threads/hifiman-he1000-se.886228/page-320 for this suite in action), and an excellent Real-Time Analyzer for multitone distortion measurement as I had reported in https://audiosciencereview.com/foru...n-susvara-headphone-review.50705/post-1888972 (post #1,183; it particularly showed that a headphone with the same harmonic distortion performance past a frequency could still have worse multitone distortion performance within that same range; I still need to repeat the test with the multitone truncated to exclude the higher-distortion bass of the "lesser" headphone).
The limitation of that classic battery of measurements is its reliance on steady-state signals or averaging over a complex signal like a pink spectrum multitone. For the goal of assessing a transducer's accuracy or its ability to only produce residuals below audible thresholds, one might argue even spectrally dense multitone measurements to be insufficient for capturing the dynamic or transient performance of transducers with real music. As for impulse responses, a member here (I don't have the link on me) had managed to demonstrate the agreement between an impulse response calculated from a sine sweep (frequency domain measurement) and a time domain recording, so there should be no concern for the capacity of sine sweeps to capture such isolated transient performance. https://www.superbestaudiofriends.org/index.php?threads/burst-response-hd800-sr-207-hd650.3688/ had been a somewhat interesting read, whatever you may think of SBAF, but is probably already captured by the step response and CSD.
Edit: More clearly distinguished "frequency response" from "transfer function".
The questions I wish to answer with these dynamic measurements include:
What I want a dynamic audio analyzer to do:
tl;dr: What methods could we explore for measuring the dynamic performance of transducers with actual music as opposed to steady or swept tones? Am I missing literature finding such analysis to be ineffectual?
Room EQ Wizard already presents an excellent suite for my own uses in assessing magnitude and phase response, impulse and step response (whether or not this can actually be correlated with the subjective perception of transients and their attack, sharpness, or decay), CSD (residuals probably only audible with extreme, isolated transients or "illegal signals" where a transducer's nonlinearities are rendered audible) (see https://www.head-fi.org/threads/hifiman-he1000-se.886228/page-320 for this suite in action), and an excellent Real-Time Analyzer for multitone distortion measurement as I had reported in https://audiosciencereview.com/foru...n-susvara-headphone-review.50705/post-1888972 (post #1,183; it particularly showed that a headphone with the same harmonic distortion performance past a frequency could still have worse multitone distortion performance within that same range; I still need to repeat the test with the multitone truncated to exclude the higher-distortion bass of the "lesser" headphone).
The limitation of that classic battery of measurements is its reliance on steady-state signals or averaging over a complex signal like a pink spectrum multitone. For the goal of assessing a transducer's accuracy or its ability to only produce residuals below audible thresholds, one might argue even spectrally dense multitone measurements to be insufficient for capturing the dynamic or transient performance of transducers with real music. As for impulse responses, a member here (I don't have the link on me) had managed to demonstrate the agreement between an impulse response calculated from a sine sweep (frequency domain measurement) and a time domain recording, so there should be no concern for the capacity of sine sweeps to capture such isolated transient performance. https://www.superbestaudiofriends.org/index.php?threads/burst-response-hd800-sr-207-hd650.3688/ had been a somewhat interesting read, whatever you may think of SBAF, but is probably already captured by the step response and CSD.
Edit: More clearly distinguished "frequency response" from "transfer function".
The questions I wish to answer with these dynamic measurements include:
- Do transducers or the electronics that drive them change in frequency domain magnitude response or transfer function (relation of output level to input level) in the middle of playing certain audio signals?
- Do some transducers or electronics actually reject certain low-level signals so as to "lose the details"? I define "detail" as all the raw, objective information contained within the recording (effectively samples), and "resolving capability" as the ability for the output of the playback system to possess a linear correlate for every sample within the reference recording.
- In some transducers or electronics, can the presence of juxtaposed higher-level signals cause lower-level signals to be rejected (as opposed to simply being masked by intermodulation distortion and other), or in other words have their transfer function or levels altered?
What I want a dynamic audio analyzer to do:
- The user takes a reference track, for example, the first minute of the second movement of Boulez' Mahler Symphony No. 5 recording (alternatively, some idealized synthetic if not AI-generated dynamic test signal, which is open to discussion), captured as a .WAV file, then records using microphones (be it a measurement microphone, a dummy head, or in-ear microphones) in a sufficiently low-noise environment the output of the transducer.
- The FFT of the sample-aligned reference and transducer .WAV files are calculated and plotted with respect to time (this need not be done in real-time so as to be able to maximize resolution). As I understand, FFT window length (assuming a rectangular window as having sufficient dynamic range for the transducers in question) present tradeoffs in frequency and temporal resolution, whereby for longer FFT window lengths, one attains greater frequency resolution for lower frequencies (at least when looking at a logarithmic scale; I suppose the main lobe width per tone would effectively be equal when using a linear scale?) at the cost of reduced temporal resolution for the changes in all frequencies, hurting the analysis of transient behaviour. I happened to come across the term "wavelet transform" which might help for tuning the resolution for the desired view. An alternative idea I am curious of is whether it is possible to reduce the FFT length with increasing frequency toward the theoretical fastest if not "actual" rates at which individual tones of a given frequency are changing, if such a concept even exists insofar as a time domain signal obviously doesn't have an "instantaneous FFT" without consideration of window length (I could be mistaken regarding this).
- After being able to obtain the spectral content of the reference and transducer signals at a given time, the software correlates all the tones common between the reference and transducer signals and from this estimates the instantaneous frequency response of the transducer for each sample (or chosen window). Except where residuals overlap with the tones in the reference signal, this analysis would reveal whether the playback system's transfer function (frequency response and possibly things like "DAC or amp tonality" which we are skeptical of) is changing with respect to time for the given practical music stimulus. A 3D visual could be made plotting the deviation of this frequency response from the steady-state sine sweep measurement, pointing out areas of dynamic compression or extension.
- From this, the actual residuals, be it distortion in the form of tones, reflections, or tones actually failing to decay fast enough, could be extracted and plotted with respect to time for analysis of audibility. Knowledge of the steady-state frequency response could perhaps be used to discern residuals as opposed to frequency response changes within tones already present in the reference recording. This analysis could also be used to discern where tones in the reference signal are "missing" or attenuated and hence not as well "resolved". The software could depict these residuals and perhaps visualize where tones resolved from the reference signal (the audibility of those themselves can be analyzed) are being masked by the adjacent residuals. As such, we would have an objective analysis of a system's "resolution" and "accuracy" independent of tonality (linear distortions).
- This analysis could also be applied to the direct measurement of electronics so as to do a matters more than the PK Metric at least in being able to separate linear distortions from nonlinear distortions.
- Maybe FSAF does exactly this, but it is still way above my head for me to discern that.
Last edited: