That's some interesting data re the measured variance of human subject HRTF (Fig 5), I find the standard deviation graph to be one of the most useful there, as it's showing the bracket within which 68% of the population would fall. Up to 3kHz it's showing that 68% of humans would fall only +/- 1dB of the Target Response (mean). So for sure you can say HATS in terms of HRTF are valid up to 3kHz. From 4-7kHz you've got about +/-2.5dB variation, and above 7kHz you got about +/-5dB variation. So taking that all into consideration then I would think measurements are meaningless above 7kHz unless individuals responses above 7kHz are a predictable/consistent "Shelf dB" above or below the mean, in which case a person could experiment with Shelf EQ's above & below the calculated EQ (e.g. Oratory1990 for example)??
My previous paragraph, that just addresses HRTF calculated from "speakers in a room", but then you've got your HpTF that you mentioned on top of that as an additional variable and source of variance within a population. This is what you were showing in Fig 6. I think. Regarding variance of HpTF as seen in those graphs it seems to be about the same level of variance in HpTF as there is with HRTF and following the same patterns across the frequency range in terms of greater variance at the higher frequencies.
So given we have two sources of variance, both HRTF [(calculated from "speakers in a room") which is used to create any Target Frequency Response we use for headphone EQ] and HpTF which just shows the variance of a given headphones Frequency Response for any individual then where does that leave us in terms of how accurate EQ's can be that are based on measurements on Dummy Heads? I suppose combining the two variances of HRTF and HpTF this is further magnifying the overall error, I suppose you could work out mathematically how much that variance increases when you combine them given that variance of each variable seems about equal - would it double the overall variance or is it something like a 1.5 factor? EDIT: although in a previous post you said you've found HpTF and HRTF are often linked, so then that would indicate that you think when combining the variance of HpTF & HRTF that the overall variance is not as large as one would initially think, what kind of an increased variance factor would it be do you think?
Going back to an observation I made in the first paragraph that most variance is above 7kHz, can we use Shelf EQ's above 7kHz to manipulate EQ's that have been created on dummy heads to experiment ourselves as to what sounds most accurate, or does an individuals ear frequency response above 7kHz not follow the general trends at all, and thereby "Shelf EQ technique" above 7kHz holds no water?
Again, a lot of the validity of headphone measurements along with the validity of Target Responses will come down to how far an individuals physical anatomy varies from the dummy standard as a whole, although it seems pretty darn reliable & indisputable up to the 2kHz.
As for the validity of the B&K 5128 and the validity of the ASR headphone project.....I think we have to accept that there are all these variances within the population that we've talked about here, and if we're gonna do it then it would have to be something above & beyond what is offered on other sites. If the B&K 5128 itself is not inherently more "accurate" than the equipment being used by other sites then that's not a differentiation point either.....cursory evaluation seems to suggest B&K 5128 offering very similar results to other sites for HD650 it seems (I've not looked at this in detail, just gone off other members good posting in this thread). So any differentiation to other sites will come down to what we do with the measurements in terms of interpretations related to comparative headphone quality between different headphones (that might come under things like distortion & other measured variables incl frequency response), and perhaps of course we can also offer headphone EQ service in terms of offering filters for people.......I think we gotta do something different if B&K 5128 itself isn't proving to be "next gen" or "anything special".
Some good questions here.
In broad terms, I think the premise that high-frequency equalization should mostly take the form of shelf or very low Q peak filters (broad adjustments in level, essentially) is quite advisable. I personally take this to a relative extreme; I don't even aim to notch the peak from my HD800, because I've found that doing so without putting a hole in the surrounding treble somewhere is troublesome. For most practical people's purposes, so long as it doesn't sound bad, I'd say that relatively fine/high Q equalization is reasonable up to 8-10khz, although it depends on a number of variables (particularly how the headphone's response varies with position on the head of the wearer).
Regarding the intersection of HRTF and HpTF, let's consider two extremes and how they would impact things:
First, let's imagine a world where headphones are "HRTF chameleons" - by some process, whether acoustic or computer controlled, they perfectly approximate the individual wearer's HRTF in the target sound field. In this world, you could see extremely wide variation in HRTFs and HpTFs, but have
zero variation in subjective timbre of headphones, because headphone subjective frequency response is equal to HpTF minus HRTF. This world is almost entirely reconcilable with Hammershøi & Møller's data.
Second, let's imagine a world where headphones are "HRTF blind" - perhaps in this world a trend of very deeply inserted in-ear monitors dominates, but for whatever reason, the HpTF is
absolutely constant between wearers, even as individual HRTF varies. Subjective frequency response would track roughly according to the scenario you're outlining in this post - up to around the peak of the ear resonance there'd be agreement, and then it would all go south. A headphone that sounds peaky to one person would be smooth to another, and we'd have a great deal of trouble comparing our subjective impressions of headphones at all in the higher frequencies.
Now, in reality, we don't live in either of those worlds - some of the aspects of individual anatomy that influence HRTF also influence HpTF, so they aren't uncorrelated, but they also don't coincide perfectly. There's also the "x factor" of whether headphones have individualized and atypical interactions with some anatomy that doesn't relate in any way to HRTF - unarguably this happens at low frequencies with headphones with high acoustic impedance when a leak is present in the pad volume, but it could also happen at higher frequencies, and this would introduce another source of response variation.
Pragmatically, I don't think that we can use measurements of headphones on population average measurement fixtures to make some of the projections we can make about other types of equipment - e.g. "DUT A will sound the same as DUT B" - and non-individualized equalization is inevitably going to be an area where caution is wise, and erring towards broader filters and leaving the high Q features mostly alone is likely to yield better results on average, but equally, headphones are such radically audibly different devices that we don't
need the degree of consistency of coherency between test and in situ that we have with amplifier or DAC measurements to make very reasonable extrapolations about what will sound better.
This all said, I still don't entirely grasp what Amir's primary goal with this headphone testing project is; if it's to redefine headphone metrology and take it out of the dark ages...well, we weren't in the dark ages to begin with, so that'd be pretty hard
If it's to present an additional impartial source of reliable headphone measurements alongside what presently exists (Oratory, Resolve/Headphones.com, Clarityfidelity/Speakerphone, Keith Howard/HeadphoneTestLab, Brent Butterworth/Soundstage Solo, etc), then we've already got validation of concept. If it's to improve on the current state of the art in headphone metrology that's very conceivable - the 5128 can reasonably claim to be the most accurate way to measure headphones that presently exists, and there's still plenty of room for innovation in methodology if Amir is interested in that.