Countering misinformation about AutoEQ

markanini · Feb 13, 2024

MacClintock said:
For IEMs, @jaakkopasanen doesn't even use the 2019v2IE Harman target, but lowers the bass by 2 dB. I don't know why, but this 1) obviously distorts the compliance score and 2) has no objective justification. The region is also limited to 40 Hz to 10 kHz. So when an IEM has weak sub-bass or extremely unsmooth and spiky treble above 10 kHz (which many have, for example the Chu, but not the Variations or the Truthear Nova), this is simply neglected. In my view, by doing this, with none of it clearly stated so that it must be extracted from the Python source code, he is doing a big disservice to the research.

Misinformation on many levels. First, the preference scores on the AutoEQ Ranking page are referencing Harman IE 2019 for IEMs, nothing else. Only the separate EQ presets are using a lightly modified Harman target, which is fine. Automatic EQ doesn't become more reliable just for adhering to a particular target in this context. This makes your erroneous remark about the frequency range irrelevant; in fact, 10 kHz and above was never on the table for 711 couplers per IEC standards and manufacturer specs, which is why Sean Olive's model didn't consider frequencies above 10 kHz. So AutoEQ would be doing it incorrectly if they changed it to what you want.

MacClintock · Feb 13, 2024

markanini said:
Misinformation on many levels. First, the preference scores on the AutoEQ Ranking page are referencing Harman IE 2019 for IEMs, nothing else.

Not true. Misinformation is what you are spreading. It is NOT the 2019v2IE Harman target; the bass is reduced by a shelf of about 2 dB. Educate yourself.

The IEM Harman Target 2019 sounds "off" to me. Is it just me?

I had some responses in mind, but I'm checking out of this argument and leaving this here: Back on topic, is the Harman target used in AutoEQ even correct? One outcome of the above exchange is that, in looking at the Truthear Zero on AutoEQ, I noticed it overshoots almost the entire bass shelf...

audiosciencereview.com

markanini said:
Only the separate EQ presets are using a lightly modified Harman target, which is fine.

See above.

markanini said:
Automatic EQ doesn't become more reliable just for adhering to a particular target in this context. This makes your erroneous remark about the frequency range irrelevant; in fact, 10 kHz and above was never on the table for 711 couplers per IEC standards and manufacturer specs, which is why Sean Olive's model didn't consider frequencies above 10 kHz. So AutoEQ would be doing it incorrectly if they changed it to what you want.

Even if the region above 10 kHz is not considered by the score rating (which is not from Sean Olive, only from @jaakkopasanen), it is still a relevant region. Thus, if an IEM measures well below 10 kHz but has many spikes or holes in the FR above it, it is still bad and not very Harman compliant.
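
To make the band-limit point concrete, here is a minimal sketch of error statistics that are computed only inside a frequency band. It is not AutoEq's code: the function name band_error_stats, the synthetic curves, and the choice of dB per octave as the slope unit are assumptions for illustration. The 40 Hz to 10 kHz limits are simply the ones mentioned in this thread.

```python
import numpy as np


def band_error_stats(freq_hz, measured_db, target_db, f_lo=40.0, f_hi=10_000.0):
    """Return (std_db, slope_db_per_octave) of the error curve inside [f_lo, f_hi].

    Hypothetical helper: the band limits default to the values discussed in
    the thread, and the slope is a straight-line fit against log2(frequency).
    """
    freq_hz = np.asarray(freq_hz, dtype=float)
    error = np.asarray(measured_db, dtype=float) - np.asarray(target_db, dtype=float)
    band = (freq_hz >= f_lo) & (freq_hz <= f_hi)
    std_db = float(np.std(error[band]))
    slope, _intercept = np.polyfit(np.log2(freq_hz[band]), error[band], 1)
    return std_db, float(slope)


# Synthetic data, invented for this example only.
freq = np.geomspace(20.0, 20_000.0, 200)
target = np.zeros_like(freq)
# One response matches the target everywhere ...
smooth = np.zeros_like(freq)
# ... the other matches it up to 10 kHz and has large ripple above that.
spiky = np.where(freq > 10_000.0, 8.0 * np.sin(12.0 * np.log2(freq)), 0.0)

print("smooth, 40 Hz-10 kHz:", band_error_stats(freq, smooth, target))
print("spiky,  40 Hz-10 kHz:", band_error_stats(freq, spiky, target))
print("spiky,  20 Hz-20 kHz:", band_error_stats(freq, spiky, target, 20.0, 20_000.0))
```

With these made-up curves the first two calls return identical statistics, because the ripple sits entirely above the upper band limit. Only the third call, with the band widened, sees it. Any statistic restricted to a band has this property by construction, whichever side of the argument one takes.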

bluefuzz · Feb 13, 2024

I don't understand the problem. AutoEQ is fully customizable to produce EQs of any number of PEQs conforming to any target you like. If you don't like the defaults then make your own correction! Any software has to come with some default settings as a demonstration of its capabilities but they should only be considered a starting point for your own work ...

jaakkopasanen · Feb 13, 2024

MacClintock said:
Not true. Misinformation is what you are spreading. It is NOT the 2019v2IE Harman target; the bass is reduced by a shelf of about 2 dB. Educate yourself.

The IEM Harman Target 2019 sounds "off" to me. Is it just me?

I had some responses in mind, but I'm checking out of this argument and leaving this here: Back on topic, is the Harman target used in AutoEQ even correct? One outcome of the above exchange is that, in looking at the Truthear Zero on AutoEQ, I noticed it overshoots almost the entire bass shelf...

audiosciencereview.com

See above.

Even if the region above 10 kHz is not considered by the score rating (which is not from Sean Olive, only from @jaakkopasanen), it is still a relevant region. Thus, if an IEM measures well below 10 kHz but has many spikes or holes in the FR above it, it is still bad and not very Harman compliant.

The scoring function is using the original targets, unmodified.

AutoEq/dbtools/update_result_indexes.py at master · jaakkopasanen/AutoEq

Automatic headphone equalization from frequency responses - jaakkopasanen/AutoEq

github.com

MacClintock · Feb 14, 2024

jaakkopasanen said:
The scoring function is using the original targets, unmodified.

AutoEq/dbtools/update_result_indexes.py at master · jaakkopasanen/AutoEq

Automatic headphone equalization from frequency responses - jaakkopasanen/AutoEq

github.com

Ok, thanks for clarifying this, as none of it is clearly stated on GitHub or anywhere else, and having two different targets is confusing and not justified. Furthermore, the scoring function is limited to the region between 40 Hz and 10 kHz, which is why many IEMs with terrible treble (like the Chu) are up there in the rating. Also, the Truthear Zero:Red is rated much lower in this scoring, but in the oratory measurements its rating is as high as or higher than that of the Chu and the Salnotes Zero.

As far as headphones are concerned, at the top there are:

Mark Levinson No 5909	99	1.14	0.06
Sennheiser HE 1 Orpheus 2	99	1.18	-0.02
Dyson Zone	96	1.45	0.01
Shure SRH440	96	1.31	0.11
HIFIMAN Sundara (post-2020 earpads)	95	1.54	-0.01
Philips Fidelio X2HR

Anybody taking this seriously has lost his mind. Besides the HE1, all these headphones are mediocre or merely good (Sundara, SRH440, 5909), if not plain trash. For example, according to Amir's measurements the Fidelio has high distortion, treble peaks, channel imbalance, strange group delay, and variable impedance, and cannot even be EQed properly. Yet it is 6th according to this "scoring". I know it is only meant to assess the tonal balance, but after seeing something like this, one should doubt the validity of this approach altogether.

IAtaman · Feb 14, 2024

MacClintock said:
Ok, thanks for clarifying this, as none of it is clearly stated on GitHub or anywhere else, and having two different targets is confusing and not justified. Furthermore, the scoring function is limited to the region between 40 Hz and 10 kHz, which is why many IEMs with terrible treble (like the Chu) are up there in the rating. Also, the Truthear Zero:Red is rated much lower in this scoring, but in the oratory measurements its rating is as high as or higher than that of the Chu and the Salnotes Zero.

As far as headphones are concerned, at the top there are:

Mark Levinson No 5909 99 1.14 0.06
Sennheiser HE 1 Orpheus 2 99 1.18 -0.02
Dyson Zone 96 1.45 0.01
Shure SRH440 96 1.31 0.11
HIFIMAN Sundara (post-2020 earpads) 95 1.54 -0.01
Philips Fidelio X2HR

Anybody taking this seriously has lost his mind. Besides the HE1, all these headphones are mediocre or merely good (Sundara, SRH440, 5909), if not plain trash. For example, according to Amir's measurements the Fidelio has high distortion, treble peaks, channel imbalance, strange group delay, and variable impedance, and cannot even be EQed properly. Yet it is 6th according to this "scoring". I know it is only meant to assess the tonal balance, but after seeing something like this, one should doubt the validity of this approach altogether.

I'd think Sean Olive would take this seriously as he is the lead author in this research. Do you think he lost his mind?

HarmonicTHD · Feb 14, 2024

MacClintock said:
Ok, thanks for clarifying this, as none of it is clearly stated on GitHub or anywhere else, and having two different targets is confusing and not justified. Furthermore, the scoring function is limited to the region between 40 Hz and 10 kHz, which is why many IEMs with terrible treble (like the Chu) are up there in the rating. Also, the Truthear Zero:Red is rated much lower in this scoring, but in the oratory measurements its rating is as high as or higher than that of the Chu and the Salnotes Zero.

As far as headphones are concerned, at the top there are:

Mark Levinson No 5909 99 1.14 0.06
Sennheiser HE 1 Orpheus 2 99 1.18 -0.02
Dyson Zone 96 1.45 0.01
Shure SRH440 96 1.31 0.11
HIFIMAN Sundara (post-2020 earpads) 95 1.54 -0.01
Philips Fidelio X2HR

Anybody taking this seriously has lost his mind. Besides the HE1, all these headphones are mediocre or merely good (Sundara, SRH440, 5909), if not plain trash. For example, according to Amir's measurements the Fidelio has high distortion, treble peaks, channel imbalance, strange group delay, and variable impedance, and cannot even be EQed properly. Yet it is 6th according to this "scoring". I know it is only meant to assess the tonal balance, but after seeing something like this, one should doubt the validity of this approach altogether.

It clearly states what it does: compliance with the Harman target, no more, no less.

You are now adding other characteristics which of course are not considered in the score, as clearly stated. Yes those other characteristics are as important and that’s why no one knowledgeable will select headphones on Harman target compliance alone, especially as one can EQ to it.

jaakkopasanen · Feb 14, 2024

MacClintock said:
Ok, thanks for clarifying this, as none of it is clearly stated on GitHub or anywhere else, and having two different targets is confusing and not justified. Furthermore, the scoring function is limited to the region between 40 Hz and 10 kHz, which is why many IEMs with terrible treble (like the Chu) are up there in the rating. Also, the Truthear Zero:Red is rated much lower in this scoring, but in the oratory measurements its rating is as high as or higher than that of the Chu and the Salnotes Zero.

As far as headphones are concerned, at the top there are:

Mark Levinson No 5909 99 1.14 0.06
Sennheiser HE 1 Orpheus 2 99 1.18 -0.02
Dyson Zone 96 1.45 0.01
Shure SRH440 96 1.31 0.11
HIFIMAN Sundara (post-2020 earpads) 95 1.54 -0.01
Philips Fidelio X2HR

Anybody taking this seriously has lost his mind. Besides the HE1, all these headphones are mediocre or merely good (Sundara, SRH440, 5909), if not plain trash. For example, according to Amir's measurements the Fidelio has high distortion, treble peaks, channel imbalance, strange group delay, and variable impedance, and cannot even be EQed properly. Yet it is 6th according to this "scoring". I know it is only meant to assess the tonal balance, but after seeing something like this, one should doubt the validity of this approach altogether.

I simply implemented the preference prediction score as it is defined in the research. I won't comment too much about the validity and completeness of the research, but the score does correlate very well with double blind listening tests.
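
For readers who want to check the numbers, the rows quoted above are consistent with a linear model for around-ear and on-ear headphones in which the predicted preference falls with the standard deviation and the absolute slope of the error curve. The sketch below is a reconstruction under assumptions, not code taken from AutoEq: the coefficients are given as commonly cited for the published model and should be checked against the paper, the function name is made up, and the three numeric columns of the quoted list are read as score, STD (dB) and slope.

```python
def predicted_preference(std_db: float, slope: float) -> float:
    """Predicted preference for an around-ear/on-ear headphone (higher is better).

    Assumed coefficients of the published linear model; verify against the
    original paper before relying on them.
    """
    return 114.49 - 12.62 * std_db - 15.52 * abs(slope)


# Rows copied from the list quoted in this thread: (name, listed score, STD, slope).
# The Philips Fidelio X2HR row is left out because its numbers were not quoted.
rows = [
    ("Mark Levinson No 5909", 99, 1.14, 0.06),
    ("Sennheiser HE 1 Orpheus 2", 99, 1.18, -0.02),
    ("Dyson Zone", 96, 1.45, 0.01),
    ("Shure SRH440", 96, 1.31, 0.11),
    ("HIFIMAN Sundara (post-2020 earpads)", 95, 1.54, -0.01),
]

for name, listed, std_db, slope in rows:
    recomputed = predicted_preference(std_db, slope)
    print(f"{name}: listed {listed}, recomputed {recomputed:.1f}")
```

Each recomputed value rounds to the listed score, which supports reading the columns this way. The only inputs are two statistics of the frequency-response error, so distortion, channel imbalance, group delay and impedance cannot influence the number at all. The in-ear model uses a different formula and is not shown here.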

bodhi · Feb 14, 2024

MacClintock said:
I know it is only meant to assess the tonal balance, but after seeing something like this, one should doubt the validity of this approach altogether.

It seems to me that you are free to do exactly that.

MacClintock · Feb 14, 2024

jaakkopasanen said:
I simply implemented the preference prediction score as it is defined in the research.

That actually seems to be the problem. People see it and take it for granted, thinking it really has predictive power for the sound quality of a headphone, which it doesn't.

jaakkopasanen said:
I won't comment too much about the validity and completeness of the research, but the score does correlate very well with double blind listening tests.

Could you please show me a publication where the Dyson Zone and the Fidelio X2HR perform that well? Or are you just talking about some virtual headphones that were EQed to their FR? Because that way you would neglect high distortion, treble peaks, channel imbalance, strange group delay, variable impedance and the fact that some of these cannot be EQed properly.

solderdude · Feb 14, 2024

Dyson:

[Attached screenshot: Schermafdruk van 2024-02-14 19-21-24.png]

aside from the huge (+10dB) treble peak (in the 'sharpness' frequency band) it seems to have an otherwise excellent tonality.

X2HR, with a little help in the treble and the bass lowered a bit, sounded quite good to me.

tonality wise, overall and smoothed, this seems to be the correct tonality.

But yes... sounds nothing like the HE1 and Sundara is bass shy compared to the others.
And I agree... ratings based on some numbers on tonal balance measured on a specific fixture are not related to sound quality but have some relation to tonality.

MacClintock · Feb 14, 2024

HarmonicTHD said:
It clearly states what it does: compliance with the Harman target, no more, no less.

You are now adding other characteristics which of course are not considered in the score, as clearly stated. Yes those other characteristics are as important and that’s why no one knowledgeable

Unfortunately, not everybody is knowledgeable.

HarmonicTHD said:
will select headphones on Harman target compliance alone, especially as one can EQ to it.

Not every headphone can be EQed easily, see the Fidelio example.

MacClintock · Feb 14, 2024

solderdude said:
And I agree... ratings based on some numbers on tonal balance measured on a specific fixture are not related to sound quality but have some relation to tonality.

Thank you, this is exactly my point. In fact, in many cases there is just some "faint resemblance" to sound quality.

MacClintock · Feb 14, 2024

IAtaman said:
I'd think Sean Olive would take this seriously as he is the lead author in this research. Do you think he lost his mind?

Sean Olive, whose research I hold in high esteem, is well aware of the limits of the scoring system, especially if it is taken as a pure indicator of sound quality. Or do you think he walks around with the Dyson Zone, sometimes replacing it with the Fidelio and the Chu?

IAtaman · Feb 14, 2024

MacClintock said:
Sean Olive, whose research I hold in high esteem, is well aware of the limits of the scoring system, especially if it is taken as a pure indicator of sound quality. Or do you think he walks around with the Dyson Zone, sometimes replacing it with the Fidelio and the Chu?

The apparent absurdity of that mental image demonstrates nothing other than the fact that, instead of making an actual statement, you are just resorting to argumentum ad absurdum.

Do you know what the limitations of the scoring system are?

And enlighten us please, how does "group delay" correlate with preference?
And what exactly is the effect of "high distortion"?
Tell me please, which of those headphones that score 95 and higher in the predictive scoring system have such high distortion that they are objectively not preferable?
With sources if possible please.

MacClintock · Feb 15, 2024

IAtaman said:
The apparent absurdity of that mental image demonstrates nothing other than the fact that, instead of making an actual statement, you are just resorting to argumentum ad absurdum.

Do you know what the limitations of the scoring system are?

I gave you all the arguments above in previous postings.

IAtaman said:
And enlighten us please, how does "group delay" correlate with preference?

Messy group delay may cause resonances, polarity issues, strange soundstage, etc.

IAtaman said:
And what exactly is the effect of "high distortion"?

Just make a listening test, if you really don't know the basics of hifi.

IAtaman said:
Tell me please, which of those headphones that score 95 and higher in the predictive scoring system have such high distortion that they are objectively not preferable?
With sources if possible please.

As I mentioned earlier, the Fidelio, ranked no. 6, has quite high distortion and Amir didn't like it at all, even after EQ. He also did not like the no. 1, the 5909. What more is needed to discredit this mindless use of the ranking system?

IAtaman · Feb 15, 2024

MacClintock said:
I gave you all the arguments above in previous postings.

Messy group delay may cause resonances, polarity issues, strange soundstage, etc.

Just make a listening test, if you really don't know the basics of hifi.

As I mentioned earlier, the Fidelio, ranked no. 6, has quite high distortion and Amir didn't like it at all, even after EQ. He also did not like the no. 1, the 5909. What more is needed to discredit this mindless use of the ranking system?

That is it?

The basis of your strong criticism of the research, going as far as referring to people who take it seriously as having lost their minds, is "Amir did not like it" - is that it?

I'll tell you what's more needed: some actual, objective proof.

You said high distortion. Here are the distortion level graphs for Fidelio and HD650 side by side. What makes HD650's distortion acceptable and Fidelio's "high"?

The big EQ adjustment will be in the bass, where the Fidelio has much better extension down to 50 Hz compared to the HD650, whose response starts to droop off at around 100 Hz. As for the little bump at around 5 kHz, there is no real reason to think it is audible, but it will be toned down in the EQ anyway. Same for the distortion in the 270 Hz range. So I'd argue that the Fidelio EQ'ed to target will have less distortion than the HD650 EQ'ed to target. Do you disagree?

You said messy group delay. Here are the GD graphs for the Fidelio and the HD650 side by side. Which one is messier, and how does it affect sound quality or preference? Explain, please.

If there is more, do explain. What makes Fidelio objectively a bad headphone, please, treat me as a total novice and spell it all out for me.

MacClintock · Feb 15, 2024

IAtaman said:
That is it?

The basis of your strong criticism of the research, going as far as referring to people who take it seriously as having lost their minds, is "Amir did not like it" - is that it?

He did not like it for reasons, and it is for these reasons that the Fidelio is objectively a bad or at best mediocre headphone.

In contrast to the HD 650 the Fidelio has also a series of resonance peaks in the treble, which are generally almost impossible to EQ properly.

IAtaman said:
I'll tell you what's more needed: some actual, objective proof.

You said high distortion. Here are the distortion level graphs for Fidelio and HD650 side by side. What makes HD650's distortion acceptable and Fidelio's "high"?

The big EQ adjustment will be in the bass, where the Fidelio has much better extension down to 50 Hz compared to the HD650, whose response starts to droop off at around 100 Hz. As for the little bump at around 5 kHz, there is no real reason to think it is audible, but it will be toned down in the EQ anyway. Same for the distortion in the 270 Hz range. So I'd argue that the Fidelio EQ'ed to target will have less distortion than the HD650 EQ'ed to target. Do you disagree?

Yes, distortion is overall higher on the Fidelio, just look at the graphs, especially in the bass where both need a boost.

IAtaman said:
If there is more, do explain. What makes Fidelio objectively a bad headphone, please,

It’s funny how you selectively post some graphs. The others, which don’t support your claim, you discard, namely the gross channel imbalance (2-4 dB over a wide frequency range), the spiky resonance peaks in the FR and the non-constant impedance. The first makes for an unpleasant listening experience in general, and the latter makes it impossible to EQ accurately to a given target, which is exactly what Amir experienced.

IAtaman said:
treat me as a total novice and spell it all out for me.

That is not difficult as you seem to be one.

isostasy · Feb 15, 2024

@MacClintock what's your beef with: the ranking table or the people who treat it like gospel? From the above posts you seem completely aware of the limitations of the statistical model, so I'm not sure why you're so bothered that some headphones you don't like appear at the top. Not sure what you're suggesting Jaakko do instead. The fact that some people aren't knowledgeable enough to make proper use of tools/info/etc. is unfortunately an issue in all walks of life.

Jimbob54 · Feb 15, 2024

isostasy said:
@MacClintock what's your beef with: the ranking table or the people who treat it like gospel? From the above posts you seem completely aware of the limitations of the statistical model, so I'm not sure why you're so bothered that some headphones you don't like appear at the top. Not sure what you're suggesting Jaakko do instead. The fact that some people aren't knowledgeable enough to make proper use of tools/info/etc. is unfortunately an issue in all walks of life.

Suspect it's both. If the table (or indeed the scores Oratory puts on his PDFs) didn't exist, then people wouldn't run around quoting scores as if they meant any more than how likely a headphone's stock tonality is to be "liked" by the greatest number of people. TBH I think more value can be obtained from the rankings from a consumer perspective by using them to rule out poor-sounding headphones from one's purchasing decisions than by assuming the ones at the top of the list are automatically the ones to look at.

I reckon the scoring has considerably more value for manufacturers, especially if they measure and calculate during the design/prototype phase.
