You say above "I'm a big believer in science and understand very well the pitfalls of various human cognitive biases".I wasn't just eyeballing the graph; I was actually counting the decibel difference. That graph simply doesn't show the bass deficiency (it has more than the Stealth according to that graph) and there's no massive dip at ~4k. The only place that graph and Amir's agree is the ~4db dip around 1.5k. I don't think it's coincidence that's the only place I actually hear a dip. As for the bass, I don't think I agree with that graph that the Caldera is bassier than the Stealth/Aeon, but neither do I think the Stealth/Aeon is ~3-5db bassier than the Caldera as Amir's measurements suggest. They sound fairly close from what I can remember (and from what I can directly compare with the Aeon, which is tuned very close to the Stealth). It's entirely possible that the different pads and seals are dramatically affecting the measurements of both.
It's hard enough doing an A/B comparison let alone going by what you remember some music sounding like through a headphone.
You can't expect Amir's graph to "agree" as the measurement rigs are completely different.
"The deviation is very high in the sub-bass (due to non-reliable seal), around 3 kHz (due to the wrong acoustic impedance of the pinna), and in the frequency regions 5 kHz and upwards, where deviations are much to high to obtain reliable measurements from the EARS."
https://www.reddit.com/r/headphones/comments/7szpqm/_/dt9pm7d