This does not confirm that there is no audible difference.
Amplifier performance is important, but there is much more to it. When you zoom the FFT in on the harmonics, you can see visible differences in the harmonic sequences, even when the SINAD numbers calculate the same. Also think about modulation sidebands.
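To put numbers on that point, here is a minimal sketch, not a measurement. It assumes two hypothetical, noise-free amplifiers driven with a 1 kHz tone at 48 kHz sampling: one adds only a 2nd harmonic, the other only a 3rd, both at the same level. The sample rate, tone frequency, harmonic levels and function names are all my own assumptions for illustration. With no noise in the signal, THD is what sets SINAD here.

```python
import cmath
import math

FS = 48_000      # sample rate in Hz (assumed)
F0 = 1_000       # test-tone frequency in Hz (assumed)
N = 4_800        # 100 full cycles of F0, so each harmonic lands on one bin


def synthesize(harmonics):
    """Build a tone from {harmonic number: linear amplitude}."""
    return [
        sum(a * math.sin(2 * math.pi * k * F0 * n / FS) for k, a in harmonics.items())
        for n in range(N)
    ]


def bin_amplitude(signal, freq):
    """Amplitude of one frequency component (single-bin DFT)."""
    acc = sum(x * cmath.exp(-2j * math.pi * freq * n / FS) for n, x in enumerate(signal))
    return 2 * abs(acc) / N


def analyze(signal, max_harmonic=5):
    """Return (THD in dB, per-harmonic levels in dBc)."""
    fund = bin_amplitude(signal, F0)
    amps = {k: bin_amplitude(signal, k * F0) for k in range(2, max_harmonic + 1)}
    thd_db = 20 * math.log10(math.sqrt(sum(a * a for a in amps.values())) / fund)
    # Floor absent harmonics at 1e-12 so log10 never sees zero.
    levels = {k: 20 * math.log10(max(a, 1e-12) / fund) for k, a in amps.items()}
    return thd_db, levels


# Two hypothetical amplifiers: same distortion energy, different harmonic order.
amp_a = synthesize({1: 1.0, 2: 1e-3})   # 2nd harmonic only
amp_b = synthesize({1: 1.0, 3: 1e-3})   # 3rd harmonic only

thd_a, levels_a = analyze(amp_a)
thd_b, levels_b = analyze(amp_b)
print(f"A: THD {thd_a:.1f} dB, H2 {levels_a[2]:.0f} dBc, H3 {levels_a[3]:.0f} dBc")
print(f"B: THD {thd_b:.1f} dB, H2 {levels_b[2]:.0f} dBc, H3 {levels_b[3]:.0f} dBc")
```

Both cases report the same single-number THD, while the per-harmonic readout is clearly different. Whether that difference is audible is a separate question that the single number cannot answer.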
There was no GRAS headphone measurement fixture mentioned. I have one here on my shelf.
If you read my comment, it was about a calibrated, laboratory-grade GRAS measurement microphone; it could be B&K as well. Think GRAS model number 46AG. I have a pair, plus others.
Op-amp rolling is about one question: are there reliably perceived sound differences among a handful of op-amps as they are swapped in and out of an amplifier?
Amplifier SINAD and FFT measurements alone cannot tell you whether there are any op-amp-caused, humanly perceptible sound differences coming out of the speaker. PEAQ in its basic form uses filters (think transfer function) and an FFT to model the function of an average human ear.
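To illustrate the "filter plus FFT" idea only, here is a toy sketch. It is not PEAQ, whose ear model is far more elaborate. As a crude stand-in for an ear filter it uses the standard A-weighting curve, and the spur levels are hypothetical values I made up for the example. It shows how the same raw FFT level can weigh very differently once a hearing-related transfer function is applied.

```python
import math


def a_weight_db(freq_hz):
    """A-weighting gain in dB at freq_hz (standard analog formula, 0 dB at 1 kHz)."""
    f2 = freq_hz ** 2
    num = (12194.0 ** 2) * f2 ** 2
    den = (
        (f2 + 20.6 ** 2)
        * math.sqrt((f2 + 107.7 ** 2) * (f2 + 737.9 ** 2))
        * (f2 + 12194.0 ** 2)
    )
    return 20 * math.log10(num / den) + 2.00


# Hypothetical FFT readout: the same -100 dBc spur at three different frequencies.
spurs_dbc = {100: -100.0, 1_000: -100.0, 3_000: -100.0}

# Apply the weighting curve to each FFT component.
weighted = {f: level + a_weight_db(f) for f, level in spurs_dbc.items()}
for f, level in weighted.items():
    print(f"{f:>5} Hz: raw {spurs_dbc[f]:.1f} dBc, weighted {level:.1f} dB")
```

The 100 Hz spur drops by roughly 19 dB after weighting, while the 3 kHz spur is slightly emphasized, even though all three look identical on the raw FFT.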
Audio Precision has its own version(s) of PEAQ.