Every single one.
Measured and modeled how? Remember even if one is a bit better both are probably below audibility. Sighted tests of audibility are mostly a waste of time.
Then do the measurements. Just listening and coming to conclusions is not convincing or reliable to find the truth.
Why would you play the music and measure with a microphone when you have a Focusrite? You can play the various configurations you use and measure the results using REW or Multitone. Or record the music outputs and compare with Deltawave. Using a microphone will place significant constraints on how clean, reliable and repeatable the measurements are. Do these measurement with the ADC in the Focusrite.
Here is a review with measurements of the 4th gen 2i2.