If we think it is sub related, then the OP should level-match using dB(A) and also take the dB(C) measurement, which implicitly says whether there is more bass or not…
We can hypothesize till the cows come home otherwise.
It is science actually.
The fellow did not want the more expensive speaker, and both he and his other half observed it in a repeatable fashion.
So when one tries to come up with a hypothesis to test, it is not religion; it does sound a bit like a scientific process.
But I totally agree that they need in-room measurements, which is what the dB(A) versus dB(C) numbers I suggested were.
I think I read your initial post as suggesting they needed to be in the room of the house that the OP and his other half own. But I think you may have meant the room that they listened to them in.
So we can argue whether they need to do pink noise or sweeps, or just level them using the app in dB(A) and then see what the app says for both sets in dB(C)… or what is your specific suggestion?
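To show why the dB(C) reading says something about bass once the dB(A) readings are matched, here is a rough Python sketch. It uses the standard A- and C-weighting curve formulas (the analog approximations from IEC 61672). The function names and the band levels are made up purely for illustration; they are not measurements of the OP's speakers, and applying the weighting at a handful of band centres is only an approximation of what a meter app does.

```python
import math

def a_weight_db(f):
    """A-weighting gain in dB at frequency f (Hz), standard curve formula."""
    f2 = f * f
    ra = (12194.0**2 * f2 * f2) / (
        (f2 + 20.6**2)
        * math.sqrt((f2 + 107.7**2) * (f2 + 737.9**2))
        * (f2 + 12194.0**2)
    )
    return 20 * math.log10(ra) + 2.00

def c_weight_db(f):
    """C-weighting gain in dB at frequency f (Hz), standard curve formula."""
    f2 = f * f
    rc = (12194.0**2 * f2) / ((f2 + 20.6**2) * (f2 + 12194.0**2))
    return 20 * math.log10(rc) + 0.06

def weighted_total_db(levels_by_band, weight):
    """Energy-sum the band levels (dB) after applying a weighting curve."""
    power = sum(10 ** ((lvl + weight(f)) / 10) for f, lvl in levels_by_band.items())
    return 10 * math.log10(power)

def c_minus_a(levels_by_band):
    """dB(C) minus dB(A): a bigger gap means more low-frequency energy."""
    return (weighted_total_db(levels_by_band, c_weight_db)
            - weighted_total_db(levels_by_band, a_weight_db))

# Hypothetical unweighted band levels in dB, keyed by band centre in Hz.
flat_pair = {50: 70, 100: 70, 1000: 70, 4000: 70}
pair_with_sub = {50: 80, 100: 76, 1000: 70, 4000: 70}

print("C-A, flat pair:    ", round(c_minus_a(flat_pair), 1))
print("C-A, pair with sub:", round(c_minus_a(pair_with_sub), 1))
```

The A curve cuts roughly 30 dB at 50 Hz while the C curve is nearly flat there, so extra sub output barely moves the dB(A) figure but pushes the dB(C) figure up. That is why matching in dB(A) and then comparing dB(C) is a quick check for more bass.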
Given it is in a shop, I suspect that they could go out of their way to make the more expensive speaker sound better, if they wanted to.
That could include some non-optimisation of the cheaper ones using a sub.
I am not saying that they did.
Either way, the OP needs a method that the store will abide by.