As a final note (don't worry JP, I'll go away) I am only removing the stylus from the cartridge, not the cartridge or the headshell. I'm fairly sure the stylus goes back to a consistent reference position each time, so should be repeatable.
I think if you want to see how (harmonic) distortion is affected by stylus wear, you first have to find the measuring source that gives the least amount of distortion from itself, and that could become pretty important very quickly.
For example: Jico states that a stylus is worn out at 3% thd at 15khz. So without knowing what type of numbers to expect that'll actually happen in your study, that's -30dBV
If you look at the plots from
@USER not even the ca trs 1007 record can measure that, in fact it sits 10dB above that so adding that 3%THD probably won't even show realistically (let alone to calculate how much it is), and that's for a fully worn stylus (according to Jico). I have no idea how Jico measures that figure, but as you're probably gonna run your experiment only once, getting a method to measure harmonic distortion with as much resolution as possible will probably be very important for the outcome of your experiment.
For that last one you could look at the tacet vinyl check record. It doesn't have a sweep but it has multiple tonebands and imho they measure quite low in distortion.
But in the end the amount of distortion is I think quite a rabbithole, as a louder test signal is more difficult to track at the same frequency than a quiteter one (so probably gives more harmonic distortion), but it's delta with the noise floor of the record etc is also greater so that noise floor has less of an impact on the measured distortion.
Also I think you should consider the question whether or not you should be looking at harmonic distortion to asses wear.
If wear results in mistracking, and if I listen at the Tacet IGD tracking test, harmonic distortion is not what in the first place comes to my mind (I wonder if I could actually hear harmonics of the S and F sibilants and fricatives frequency wise).
That test is imho for a lot of stylus shapes quite useful to asses wear btw. However it's not a numerical test, it's a listening test.