### Alternative method for measuring distortion

Exactly, adding randomness decreases information, not increases it.
If you add more noise the entropy will increase more; if you add a lot of noise the entropy will increase even more; at some high level of entropy the original signal will disappear completely. Entropy and information are reciprocal.
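To make the first part of that claim concrete, here is a minimal sketch that measures the empirical Shannon entropy of a quantized sine as Gaussian noise is added. Everything in it is an illustrative assumption rather than something from the thread: the 997 Hz tone, the amplitude of 100 steps, the noise levels, and the helper names `entropy_bits` and `noisy_quantized_sine`. It only shows that the entropy of the samples grows with the noise level; it says nothing about how much of the original signal is still recoverable.

```python
import math
import random
from collections import Counter


def entropy_bits(samples):
    """Empirical Shannon entropy (bits per sample) of a list of integer samples."""
    counts = Counter(samples)
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def noisy_quantized_sine(noise_sigma, n=20000, amplitude=100.0, seed=0):
    """Sine plus Gaussian noise, rounded to integer steps (illustrative parameters)."""
    rng = random.Random(seed)
    out = []
    for i in range(n):
        s = amplitude * math.sin(2 * math.pi * 997 * i / 48000)
        noise = rng.gauss(0.0, noise_sigma) if noise_sigma > 0 else 0.0
        # Round to the nearest integer step.
        out.append(math.floor(s + noise + 0.5))
    return out


sigmas = (0.0, 50.0, 500.0)  # noise levels in quantizer steps (assumed values)
entropies = [entropy_bits(noisy_quantized_sine(sigma)) for sigma in sigmas]
for sigma, h in zip(sigmas, entropies):
    print(f"noise sigma {sigma:6.1f}: {h:.2f} bits/sample")
```

The printed entropy should rise with each noise level, while the sine itself becomes progressively harder to see in the samples.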
For sure, linearization is mathematical, but its use in audio is determined by psychoacoustics.
They are from CDs without change. As is.
For sure, and it is not required in other ones.
Self-dither does not require linearization of the quantizer. Whether a signal is self-dithered or not cannot be determined by math, only by ear. So the necessity of linearization is determined by ear, not by math.
... and increases the quantization noise.
The information you are talking about is relevant only in the context of hearing. Linearization is not the main goal or necessity; it is just a means of achieving the main goal, which is to reduce the annoyance of granular quantization noise.
It is just a thought experiment helping to understand the difference between engineering/math/syntactic and semantic levels of audio information.
In various fields of application dither can serve various purposes; I'm talking about the benefits of dither in audio, where such benefits are determined by psychoacoustics.
Yes, this is another area where dithering is beneficial, and those benefits are also due to features of human perception.
No doubt, it is beneficial for hearing, not for math.
I would add - the loss of information, which is important for hearing. In other areas such information can be unimportant.
At this stage/level of the discussion such “general” arguments are not enough. Please elaborate: which exact premise do you mean, why is it incorrect, and what is the correct one? Here I will show how linearization of the quantizer by means of dithering increases the resulting quantization error. It is easy...
Linearization of the quantizer by dithering results in an increase of quantization error (that is, a lower SQNR). We need some reason for accepting such an increase. The reason is in psychoacoustics: the increased error is less audible and more pleasant to the ear.
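The SQNR cost of dithering can be checked numerically. The following is a minimal sketch under stated assumptions, not a reference implementation: a 997 Hz sine at half of full scale, a 16-bit step size, and TPDF dither spanning ±1 step are all choices made for illustration, and the helper names `quantize` and `sqnr_db` are hypothetical. In theory, TPDF dither of this width raises the total error power from about step²/12 to step²/4, i.e. it costs roughly 4.8 dB of SQNR.

```python
import math
import random


def sqnr_db(signal, quantized):
    """Signal-to-quantization-noise ratio in dB."""
    sig_power = sum(s * s for s in signal)
    err_power = sum((q - s) ** 2 for s, q in zip(signal, quantized))
    return 10.0 * math.log10(sig_power / err_power)


def quantize(signal, step, dither=False, rng=None):
    """Round each sample to a multiple of `step`, optionally adding TPDF dither first."""
    out = []
    for s in signal:
        d = 0.0
        if dither:
            # TPDF dither: sum of two independent uniform values,
            # each in [-step/2, step/2), giving a triangular PDF over +/- one step.
            d = (rng.random() - 0.5 + rng.random() - 0.5) * step
        out.append(step * math.floor((s + d) / step + 0.5))
    return out


rng = random.Random(0)
n = 48000
step = 2.0 / (2 ** 16)  # 16-bit step for a full scale of [-1, 1) (assumed)
signal = [0.5 * math.sin(2 * math.pi * 997 * i / 48000) for i in range(n)]

sqnr_plain = sqnr_db(signal, quantize(signal, step))
sqnr_dithered = sqnr_db(signal, quantize(signal, step, dither=True, rng=rng))
print(f"plain rounding: {sqnr_plain:.1f} dB")
print(f"TPDF dithered:  {sqnr_dithered:.1f} dB")
```

Plain rounding should come out with the higher SQNR. The sketch measures only the error power; it does not capture how the error sounds, which is the psychoacoustic part of the argument.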
I would mostly agree here. Applied math cannot be used in this case, because we don't know the application area of the signal. And the best strategy for quantizing an unknown signal is rounding, as it provides the best Signal-to-Quantization-Noise Ratio. There are several other quantizing...
Yes, this is in more detail.
Agree. This really doesn't matter. That is why I added: "I can expect that this looks unusual and understand why you say "dithering of 16bit signal" in this case. Not important". Meaning that "dithering of 16bit signal" is also correct and explained why I said "dithering of 32bit signal". Yes...
Everything is math and engineering, even psychoacoustics is full of math. But I'm talking here about different levels of information - engineering and semantic (you pointed to the appropriate article by Warren Weaver above). In order to understand better the difference between those levels I...
Considering that:
- the noise is added before quantization (to 32bit signal)
- dithering is used not only in combination with quantization (not in audio)

I prefer to say "dithering of 32bit signal" just because the noise is added to it. I can expect that this looks unusual and understand why...