Intuitively, the opposite seems right to me. Lower frequencies have more sample points along each crest, offering more chances (sample points) to bias the dither (more chances for a randomly picked 0 to become a 1). Also less quantization error because more dice rolls give a smoother approximation and higher confidence estimate of the relative amplitude.
Put differently, if only a single sample point LSB is a 1, it cannot be differentiated from the random process that created the dither. But if, say, 7 of 8 consecutive samples all have an LSB of 1, it's looking less random and more likely you have encoded a signal whose amplitude is some fraction of the LSB. Not necessarily 7/8 of the LSB, of course.
But this is really a question for
@j_j , or likely, one that he's already answered somewhere here at ASR.