The paper is
here.
The relevant passage is:
"In some musical instruments, acoustic pressure builds up extremely fast during onset transients reaching tens of dBs within a few microseconds. For example, transient onset of xylophone shows waveforms with rise time of less than 10 μs with instantaneous peak output reaching 126dB SPL. Trumpet playing fortecan register 120-130dB peak SPL with steep rise of the waveform within only 10μs to full signal level. Snare drum reaches 130dB and cymbals 136dB peak SPL within microseconds [9].Rogowski ‘s values were measured during a short musical selection captured using 1/4 inch, Brüel&Kjaer 4135 microphone and a 192kHz, 12bit A/D conversion. Can CD-rate sampling of audio every 22.7μs register the full waveform detail of the sound of these instruments? Based on these onset requirements, to achieve a transparent recording medium, one should sample audio with less than 1μs between samples to accurately capture steep waveform changes. One revealing transient test of recording system is to use the sound of dangling keys or striking wine glasses as a source. Our familiarity with these sounds is frequently refreshed and can be used to inform us of problems with the transparency of the recording system. A brutal case of poor transparency occurred in early consumer CD players that used single D/A converter working with both channels in multiplex. Noticeable directional shifts were sometimes heard on transients when left and right channel converted the same onset in a sequence of samples."
There's some confused information in that. Most importantly, the author seems to forget the relationship between risetime and frequency.
If an acoustic source's output has a risetime of under 22.7us, that is simply because it contains frequency content above 22.05KHz (ie the maximum frequency that redbook can accurately capture, and a bit above the maximum frequency that most humans can hear).
So the higher frequency content is present, and measurable (both in the frequency domain and the time domain, which is the domain that the above passage focuses on) - but it is not audible. The highest
audible content that is present simply can't have a risetime of less than around 23 or more, depending on the hearing ability of the person listening of course.
In other words, redbook
cannot capture this instrument in its fullness, but it
can capture every component of the instrument that is audible to humans.