• Welcome to ASR. There are many reviews of audio hardware and expert members to help answer your questions. Click here to have your audio equipment measured for free!

Let's develop an ASR inter-sample test procedure for DACs!

On the practical topic of preventing intersample overs, do we know how the volume control is implemented on common devices such as Wiim streamers/amps or AVRs? Or is the sort of volume control required not commonly implemented?
 
On the practical topic of preventing intersample overs, do we know how the volume control is implemented on common devices such as Wiim streamers/amps or AVRs? Or is the sort of volume control required not commonly implemented?
The most common volume control implemented on DAC chips and in DSP is a digital one, which is exactly what you'd want to lower the level to avoid ISOs. An analog volume control will not help, since it doesn't modify digital and that's where the ISO problem occurs. So, the question is: is the volume control implemented in the digital or analog domain?
 
Note that 4x oversampling can miss at most a 0.688dBFS ISP for a signal at 1/2 the original sampling rate (maximum valid frequency).

So, a -1dBTP is perfectly acceptable as a target:

That's what I'm saying. 4x oversampling helps but it doesn't solve the issue. Anything above 0, even if it's +0.001dBFS results in degradation of the original information. The rule is that digital audio should not be clipping, ever. It's not the same as analog audio.

This topic is bigger than annoying music production trends. It's about faithfully transducing digital information into analog audio. If there is an opportunity to improve such transduction, we shouldn't push against it IMO.

The original waveform either exceeds 0dBFS level in the analog domain, or it doesn't.
Nope. ISOs are invalid data because the input of the ADC has been overdriven.
One way to prevent ISOs is to add an analog peak detector at the input of the ADC which fires when the signal gets higher than the ADC is specified to accept.
This is correct. Leave enough headroom is the proper way to prevent them.

This is all completely wrong.

Virtually every professionally recorded signal is recorded leaving ample headroom to avoid digital clipping. dB meters are standard in audio interfaces, mixers and DAWs.

The reason why the digital samples get close to 0dBFS in finished audio products is what happens in the digital realm during audio processing: summing, gain boosting, compression, limiting, EQs, etc.

Even still, if the DSP/mix/master is done properly (which is most of the time, unless we're talking amateur production) none of the samples ever go above 0dBFS. Yes, even in the masters that are brickwalled to infinity.

With a "proper" (<0dBFS) master, the only time when samples can go past 0 is when an oversampling filter (or any digital filter, really) is applied, which is what happens in modern DACs.

Are you sure about that? My understanding is that ISO's are a random phenomenon due to the sampling theorem itself which only becomes a problem when sampling at or near 0 dB.

There is nothing random about this phenomenon. Every sampled wave, when reconstructed, goes past the sample points, except when the samples happen to coincide exactly with the peaks of the wave. When the samples get closer to 0dBFS, it's likely that the oversampled wave will run out of headroom and it will therefore get chopped.

Check earlier posts in the thread to learn more.

Are ADC's even used extensively besides vocals in most modern recordings? Seems like most of the "instruments" are just digital synthesisers? Could be wrong and trying to understand.

Yes, but it doesn't really make a difference. A properly digitally recorded signal and a digitally synthesized signal are identical in this context (they're all samples).

Once again, ISPs have almost nothing to do with ADCs or the recording/sampling process. It's a misconception.

The only time you can blame the recording/sampling step for ISPs over 0dBFS is when you record a signal that is almost clipping, then do nothing to it (no level adjustment, no compression, no nothing), then run it through a DAC or digital filter. The voice notes on your phone might have this issue.

6 dB can be spared easily as S/N of modern DACs is high enough.

Indeed. Lowering the volume before oversampling is the way to solve clipped ISPs. Ideally you'd have a dynamic system that lowers the volume just enough for a given audio track, but I don't know how one would go about implementing it. Surely it can be done in software.

On the practical topic of preventing intersample overs, do we know how the volume control is implemented on common devices such as Wiim streamers/amps or AVRs? Or is the sort of volume control required not commonly implemented?

The purpose of this thread is precisely to get reviewers to include ISP testing in their suites, because right now the majority's opinion is that ISPs aren't a big enough problem. That means we don't know how most DACs will perform in this area. I would say that they are an even bigger problem than sub-optimal SINAD, but that's just me.
 
Last edited:
Even still, if the DSP/mix/master is done properly (which is most of the time, unless we're talking amateur production) none of the samples ever go above 0dBFS
What do you mean by a sample? If we have a 16-bit CD recording, then the samples physically can't exceed +32767 and -32768, which gives us 0 dBFS. But when converting to analog, modern recordings use output filters to restore those peaks that were intentionally cut off in digital format. Simply to make it 1-2 dB louder and more sharper. And, unfortunately, many, many publishers do this these days. Especially in popular music. Classical and jazz are more often recorded correctly.
 
That's what I'm saying. 4x oversampling helps but it doesn't solve the issue. Anything above 0, even if it's +0.001dBFS results in degradation of the original information. The rule is that digital audio should not be clipping, ever. It's not the same as analog audio.

Just master to -1dBTP level or below using a proper TP meter (4x oversampled), as recommended. This will avoid all ISPs as long as the mastered signal contains only valid frequencies. DAW plugins to do this are commonly available -- it's not a complex task. And it's a standard recommendation that's been published for nearly 20 years. The fact that mastering engineers don't follow recommendations in the pursuit of greater and greater loudness is on them, not on equipment manufacturers.
 
This is all completely wrong.
You quoted a number of out-of-context posts, so it's hard to know what it is you're responding to. What I'm talking about is the analog waveform being represented by the produced/mastered PCM recording. PCM is a sampled representation of an analog waveform that either exceeds 0dBFS or it doesn't. It doesn't matter how much the DAC oversamples. Any kind of resampling or interpolation can result in an ISP if the analog waveform levels exceed 0dBFS. Is this wrong?

With a "proper" (<0dBFS) master, the only time when samples can go past 0 is when an oversampling filter (or any digital filter, really) is applied, which is what happens in modern DACs.
The point is that 0dBFS is not a proper target. -1dBTP is the proper target for mastering, if you care to avoid ISOs.
 
Last edited:
What do you mean by a sample?

Poor wording choice on my part. When I say that a sample "goes over 0dBFS" I simply mean that it's a clipped sample. The analog wave would have to go past that value to be properly reconstructed, but it just can't.

You responded to a number of out-of-context posts, so it's hard to know what it is you're responding to.

I'm saying that the claim that clipped ISPs (over 0dBFS) are created when recording/sampling a signal is wrong. That's basically only true if you record at clipping levels, which doesn't happen unless we're talking about people that don't know how recording works.

It doesn't matter how much the DAC oversamples. Any kind of resampling can result in an ISP if the analog waveform levels exceeds 0dBFS. Is this wrong?

It is wrong. every time you increase the upsampling factor (meaning, you insert samples between samples) the Peak of the digital wave increases in amplitude.

With just 4 samples per wave, we are perfectly fine here:

isps-2.jpg


With 9 samples per wave, we're clipping:

isps-4.jpg


Let's say you adjust the volume so that you don't clip anymore. You'll be fine until you increase the oversampling factor, at which point you'll be clipping again.

That's just how oversampling works.
 
It is wrong. every time you increase the upsampling factor (meaning, you insert samples between samples) the Peak of the digital wave increases in amplitude.
Whaat? No, that's totally wrong. Upsampling doesn't change the waveform that the samples represent. It just fits more points on the same curve, the curve doesn't get taller because you oversample. It doesn't matter if you upsample to 10000x, the peak will never go beyond the peak of the analog representation.
 
Whaat? No, that's totally wrong. Upsampling doesn't change the waveform that the samples represent. It just fits more points on the same curve, the curve doesn't get taller because you oversample. It doesn't matter if you upsample to 10000x, the peak will never go beyond the peak of the analog representation.

You don't get what I'm saying. The samples can never go past the original sampled wave (assuming we're not filtering in the passband), but if you insert more samples between samples, then the digital waveform becomes more and more like the original sampled analog waveform. If the original sampled waveform needs to go above 0dBFS to be properly reconstructed, then the samples have to follow suit, but they can't. So we get clipping.

In the digital realm, the oversampled wave does get relatively taller than the non-oversampled one, just like a real analog wave would get taller if filtered.

This is the article I took the pictures from btw.
 
You don't get what I'm saying. The samples can never go past the original sampled wave (assuming we're not filtering in the passband), but if you insert more samples between samples, then the digital waveform becomes more and more like the original sampled analog waveform. If the original sampled waveform needs to go above 0dBFS to be properly reconstructed, then the samples have to follow suit, but they can't. So we get clipping.

In the digital realm, the oversampled wave does get relatively taller than the non-oversampled one, just like a real analog wave would get taller if filtered.

This is the article I took the pictures from btw.

OK, but you said that this was wrong:
PCM is a sampled representation of an analog waveform that either exceeds 0dBFS or it doesn't. It doesn't matter how much the DAC oversamples. Any kind of resampling or interpolation can result in an ISP if the analog waveform levels exceed 0dBFS. Is this wrong?

Oversampling or any kind of interpolation or DSP can reveal ISPs. It can occur with 2x oversampling, or 10000x oversampling. But ISPs will not occur if the mastered recording is scaled not to exceed -1dBTP. This is true with any oversampling ratio.
 
"It's also worth also noting that most streaming services require that music uploaded to them has a maximum true peak of -1 or -2 dB. (The 1 or 2 dB headroom is to allow for errors when the music is streamed via a lossy encoding process.) The web page Mastering for Streaming: Platform Loudness and Normalization Explained, for example, lists the maximum allowed true peak – shown as e.g. -1 dBTP – of several services."
"Streaming services" respect this? They explain that? Qoobuz tidal Amazon etc?
 
"It's also worth also noting that most streaming services require that music uploaded to them has a maximum true peak of -1 or -2 dB. (The 1 or 2 dB headroom is to allow for errors when the music is streamed via a lossy encoding process.) The web page Mastering for Streaming: Platform Loudness and Normalization Explained, for example, lists the maximum allowed true peak – shown as e.g. -1 dBTP – of several services."
"Streaming services" respect this? They explain that? Qoobuz tidal Amazon etc?
Not sure where they got "require" from. More like recommend. Spotify:
Mastering tips
...
These guidelines also apply to tracks delivered for lossless playback.
 
"Streaming services" respect this? They explain that? Qoobuz tidal Amazon etc?

At least in the case of Spotify, it is up to the distributor/artist or whoever uploads music on the platform to comply with these recommendations.

But normalization is available and that does help a lot.

Oversampling or any kind of interpolation or DSP can reveal ISPs. It can occur with 2x oversampling, or 10000x oversampling. But ISPs will not occur if the mastered recording is scaled not to exceed -1dBTP. This is true with any oversampling ratio.

Yes, any oversampling factor can reveal clipped ISPs above 0dBFS. But you can only completely get rid of oversampling clipping when the loudest sample post-oversampling is below or at 0dBFS.

Since ISPs have no theoretical limit, the only way to be 100% sure you don't get any digital clipping is for the True Peak oversampling factor to match the one found in DACs (meaning that the loudest sample has the same value).

You also have to consider that audio waveforms are hardly ever periodic in nature, so estimates made with sine waves or test signals will not apply exactly to the real world.
 
There is nothing random about this phenomenon. Every sampled wave, when reconstructed, goes past the sample points, except when the samples happen to coincide exactly with the peaks of the wave. When the samples get closer to 0dBFS, it's likely that the oversampled wave will run out of headroom and it will therefore get chopped.

Check earlier posts in the thread to learn more.
Since you can't predict if a given close to or at 0dBFS sample will cause an ISO that to me is random. This is true even for steady state signals and much more so for a music signal.
 
"It's also worth also noting that most streaming services require that music uploaded to them has a maximum true peak of -1 or -2 dB. (The 1 or 2 dB headroom is to allow for errors when the music is streamed via a lossy encoding process.) The web page Mastering for Streaming: Platform Loudness and Normalization Explained, for example, lists the maximum allowed true peak – shown as e.g. -1 dBTP – of several services."
"Streaming services" respect this? They explain that? Qoobuz tidal Amazon etc?
Then I do not understand how so many songs on Tidal (lossless) still trigger the clipping indicators on my RME ADI-2 Pro..
 
(My "copy".. Just a part of it)
Thank you for posting this. I know all the theoretical background, just wanted to highlight that intersample overs are still a topic with current music streaming, so no "solved" issue.

Funny enough - I was one of several persons who brought up this topic in the minidsp user forum and asked for a SHD firmware update to allow volume adjust prior to its ASRC. No reaction from the minidsp devteam at that time. As a result, I sold my SHD and switched to a RME solution. Now they seem to have implemented the fix as originally requested by me and even provide an explanation in their manual. Good for all other users, too late for me.
 
Last edited:
Since ISPs have no theoretical limit, the only way to be 100% sure you don't get any digital clipping is for the True Peak oversampling factor to match the one found in DACs (meaning that the loudest sample has the same value).

This is where you’re missing the fact that ISO can’t be “unlimited” for a properly bandwidth limited signal. It is limited and can only exceed the sampled values by a maximum of 0.6888dB when oversampled by 4x and even less when oversampled by a larger factor.
 
Since you can't predict if a given close to or at 0dBFS sample will cause an ISO that to me is random. This is true even for steady state signals and much more so for a music signal.

Of course you can. That's what True Peak metering is for. True Peak metering uses oversampling to "fill in the gaps" in the sample stream so that you can know exactly where ISPs are.

The tough part would be predicting where ISPs are going to be before digitizing a signal. With a predictable signal, I guess you could time the sampling so that the individual samples end up exactly where you want them to be.

What I mean is, in a lab, you can replicate results with 100% consistency.

This is where you’re missing the fact that ISO can’t be “unlimited” for a properly bandwidth limited signal. It is limited and can only exceed the sampled values by a maximum of 0.6888dB when oversampled by 4x and even less when oversampled by a larger factor.

That value is the result of a test done with a perfectly consistent periodic signal like a sine wave, not with actual audio material. If we only ever listened to sine waves, then sure, 4x would be fine. In fact, we could just calculate the exact value of peaks and we wouldn't need True Peak metering at all.

From the Recommendation ITU-R BS.1770-5 document:

“For continuous pure tones it is easy to demonstrate, for example, a 3 dB under-read for an unfortunately-phased tone at a quarter of the sampling frequency. The under-read for a tone at half the sampling frequency could be almost infinite; however most digital audio signals do not contain significant energy at this frequency (because it is largely excluded by anti-aliasing filters at the point of D/A conversion and because ‘real’ sounds are not usually dominated by continuous high frequencies).”

We're talking edge cases, but cases nonetheless.
 

seems in any case assumed in the production of essentially electronic (or mixed) music, or even let's say..."commercial"...

in the end not very demanding rating rendered..(and it is an edifying proof)
Sorry
 
Back
Top Bottom