
Let's develop an ASR inter-sample test procedure for DACs!

According to the SDK documentation it should be available. Or at least it should be possible if the application (Wiim in this case) chooses to do so. From the changelog:

SpStreamInfo:
But it's the official Spotify app I am using, on my phone. And this does not allow for that. If their own app does not do that, what is the chance that some manufacturer will implement it?

And is this SDK really for Spotify Connect? Or for developing an app that runs inside the device? Because I am talking about Spotify Connect, not the app inside the Wiim. Spotify Connect and Tidal Connect do not depend on the app inside the Wiim, and I think most people use them that way, because then you just use the app you already have installed on your phone, in this case Spotify. Tidal does the same thing. That is especially useful for guests who want to play some of their music.
 
And how does it look with volume normalization enabled?
And also, do you think that Spotify has different tracks, one without normalization and one with? And then one for the "Quiet", one for the "Normal" and one for the "Loud" setting? I can hardly believe that. Why should they? I bet they use the same unnormalized track, transmit that to the device, decode it there, and only then apply those settings. But as I said, none of these settings exist if you use Spotify Connect (via the official Spotify app).
 
But it's the official Spotify app I am using, on my phone. And this does not allow for that. If their own app does not do that, what is the chance that some manufacturer will implement it?
AFAIU Spotify Connect means that you can control the device from the Spotify app on your mobile. In order to do that, there has to be code running on the device, and that code uses the SDK. That code, in the nomenclature of the SDK, is called the "Application":
Application: The partner code that uses the eSDK.
And that's what I meant in my comment.

And also, do you think that Spotify has different tracks, one without normalization and one with?
No, they have a single track, which the client (desktop) decodes to PCM in float format and then applies the normalization. So if there are samples which end up greater than 1.0 after decoding, they won't be clipped when the normalization is enabled.
 
No, they have a single track, which the client (desktop) decodes to PCM in float format and then applies the normalization. So if there are samples which end up greater than 1.0 after decoding, they won't be clipped when the normalization is enabled.
Well, but then the damage to the track was already done, since with Spotify we are now talking about lossy compression. And lossy compression creates its own overshoots, which are not only intersample but real sample overshoots. So if the track is that loud during encoding, there is no way to heal that. I explicitly used Spotify Connect to be sure to get the real track, and not a post-processed version which might have had volume changes applied before being transmitted via SPDIF.

Here is a showcase of what lossy compression does with loud tracks. You see the waveform of the original FLAC track from my previous post; I decreased its volume by 3 dB, converted it to AAC (maximum bitrate), and then zoomed into the (decoded) waveform:
flac-3dBToAac.png


Everything above the red line is an actual sample higher than -3 dBFS. But since I encoded the original at -3 dBFS, all of those would have needed to be above 0 dBFS, and since that is not possible they would have clipped. And we are talking actual samples here, not intersample peaks. This is the issue with lossy compression if the original is too close to 0 dBFS.

So coming back to the discussion: this is why lossy compression compounds the problem of intersample peaks. Because if we are already so loud that intersample peaks are a problem, what happens when we additionally get clipping from the decoder because it would have wanted to put real samples above 0 dBFS?
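The mechanism can be imitated in numpy. This is not an actual AAC round-trip: a brickwall lowpass stands in for the codec discarding spectral detail, but the effect is the same kind, namely a band-limited reconstruction of a heavily limited 0 dBFS waveform producing real samples above full scale:

```python
import numpy as np

n = 4096
t = np.arange(n)
# A clipped sine riding at 0 dBFS -- a stand-in for a loudness-war master.
x = np.clip(1.4 * np.sin(2 * np.pi * 40 * t / n), -1.0, 1.0)
assert np.max(np.abs(x)) <= 1.0   # no sample exceeds full scale yet

# Crude "codec": keep only the lowest 10% of the spectrum
# (fundamental plus 3rd and 5th harmonics survive here).
X = np.fft.rfft(x)
X[int(len(X) * 0.10):] = 0.0
y = np.fft.irfft(X, n)

# The band-limited reconstruction overshoots full scale: these are
# real samples that would clip when written as integer PCM.
assert np.max(np.abs(y)) > 1.0
```

A real codec quantizes the spectrum rather than truncating it, but the takeaway holds: once waveform detail is thrown away, the reconstruction is free to land above 0 dBFS, and it does exactly when the source was mastered too hot.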
 
So if the track is that loud during encoding, then there is no way to heal that.
Of course there is. It is exactly what I wrote in what you quoted. Decode it to PCM in float format and reduce the volume before converting to integer format. Here are some examples:
 
Of course there is. It is exactly what I wrote in what you quoted. Decode it to PCM in float format and reduce the volume before converting to integer format. Here are some examples:
If it's done in float, yes, true. But the point is that normalization is not available when using Spotify Connect. You suggest that this would be something Wiim needs to implement on their side. Does anybody know if any manufacturer has implemented Spotify Connect in a way where the normalization setting is available in the Spotify app? I'm asking because I have used Spotify Connect to steer multiple devices, and I have never seen that option enabled.
Or, to move away from Spotify: does Tidal allow for volume normalization in their app when using Tidal Connect?
 
Just to show the state of things when we are talking streaming:
View attachment 427966
So above is an already bad lossless FLAC file which I bought from Qobuz. At the bottom we see what we get if we stream that track via Spotify (I recorded the digital signal directly from the SPDIF output of the Wiim Ultra). Hmm, that looks really bad. I mean, AAC is actually a really good encoder: at a bitrate of 192 kbps it would be essentially transparent, and Spotify uses 320 kbps. However, it was never intended to encode near 0 dBFS, or god forbid above 0 dBFS. And what we see here is how badly the whole industry is in denial of that issue.

It really saddens me to say it, but the audiophiles are right: only lossless, bit-perfect streams are the way to go here. Not because lossy compression is so bad, but because it is completely misused!
And even worse, concerning the analog outputs of our hardware (which I showed in posts #642, #644), or even worse from resamplers (#621), the only way out for the consumer is to use lossless sources with as high a sampling rate as possible!! The audiophiles were right about high-res all that time! Can you believe what I just wrote?! I cannot even believe it myself! And why? Because the high sampling rate of high-res allows only very minuscule intersample peaks in the audible range!

Here are the plots backing that up with actual science.
First the same plot I showed in post #647 which was at 44.1kHz sampling rate:
View attachment 427972

And now at 192kHz:
View attachment 427974
Look at the values on the ordinate (aka y-axis). The difference is not what the ratio between 192 kHz and 44.1 kHz would suggest (192/44.1 = 4.35); it is actually a factor of 20.72 better!
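The underlying effect is easy to reproduce. The sketch below (numpy only, an idealization of the plots above, not a reproduction of them) takes the same 11.025 kHz tone at both rates, normalizes the sample peak to 1.0, and estimates the true peak by ideal band-limited interpolation: at 44.1 kHz the tone sits at fs/4 and the worst-case phase overshoots by ~3 dB, while at 192 kHz the dense sample grid leaves almost nothing between the samples:

```python
import numpy as np

def true_peak(x, factor=16):
    """True peak via FFT zero-padding: ideal band-limited
    interpolation, assuming the block is exactly periodic."""
    X = np.fft.rfft(x)
    m = len(x) * factor
    padded = np.zeros(m // 2 + 1, dtype=complex)
    padded[:len(X)] = X
    return np.max(np.abs(np.fft.irfft(padded, m) * factor))

# 11.025 kHz at fs = 44.1 kHz: fs/4, only 4 samples per cycle,
# worst-case phase puts every sample at +/-0.707 of the true peak.
x44 = np.sin(2 * np.pi * np.arange(4096) / 4 + np.pi / 4)
x44 /= np.max(np.abs(x44))

# Same audio frequency at fs = 192 kHz: 11025/192000 = 147/2560,
# so 147 cycles fit exactly in 2560 samples.
x192 = np.sin(2 * np.pi * 147 * np.arange(2560) / 2560 + np.pi / 4)
x192 /= np.max(np.abs(x192))

over44 = 20 * np.log10(true_peak(x44))    # ~ +3 dB intersample overshoot
over192 = 20 * np.log10(true_peak(x192))  # ~ 0 dB
assert over44 > 2.5
assert over192 < 0.2
```

This is the single worst case for one tone, not the statistics behind the factor of 20.72 in the plots, but it shows why pushing the same audible-band content to a higher sample rate shrinks the intersample problem so drastically.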

So here we are: because nobody in the industry cares, the user actually has to follow what audiophiles have always been saying and use lossless high-res sources. And actually DSD (aka SACD) is the king here, because it brings the problem down to a mathematically perfect 0! And really, I must ask: is that perhaps what our audiophile friends thought they heard when they said that bit-perfect high-res and SACD sounded more "natural" than resampled, lossy sources?

Even for high sample rate formats, ISOs will approach infinity as the frequency approaches the Nyquist frequency. It is of course lower in audible band, but why restrict the concern to that if the oversampling filter being overloaded produces distortion for any sample rate? Simply reducing the volume before sample rate conversion goes a long way and as you show, assuming only harmonic distortion products in the audible band, a little over 1 dB will do the trick at 44.1 kHz. Still, those high level harmonics can produce intermodulation distortion or damage downstream components.

If one already owns the files, an easy thing to do is determine the true peak value by volume-reduced many-fold oversampling and lower the volume such that the peak does not exceed the digital maximum value at very high sample rates, e.g. 705.6 kHz or 768 kHz.
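A sketch of that file-level workflow in numpy (FFT zero-padding stands in for the many-fold oversampler; the "volume-reduced" part of the suggestion matters for fixed-point pipelines, while in float it is harmless to skip):

```python
import numpy as np

def true_peak(x, factor=16):
    """Estimate the true peak by ideal band-limited interpolation
    (FFT zero-padding; assumes the block is exactly periodic --
    a polyphase resampler would be used on real files)."""
    X = np.fft.rfft(x)
    m = len(x) * factor
    padded = np.zeros(m // 2 + 1, dtype=complex)
    padded[:len(X)] = X
    return np.max(np.abs(np.fft.irfft(padded, m) * factor))

# Worst-case fs/4 tone: after normalization every sample touches
# full scale, yet the continuous waveform peaks ~3 dB higher.
x = np.sin(2 * np.pi * np.arange(4096) / 4 + np.pi / 4)
x /= np.max(np.abs(x))

tp = true_peak(x)            # ~1.414 (i.e. ~+3 dB over full scale)
gain = min(1.0, 1.0 / tp)    # only ever attenuate, never boost

# After the correction, the true peak no longer exceeds digital max.
assert true_peak(x * gain) <= 1.0 + 1e-9
```

The same idea at 705.6/768 kHz target rates just means a larger oversampling factor; the gain derived from the measured true peak is what guarantees the reconstruction never asks for more than 0 dBFS.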
 
As I read the comments with regard to ASRCs such as in the AD DSP chips, I can't understand why they'd be problematic in themselves, as they're 32-bit chips and therefore have significant headroom.
What am I missing? Are they not capable of correctly using the extra bits available?
 
as they're 32-bit chips and therefore have significant headroom.
Internally, that is. I had a look and it seems that 4 bits are added to incoming 24-bit data for headroom (translating to ~24 dB worth):
Doesn't mean the ASRCs do.

It shouldn't really matter for things like a DSP crossover anyway, as normally you'd be far away from 0 dBFS levels due to the DSP being preceded by master volume. Unless, of course, that is actually inside the DSP.
 