Since yesterday I've been doing heavy ABX tests on 128kbps Opus vs others at 96 to 256k, It's pretty much the quality of 192 to 320kbps AAC/Vorbis/MP3.
While Opus is certainly extremely good, grouping AAC/Vorbis and MP3 is a grave mistake. A bit like saying that MPEG-2 and MPEG-4 AVC are the same.Since yesterday I've been doing heavy ABX tests on 128kbps Opus vs others at 96 to 256k, It's pretty much the quality of 192 to 320kbps AAC/Vorbis/MP3.
How? Lossy compression is all about psychoacoustic tuning. Same way we can't really measure video encoder performances without looking, for now; yes, even with VMAF getting better.Our ears are rather insensitive, but a spectral analysis will give one better insights...
How?
Do you not know what a spectral analysis is?
I think he is right, spectral analysis won't really tell you how good lossy codec is
Well, you can think whatever you like, but that doesn't mean that's what actually happens when these things are developed and evaluated: a spectral analysis would very much be part of that process, along with ABX testing...
@j_j might be willing to enlighten some on the development of audio codec's .
I believe it dose come down to having folks listen to it.
Don't be so sure - with lossy codecs it's mostly about dropping parts of the signal that are masked and that happens dynamically meaning not always will the same frequency be cut and spectral analysis won't show that. 2 lossy codecs can have very similar spectrum graph yet they will sound completely different. The point is that when looking at spectral analysis you can't even begin to guess about the true strategy of a lossy codec.
That's a good point - looking at the spectra of the total signal at once may not be terribly revealing.
More useful would be to difference the files in the time domain - lossless minus lossy - then create a spectrogram of that result.
Maybe an experiment for a lazy weekend (i.e. not this one ).
EDIT: oops, meant "lossy minus lossy".
Depends on the bit rate and the material. At 128 kbps or lower, the differences can be quite apparent. If you want to compare lossy codecs, you need to push them until they start falling apart. That's where the differences will reveal themselves.Good idea. Although looking at these 2 spectrums I suspect that even that one wouldn't be able to tell much about differences. I never did ABX comparison between mp3 and AAC but I suspect the difference would be far from obvious.
Depends on the bit rate and the material. At 128 kbps or lower, the differences can be quite apparent. If you want to compare lossy codecs, you need to push them until they start falling apart. That's where the differences will reveal themselves.
The only way to evaluate these is by blind listening tests (MUSHRA, ABX, etc) with enough listeners. And with high rates, the first step is to find critical material (stuff that is hard to encode). Examples that come up a lot are applaud (try stereo too), castanets, speech (German is tough to encode for some reason), bagpipes and other tonal instruments. These stress either the time or frequency behavior of a codec.
Have fun, but accept that this is hard work.
That's true, a problem with MUSHRA is too many things to compare at the same time, so people don't put the effort where it counts if some items are too similar. Since the OP is interested in high bit rates, your recommendation for ABC/hr is spot on.Except that ABC/hr is better than MUSHRA, please, all of that ^^^^
We can make measures of video encoder performance (not ignoring the requirement for controlled subjective analysis)While Opus is certainly extremely good, grouping AAC/Vorbis and MP3 is a grave mistake. A bit like saying that MPEG-2 and MPEG-4 AVC are the same.
How? Lossy compression is all about psychoacoustic tuning. Same way we can't really measure video encoder performances without looking, for now; yes, even with VMAF getting better.
Are there tests that show opus becomes transparent at a bitrate, if it's better than aac at high rates, not just low it might manage it.
Is there a good reason it's not being adopted more, other than inertia.
We very much can make measures of video encoder performance
PSNR