La La Land (2016) is a really fun movie that won a bunch of Academy Awards including Best Original Score and Best Original Song.

The jazz songs were recorded at Conway Studios which features nice Genelec monitors

and the final mixes were done with Neumann KH monitors

Justin Hurwitz,the composer of La La Land also famously uses JBL 3 series speakers.

If you're interested in the soundtrack, you'll see that the "high-resolution" version on streaming is 24-bit/44.1 kHz. If you want physical media, the only high resolution option is a SACD released in 2023!

DeltaWave v2.0.13, 2024-09-14T15:27:02.0361363-07:00

Reference: 05 - Herman's Habit.dsf[L] 9848832 samples 88200Hz 24bits, mono, MD5=00

Comparison: 05. Justin Hurwitz - Hermans Habit.flac[L] 4923912 samples 44100Hz 24bits, stereo, MD5=00

Settings:

Gain:True, Remove DC:True

Non-linear Gain EQ:False Non-linear Phase EQ: False

EQ FFT Size:65536, EQ Frequency Cut: 0Hz - 0Hz, EQ Threshold: -500dB

Correct Non-linearity: False

Correct Drift:True, Precision:30, Subsample Align:True

Non-Linear drift Correction:False

Upsample:False, Window:Kaiser

Spectrum Window:Kaiser, Spectrum Size:32768

Spectrogram Window:Hann, Spectrogram Size:4096, Spectrogram Steps:2048

Filter Type:FIR, window:Kaiser, taps:262144, minimum phase=False

Dither:False bits=0

Trim Silence:True

Enable Simple Waveform Measurement: False

Resampled Reference to 44100Hz

Discarding Reference: Start=0s, End=0s

Discarding Comparison: Start=0s, End=0s

Initial peak values Reference: -0.013dB Comparison: 0dB

Initial RMS values Reference: -14.929dB Comparison: -14.928dB

Null Depth=13.472dB

Trimming 273 samples at start and 1 samples at the end that are below -90.31dB level

X-Correlation offset: 10337 samples

Trimming 1 samples at start and 0 samples at the end that are below -90.31dB level

Drift computation quality, #1: Excellent (0μs)

Trimmed 423 samples ( 9.591837ms) front, 0 samples ( 0.00ms end)

Final peak values Reference: -0.013dB Comparison: 0.154dB

Final RMS values Reference: -14.919dB Comparison: -14.918dB

Gain= -0.0006dB (0.9999x) DC=0 Phase offset=234.393601ms (10336.758 samples)

Difference (rms) = -47.03dB [-45.85dBA]

Correlated Null Depth=66.72dB [63.92dBA]

Clock drift: 0 ppm

Files are NOT a bit-perfect match (match=1.75%) at 16 bits

Files are NOT a bit-perfect match (match=0.01%) at 24 bits

Files match @ 49.5625% when reduced to 8.75 bits

---- Phase difference (full bandwidth): 3.33510941694337°

0-10kHz: 4.92°

0-20kHz: 3.50°

0-24kHz: 3.34°

Timing error (rms jitter): 2.8μs

PK Metric (step=400ms, overlap=50%):

RMS=-45.8dBr

Median=-46.6

Max=-40.2

99%: -40.98

75%: -44.58

50%: -46.61

25%: -49.0

1%: -60.19

gn=1.00007013560789, dc=-1.46849237377185E-06, dr=0, of=10336.7578204906

DONE!

Signature: 1a7eb58e9fc8c2e9fa6768c9c4608e7c

RMS of the difference of spectra: -118.030933852178dB

DF Metric (step=400ms, overlap=0%):

Median=-37.5dB

Max=-24.1dB Min=-71.2dB

1% > -52.94dB

10% > -45.06dB

25% > -41.86dB

50% > -37.5dB

75% > -34.15dB

90% > -30.88dB

99% > -25.29dB

Linearity 17.7bits @ 0.5dB error

Let's compare the DSD version of track 5 against the 24/44.1 kHz PCM version. @pkane 's DeltaWave converts DSD to PCM using high precision mathematics, but it's up to the user to choose the resampling frequency and cut off frequency. I chose 88 kHz, 50 kHz cut off, and 5kHz bandwidth.

As expected, we get a very good fit

The integrated loudness matches overall

The aligned spectrum looks good but there is a difference well above the audible threshold but note that the SACD (blue) rolls off faster (?)

What I don't get is that the PK Metric is -45.8 dBr, which means that the two recordings should be different.

The jazz songs were recorded at Conway Studios which features nice Genelec monitors

and the final mixes were done with Neumann KH monitors

### Inside Track: La La Land

Achieving a naturalistic sound on a Hollywood film set isn’t easy — but was central to director Damien Chazelle’s vision for La La Land.

www.soundonsound.com

Justin Hurwitz,the composer of La La Land also famously uses JBL 3 series speakers.

If you're interested in the soundtrack, you'll see that the "high-resolution" version on streaming is 24-bit/44.1 kHz. If you want physical media, the only high resolution option is a SACD released in 2023!

Reference: 05 - Herman's Habit.dsf[L] 9848832 samples 88200Hz 24bits, mono, MD5=00

Comparison: 05. Justin Hurwitz - Hermans Habit.flac[L] 4923912 samples 44100Hz 24bits, stereo, MD5=00

Settings:

Gain:True, Remove DC:True

Non-linear Gain EQ:False Non-linear Phase EQ: False

EQ FFT Size:65536, EQ Frequency Cut: 0Hz - 0Hz, EQ Threshold: -500dB

Correct Non-linearity: False

Correct Drift:True, Precision:30, Subsample Align:True

Non-Linear drift Correction:False

Upsample:False, Window:Kaiser

Spectrum Window:Kaiser, Spectrum Size:32768

Spectrogram Window:Hann, Spectrogram Size:4096, Spectrogram Steps:2048

Filter Type:FIR, window:Kaiser, taps:262144, minimum phase=False

Dither:False bits=0

Trim Silence:True

Enable Simple Waveform Measurement: False

Resampled Reference to 44100Hz

Discarding Reference: Start=0s, End=0s

Discarding Comparison: Start=0s, End=0s

Initial peak values Reference: -0.013dB Comparison: 0dB

Initial RMS values Reference: -14.929dB Comparison: -14.928dB

Null Depth=13.472dB

Trimming 273 samples at start and 1 samples at the end that are below -90.31dB level

X-Correlation offset: 10337 samples

Trimming 1 samples at start and 0 samples at the end that are below -90.31dB level

Drift computation quality, #1: Excellent (0μs)

Trimmed 423 samples ( 9.591837ms) front, 0 samples ( 0.00ms end)

Final peak values Reference: -0.013dB Comparison: 0.154dB

Final RMS values Reference: -14.919dB Comparison: -14.918dB

Gain= -0.0006dB (0.9999x) DC=0 Phase offset=234.393601ms (10336.758 samples)

Difference (rms) = -47.03dB [-45.85dBA]

Correlated Null Depth=66.72dB [63.92dBA]

Clock drift: 0 ppm

Files are NOT a bit-perfect match (match=1.75%) at 16 bits

Files are NOT a bit-perfect match (match=0.01%) at 24 bits

Files match @ 49.5625% when reduced to 8.75 bits

---- Phase difference (full bandwidth): 3.33510941694337°

0-10kHz: 4.92°

0-20kHz: 3.50°

0-24kHz: 3.34°

Timing error (rms jitter): 2.8μs

PK Metric (step=400ms, overlap=50%):

RMS=-45.8dBr

Median=-46.6

Max=-40.2

99%: -40.98

75%: -44.58

50%: -46.61

25%: -49.0

1%: -60.19

gn=1.00007013560789, dc=-1.46849237377185E-06, dr=0, of=10336.7578204906

DONE!

Signature: 1a7eb58e9fc8c2e9fa6768c9c4608e7c

RMS of the difference of spectra: -118.030933852178dB

DF Metric (step=400ms, overlap=0%):

Median=-37.5dB

Max=-24.1dB Min=-71.2dB

1% > -52.94dB

10% > -45.06dB

25% > -41.86dB

50% > -37.5dB

75% > -34.15dB

90% > -30.88dB

99% > -25.29dB

Linearity 17.7bits @ 0.5dB error

Let's compare the DSD version of track 5 against the 24/44.1 kHz PCM version. @pkane 's DeltaWave converts DSD to PCM using high precision mathematics, but it's up to the user to choose the resampling frequency and cut off frequency. I chose 88 kHz, 50 kHz cut off, and 5kHz bandwidth.

As expected, we get a very good fit

The integrated loudness matches overall

The aligned spectrum looks good but there is a difference well above the audible threshold but note that the SACD (blue) rolls off faster (?)

What I don't get is that the PK Metric is -45.8 dBr, which means that the two recordings should be different.

Last edited: