respice finem
Major Contributor
- Joined
- Feb 1, 2021
- Messages
- 3,398
- Likes
- 6,465
Recording, esp. microphones choice and placement is an Art.
The concept of a microphone pickup of any such instrument being "In phase" is hard to understand.
Loudspeaker transducers, individually over their operating frequency ranges, behave as minimum-phase devices.
Given that our hearing is, under certain conditions, sensitive to the shape of the waveform – not just its frequency content – and that frequency-dependent phase shifts from speakers and room acoustics can alter that shape, it seems fair to ask:Strong room resonances also behave like minimum-phase systems, so they too respond to matched equalization, but only at one point in a room. The room is not changed, but the sounds delivered through it are.
And can someone clarify for me what "timing" means in relation to group delay and phase? If both of those are below thresholds of audibility, does this make timing acceptable?
How do you alter the shape of a waveform without changing its frequency content?Given that our hearing is, under certain conditions, sensitive to the shape of the waveform – not just its frequency content – and that frequency-dependent phase shifts from speakers and room acoustics can alter that shape, it seems fair to ask.
If you only alter the phases - in any way - it will not change the power spectrum.How do you alter the shape of a waveform without changing its frequency content?
I always show this example:How do you alter the shape of a waveform without changing its frequency content?

I said transducers, not combined transducers and enclosures. Above low bass frequencies enclosures are well-damped and sealed - simple acoustical systems with a single low frequency resonance that, if possible, is designed to be below the used bandwidth. Woofers and subwoofers are the issue.What about bandpass subwoofers of higher order?
Bear in mind that a large cascaded set of allpass filters can very closely approximate a delay. Again, the only phase shift that matters is the phase shift AFTER the delay is removed. In at least one of those famous papers, the allpass filter set very closely matched a delay. Offhand I don't recall which one.This is indeed the case and was tested by David Clark in his AES paper: Measuring Audible Effects of Time Delays in Listening Tests
Here are his conclusions:
"7 SUMMARY AND CONCLUSIONS
In this study a cascaded all-pass circuit and three full-band dual path systems were evaluated in a realistic listening environment. Two speaker mono was considered superior to the one speaker, one path mono. A reflection from a vertical surface was barely audible but a horizontal reflector were more audible. An electronic delay comb filter was highly audible and annoying. The all-pass filter was inaudible. With careful interpretation e TOS analyzer wee able to show probable cause for the audibility differences. It ia felt that elementary "two eared processing would give even closer correlation."
Actually Mr Toole clearly states in his document that listeners are able to listen through the room (or simlar wording). So we are capable of somehow "substract" the room from the speaker sound.
^Thanks^ and I got it…If you read what I have actually said, many times, it comes down to a few simple observations. Loudspeaker transducers, individually over their operating frequency ranges, behave as minimum-phase devices. That means that an anechoic flat frequency response is a reliable indicator that there are no resonances, no associated phase shift, and no ringing. This is a good start, it seems to me, but evidence of that behaviour is only available in anechoic or equivalent measurements, not room curves. Any sound, impulses included, will be fairly treated by such devices - over their operating frequency ranges. Things can go astray in the crossover regions, which is where serious measurements, including phase, are needed to ensure a smooth summation in those regions. So, the conclusion is that frequency response is not "uber alles" but if it is wrong, nothing else may matter as much as one might think.
I got it.If moderate resonances should exist in such minimum-phase transducers, they can be addressed by matched equalization based on anechoic or equivalent data, not room curves. This is why Amir's data is so useful. A loudspeaker that is "almost" really good, can be improved by equalization, especially those with well-behaved directivity.
For first arrival you can listen through the room very effectively. Short-term nonlinearities in hearing suppress early reflections.
I dunno.Strong room resonances also behave like minimum-phase systems, so they too respond to matched equalization, but only at one point in a room. The room is not changed, but the sounds delivered through it are.
And can someone clarify for me what "timing" means in relation to group delay and phase? If both of those are below thresholds of audibility, does this make timing acceptable?
Of course, you're right, but my direct sound must be in phase. An instrument, a violin or a cello, playing in my room is in phase
I asked the same question earlier in the thread, but got no response.Somebody might like to explain where one should place a microphone relative to a complex musical instrument - like violin, cello, bass, piano, etc. etc.- where the "sound" is like what one hears in a decent auditorium/concert hall. If that is the reference. Such sources radiate extremely complex sound patterns from various parts of the instrument, all acoustically interfering with each other, altering amplitude and phase response, when they arrive at the mic. The concept of a microphone pickup of any such instrument being "In phase" is hard to understand. It is what it is, and neither amplitude or phase are "standardized" for any musical instrument, before we even consider reflections from nearby surfaces at the point of capture - like a floor. There is a room at the beginning of the recording process too.
Care to elaborate?
This argument is exactly the same as the argument that super-high sampling rates and bit depths eg 32/768 are an extremely good idea because..."getting the input and output signals to exactly match" is "technical excellence" and hence 24/48 will never do. And any arguments that 24/48 is completely transparent to the ear will be dismissed as "yada yada and more yada"....I want that acoustic energy to exactly match the electrical signal given to the speaker.
Whatever the signal is...phase corrupted, phase perfect, whatever any of those nebulous terms even mean.....it doesn't matter one iota imo.
A truly technically excellent speaker will match signal completely....mag and phase, impulse et al.
The only arguments against such technical excellence that I see, are the time -is we can't hear phase, small-rooms mask everything anyway, source material is screwed up to begin with....yada yada and more yada.
Well, I say who knows...maybe the reason for all that 'yada why bother' is that we never had the ability to make such phase and time aligned technically excellent speakers before. Muti-way active DSP has truly changed what speakers are capable of.
I say, why not get on board with the idea we can at least make one component in the whole "why bother/can't hear" circle of confusion quagmire ....at least make the speakers.... NOT be part of the problem.
Technical excellence means both mag and phase matter. To the degree speaker phase eventually proves to be audible/inaudible remains to be seen.
Anyone saying it's already proven, is operating under a very subjective state of mind, imnsho.
I said transducers, not combined transducers and enclosures. Above low bass frequencies enclosures are well-damped and sealed - simple acoustical systems with a single low frequency resonance that, if possible, is designed to be below the used bandwidth. Woofers and subwoofers are the issue.
Sealed woofers are simple single resonance systems. Bass reflex adds one superimposed Helmholtz resonance and the combination amplitude and time performance is predictable within limits, as discussed below. More elaborate systems add more coupled Helmholtz resonances.
At sound levels sufficiently high to generate turbulence the effective mass is reduced and the result is a different system tuning for every sound level, and this is not equalizable. The resonance frequency varies with sound level, but one supposes, and experience shows, that over a range of useful sound levels these are not audible problems.
Sealed vs. reflex arguments tend to arise with bookshelf loudspeakers, where the resonance is within the musical frequency range. But even these cease to be problems if crossed over (high pass and low pass) to a subwoofer, where, with good design the resonance can be so low that it is in the feeling rather than hearing frequency range.
The advantage of passive radiators is that they are truly moving masses, and any non-linearity is in the suspension of the diaphragm. Even so, at very high sound levels it will become non-linear. These are good alternatives, but cost more than ports.
I think it is reasonable to assume that one can expect significant non-ideal acoustical output from small multi-reflex systems that are designed to maximize acoustical output over a narrow bandwidth with steeply declining output at the high and low frequency extremes - meaning transient misbehaviour. But, let's face it, the intended audience for those products is assumed not to be a critical one. Right?
However I usually get stumbled up on step-response and transducers playing 180 degrees out of phase.
When the output of the speaker inverts the signals in frequency bands corresponding to the driver’s bands, then in a mathematical sense… if we difference the input (signal) with output (SPL) it seems like fidelity has been lost.

This “Step-1” to me looks like the tweeter is 0.3- 0.4 seconds ahead of the MR. So ~4” ahead, and not time coherent if it is a 2-way.
The “Step-2” has the tweeter in phase, but the MR is 180 out, and then the woofer is back to being “in phase”… I am assuming that this is a 3 way.So - what do you read from isolated time domain responses ?
View attachment 467996 View attachment 467997
This “Step-1” to me looks like the tweeter is 0.3- 0.4 seconds ahead of the MR. So ~4” ahead, and not time coherent if it is a 2-way.
That is not much information from the step responses.This “Step-1” to me looks like the tweeter is 0.3- 0.4 seconds ahead of the MR. So ~4” ahead, and not time coherent if it is a 2-way.
Or if it is a 3-way then the MR must be inverted… however I would expect it to be deeper if it was a 3-way.
The “Step-2” has the tweeter in phase, but the MR is 180 out, and then the woofer is back to being “in phase”… I am assuming that this is a 3 way.
Step response is IMHO rather useless for direct predictions on audible aspects.
Sorry I didn't answer you sooner. Are you aware of all the studies conducted by these disciplines, including physics, mathematics, engineering, and neurophysiology, and have you created speakers that respect or utilize this knowledge? Have you ever listened to speakers designed and built for time and phase coherence in controlled systems? Just answer yes/no to these two questions.I asked the same question earlier in the thread, but got no response.
This seems like reasonable performance from a pair of 5 1/4" woofers in Revel's entry level floorstander, the F35. If one wants serious high level kick drum sound this little speaker will be stressed, but if high passed and supplemented by a subwoofer, things would be more satisfactory. The port would not be energized, and the system can play louder. Otherwise, according to Amir's and Erin's data, this is a respectable speaker at the price. As always there are choices, and knowledge and data help make them.This Revel model is a floorstander afaik. We see compression occurring in the 80-150Hz region as well as dynamic expansion around 50-60Hz. The latter is indicative of an increase in port fs with rising SPL. Imagine a kick drum sound with approx 50Hz fundamental and 100Hz first harmonic. The spectrum of its transient sound will shift in relative 1.5dB.
I would guess that the first is a 3 way with 1st order or no xover between low and mid drivers ( mid hi pass from a separate enclosure ) all drivers are in phase and I don't know about the tweeter slope.So - what do you read from isolated time domain responses ?
View attachment 467996 View attachment 467997