To-the-best-of-my-knowledge / limit-of-my-understanding, a speaker driver is a dynamic transducer with the characteristics of a bandwidth-limited filter.
Within the pass-band, everything works quite well, with reasonably flat response - BOTH amplitude vs freq AND phase vs freq.
At the top end, the wavelength gets smaller and approaches the size of the driver, which starts beaming and breaking up.
At the bottom end, set by the system resonance, the acoustic suspension resistance dominates the inertia of the driver mass, and the acoustic output rolls off.
In this condition it transitions from being a dynamic transducer, to being a static transducer.
The resistance to motion is now in phase with the displacement, rather than with the acceleration, so there's a big phase shift there (when you drop below the pass band).
Therefore there's increased GD at low frequencies as mentioned above.
The tweeter (and midrange) will ideally work within their passband, so shouldn't have too much phase variation BUT when they hand over to another drive unit, there's an additional phase discontinuity, simply due to the crossover. A 3-way speaker has an additional x-over and phase delay. That's what I'm most focussed on, because of this:
Thanks to Mitch, here's an ideal step response for a loudspeaker with a finite bandwidth:
View attachment 522387
Most speakers get nowhere this. There's not much data around, but I found these phase and step measurements for the complex 3-way Neumann KH420:
View attachment 522388 View attachment 522389
I knew that electrostatics, like this Quad ESL-2804, had good time domain / step response, and I figured it was because of their light diaphragms:
View attachment 522390
However, single drive unit speakers like the Fujitsu Ten Eclipse TD712z and the Meyer Sound MM-4XP have similar step response, showing the effect of the crossover:
View attachment 522391 View attachment 522393
A German magazine called Sound and Recording used to take thorough studio monitor speaker measurements, including amplitude, phase and step response. I've got lots of links to them, but they seem to have gone, and been replaced by a magazine that only focusses on keyboards, and the test results have been lost. I saved a few useful charts, like the KH420 test above, and the Neumann KH150 (1) and KH120ii (2) below. These are digital active speakers with phase compensation that keeps the midrange flat, from around 200Hz upwards. The effect is to make the step response close to ideal, and similar to what Mitch achieves with Audiolense (3), or Trinnov with Optimiser (4).
View attachment 522394 View attachment 522395 View attachment 522396 View attachment 522397
This is what I'm after - a fairly modest level of phase correction in the midrange that will overcome what a crossover does to any two or three way speaker, and allow it to reproduce steps, squares and impulses properly. It sounds like the obvious way to do that is with a linear phase FIR filter - and accept the inevitable (hopefully modest) time delay.