I do think that his research is relevant. The time domain has been played down and frequency response had become the de-facto criterion in many aspects, arguably, it is much easier to measure. Look at the loudspeaker tests on this site and others, phase errors and delays aren't visible.
Yet, what makes loudspeakers sound so different, even if they have all have flat frequency response?
My guesses: wide vs. narrow beam patterns / omnidirectionality, resonances/decays, diffraction on the bevel edges or on steps on the cabinet and as in this paper the step response / time+phase alignment of the drivers.
On this test you can easily score 10/10 for 1ms in (
https://www.audiocheck.net/blindtests_timing_2w.php?time=1) with a desktop speaker.
So you should be able to hear alignment differences between woofer and tweeter as in here:
https://audiojudgement.com/loudspeaker-step-response-measurement
1ms equals 42cm and time alignment of multiple drivers on a larger speaker is thus difficult. First of all, they probably would need to be coaxial to avoid seating position errors. Second, you will have comb filtering in the cross-over region(s).