I've been playing with XTC for a long-long time, tried everything from a mattress to all kind of plugins.
Recently I've made a full sphere SOFA file of my personal HRTF and I thought I would simulate with a computer what are the theoretical limits of the different XTC methods. With the simulation I can get exact signals at the entrance of my ear canal without any room interaction, and evaluate how does this correspond to virtual and direct sources in different directions.
Every measurement below is using my personal HRIR simulation at the ear canal entrance and corrected with my diffuse field curve, so FLAT response IS NOT the desired response. It is more for comparison for different responses from different direction. I am using only the left ear, supposing that the HRIR is symmetrical, ( which is not), but it does not substantially affect the conclusions. I've processed the measurement signal with the XTC plugins then ran it through the SOFA convolution to simulate the summed response at the ear.
I compared the XTC solution playing L or R only signal with the perceived sound at the L ear with a real speaker at 60 deg, and playing mono signal in both channels with the sound of a real speaker at the center.
First the basic +-30 degree stereo triangle:
1. L 30deg: Left speaker heard at the left ear
2. R 30deg: Right speaker heard at the left ear
3. C 30deg: Center speaker heard at the left ear
4. L+R -6dB 30deg: Amplitude corrected virtual Center speaker heard at the left ear
Crosstalk cancellation from head shadowing is ~5db in the 600-8K range. in a room it is worse than that.
Here we can also see the famous BBC dip (green vs yellow) around 1.8k, because of the comb filtering from the 2 speakers producing the virtual center.
This is what is happening with a real Center speaker at the left ear when I turn my head left or right:
Basically the smooth change in FR tells us a little horizontal shift in the source position, No dramatic change especially in the position of the HF dips.
In case of a virtual Center speaker (L and R playing the same signal):
The BBC dip remains but shifts, below and above some considerable changes are coming in FR an in the HF both the position and volume of the dips are changing. These are causing tonality changes and localization confusion. It is getting messy, not something you can EQ out. And this is with keeping the center position, just turning your head, moving to the left or right is much worse.
Let's see the crosstalk cancellations. All the measurement below are made with the simulated speakers moved to the +- 10 deg position.
First the simplest one, a barrier between the speakers. For this measurement I simulated the barrier as progressively increasing damping above 200 Hz up to 15 dB at 20kHz.
Considerable damping between the ears, but how does this compare to a real speaker at +60, basically to a double wide soundstage compared to the stereo triangle. Crosstalk cancellation is usually promising wider soundstage.
Actually not that bad. The closer ear is not loud enough, because at 60 deg the left speaker would have a straight shot at the ear, while the real speaker at +10 degree is a little shadowed by the head. The other ear gets somewhat similar signal what would come from a real speaker on the other side at the +60 degree position. The HF peaks and valleys are different which means the virtual images created with the barrier are not too resilient to head rotation. This is the effect Mr. Glasgal wanted to overwhelm with the pan-ambio setup, where he put an other XTC speaker pair in the back to confuse the HF pinna cues. Center images have no problem because the speakers are basically in the center and playing non-processed signal.
Lets see Bacch (Bacch plugin, 5degree half-span angle, 0 center boost:
It shows the Bacch philosophy, pushing the XTC to as low as possible. It creates >15dB XTC in the 200-5000 Hz range. But it also shows what I always felt, the Bacch's lack of impact in the midbass. The separation effect is great, but the sound is very lightweight.
Compared to a real speaker at 60deg, it has the right amount of cancellation above 1k, but below 1kHz it is much more what we can ever hear in real life. But pumping that much energy in to XTC that low causes a cancellation level drop. I think they hope that the off axis response of the stereo dipole together with the room will boost this range to the proper level, but I've measured the same hole in my room with real speakers and microphones.
Same funny business is going on with the center image. Also, it is a bit too hot in the 1-4k range, otherwise would be ok.
Neutron Player:
I can't test the CTXMatrix plugin, as I am running on a Mac, but the Neutron is also a straightforward RACE implementation.
Effective XTC range 400-4k, ~10dB.
General tonality is not bad. Little less separation at the ear than the real speaker at 60 deg, but not horrible. Rolls off above 4k too on the speaker side, but too high on the other side, so in a real room it might not feel lacking in HF.
What is good is the center image, it tracks the real center speaker very well up 3k. Above that the important dips areas 8 and 10K are the right position, but the level is wrong. This is one of the reason why race was originally bandwidth limited the XTC effect to 400-4000Hz. This simulation Is full range, the peaks and dips in the HF can be reduced with that limiting, or the pan-ambio setup mentioned at the barrier setup.
Level diffs above 3k between L and C image are in the same direction, you can try to correct it using EQ.
Worth mentioning that Prof Angelo Farina's method of measuring in ear HRTF for the +-10 deg speaker position and calculating an amplitude limited inverse full stereo filter gives in trend very similar curves to the RACE.
Finally XtalkShaper:
Lots of separation in the 400-4000 range, but a bit jagged. But also there is a huge latency from the plugin.
.
It exceeds what we really hear.
Center is good, but HF rolloff starts early. There is a similar drop in the L only signal in the 4-8k range, so general EQ boost in this range can help with tonality.
All this seems a bit of gloom and doom at the first sight, but have a look at the next graph. It is Neutron with the head turned left and right, but the other XTCs behave similarly:
Smooth and gradual changes on the speaker side, preserved HF dips and still good XTC results on the other side. So if you can get good tonality with the XTC and EQ, it will retain good imaging and consistent tonality even with head turns and side movements too. Not like with the stereo triangle.
So that's it for today. Make your own conclusions. Mine was that XTC works as a part of the solution, which is to simulate a venue with multichannel convolution, and place the direct sound in this sound field with XTC (ambiophonics). Tonality comes from the (simulated) room as much as from the direct signal.