Over the years I found that by far the most "enveloping" effect comes from the recording itself - at least in a home scenario where there's never heavy, long reverb unless your place is huge. Differences between room acoustics pale in comparison to the room on the actual recording.
The subjective "enveloping" is indeed phase related and much stronger on bass. I made us an illustrative "test signal":
app.box.com
That's a rather simple strings/pad patch using two "supersaw" oscillators that are detuned stacks of 7 sawtooth waveforms each. The detuning (dynamically played too via aftertouch on keyboard) alone gives a sense of space, even though the dry signal is mono. Then come stereo phaser, picking that up and widening it further (more phase modulation), and then drowned in stereo reverb making it even "worse".
You will notice the subjective "width" or "hugginess" or "enveloping", whatever you wanna call it, is noticeably more pronounced on the low notes from 0:50. The chaotic phase relations and beating of the whole signal chain is the same, I just perceive it as "more stereo" on bass.
If you're into this sort of thing like me, you'll definitely want stereo bass, either from two big speakers, or ideally two subwoofers. Mono bass is nice and danceable and vinyl and club compatible, but there's some "magic" about stereo bass that's hard to grasp and must have to do with how we perceive direction in that range.