Long time HF user here. Decided to start this discussion on ASR since afaik you are more welcoming to science-related discussions.
The proximity effect in audio is an increase in bass or low frequency response when a sound source is close to a microphone. When you speak closely into a microphone, your voice sounds more chesty. When you speak far into a microphone, your voice sounds more "normal". So far so good. Our eardrums also have proximity effects, because they function much like a microphone. When someone speaks to you at a distance, they sound like what they sound like. When someone speaks next to you with their lips almost touching your outer ears, they sound more chesty and intimate. Whispers sound especially different when heard close vs far.
Our brain is, unfortunately, adjusted to the proximity effect. When someone you know speaks to you at an extremely close distance, you expect them to sound more bassy. If you've only heard someone at an extremely close distance, you'd expect their "real voice", the voice if they were to speak to you at a normal distance, to be much less bassy than what you've heard, even if you've only heard their "close voice".
And this is why all IEM and headphone targets that intend to "mimic a 2-channel system in a room" are problematic. When we know the transducer is playing music next to our ears (because we have physical contact with the headphones), our brain expect the music to sound a certain way, a way that is different from what we know this music should sound in a good 2-channel system. We'd expect to not feel the bass through the body, but hear it in the ears; we'd expect every instrument to sound drastically closer. Having never heard the music in a 2-channel system likely won't help, because our brain is used to hearing things at different distances and perceiving them at said distance. Having never heard *any* music in a 2-channel system probably helps, or if one lived their life in hearing aids, but then we are talking about a different mindset of musical enjoyment entirely.
But why can't we just apply a known downward-sloping proximity effect FR on top of our existing headphone targets, problem solved? It is more complicated, actually. Every proximity effect I've mentioned so far is mono: a microphone picking up the sound of an instrument, a person whispering into *one of* your ears. The "stereo" proximity effect, by my estimation is a less steep downward-slope than mono. Imagine a person speaking in front of you. Now imagine a person speaking in front of the tip of your nose. They'd sound chestier, but less so than if they were speaking into only one of your ears. By how much? We don't know. In fact afaik there isn't a concept of "stereo proximity effect" in acoustic engineering.
Ok, then do a research on this topic and apply this "stereo proximity effect" on top of our existing headphone targets, problem solved? It is more complicated, again. Remember how the images of instruments are always "in your head" when listening to headphones? The distance of things in music reproduced by headphones are neither "in front of you, closer than the tip of your nose" nor "next to you, closer than touching your outer ears". The distance, far or near, is literally "in your head". I'd guess that our brain is initially quite confused by this too. So what does it say about the proximity effect? I don't know. Could be somewhere between the stereo proximity effect and the mono proximity effect. Could be something totally different.
The "tilted target FR curves" we are seeing these days are a great start. I hope in the future we can use tilt for specific frequencies and/or tilt with a curvature; that'll enable us to mimic proximity effect to a large degree. If you just want to enjoy a good sound, inductive science will certainly get you to where you need the target to be. Who is to say our brain will like a proximity-effect-adjust in-room target anyway? Once you hear it, you'll know it's right! But for the sake of a deductive science, a lot more work needs to be done to understand proximity effect of our eardrum and the psychoacoustics of perceived "distances."
The proximity effect in audio is an increase in bass or low frequency response when a sound source is close to a microphone. When you speak closely into a microphone, your voice sounds more chesty. When you speak far into a microphone, your voice sounds more "normal". So far so good. Our eardrums also have proximity effects, because they function much like a microphone. When someone speaks to you at a distance, they sound like what they sound like. When someone speaks next to you with their lips almost touching your outer ears, they sound more chesty and intimate. Whispers sound especially different when heard close vs far.
Our brain is, unfortunately, adjusted to the proximity effect. When someone you know speaks to you at an extremely close distance, you expect them to sound more bassy. If you've only heard someone at an extremely close distance, you'd expect their "real voice", the voice if they were to speak to you at a normal distance, to be much less bassy than what you've heard, even if you've only heard their "close voice".
And this is why all IEM and headphone targets that intend to "mimic a 2-channel system in a room" are problematic. When we know the transducer is playing music next to our ears (because we have physical contact with the headphones), our brain expect the music to sound a certain way, a way that is different from what we know this music should sound in a good 2-channel system. We'd expect to not feel the bass through the body, but hear it in the ears; we'd expect every instrument to sound drastically closer. Having never heard the music in a 2-channel system likely won't help, because our brain is used to hearing things at different distances and perceiving them at said distance. Having never heard *any* music in a 2-channel system probably helps, or if one lived their life in hearing aids, but then we are talking about a different mindset of musical enjoyment entirely.
But why can't we just apply a known downward-sloping proximity effect FR on top of our existing headphone targets, problem solved? It is more complicated, actually. Every proximity effect I've mentioned so far is mono: a microphone picking up the sound of an instrument, a person whispering into *one of* your ears. The "stereo" proximity effect, by my estimation is a less steep downward-slope than mono. Imagine a person speaking in front of you. Now imagine a person speaking in front of the tip of your nose. They'd sound chestier, but less so than if they were speaking into only one of your ears. By how much? We don't know. In fact afaik there isn't a concept of "stereo proximity effect" in acoustic engineering.
Ok, then do a research on this topic and apply this "stereo proximity effect" on top of our existing headphone targets, problem solved? It is more complicated, again. Remember how the images of instruments are always "in your head" when listening to headphones? The distance of things in music reproduced by headphones are neither "in front of you, closer than the tip of your nose" nor "next to you, closer than touching your outer ears". The distance, far or near, is literally "in your head". I'd guess that our brain is initially quite confused by this too. So what does it say about the proximity effect? I don't know. Could be somewhere between the stereo proximity effect and the mono proximity effect. Could be something totally different.
The "tilted target FR curves" we are seeing these days are a great start. I hope in the future we can use tilt for specific frequencies and/or tilt with a curvature; that'll enable us to mimic proximity effect to a large degree. If you just want to enjoy a good sound, inductive science will certainly get you to where you need the target to be. Who is to say our brain will like a proximity-effect-adjust in-room target anyway? Once you hear it, you'll know it's right! But for the sake of a deductive science, a lot more work needs to be done to understand proximity effect of our eardrum and the psychoacoustics of perceived "distances."
Last edited: