at 14:49 sec stand as text in this video (from your link)
then at 30:54 stand as text
I understand from 30:54 that the time relationship detect work from 500 hz- 2 khz but in diffrent ways. he use the word envelope. maybe he mean with that the ears set the start point of the phase compare between left and right ear to transient envelope highest point on frequency over 500 hz. a envelope is this
https://www.teachmeaudio.com/recording/sound-reproduction/sound-envelopes
and for frequency upto 500 hz there is the level of the wave cycle use. but i guess it use highest level of the wave cycle. because the beginning of the wave cycle have very few level it is not easy to detect.
EDIT: Also should notice even when level relationship work at this frequency too, that because of less level diffrence on low volumes there need hear louder. this also show my experience. the sound of mtm and eris i think good on 10 db less level as the sound from Kali. so fast midrangve protect also ears. in the video he tell at 19:30 sec that the detectors are uniform from erb and then he speak about hearing damage.that many young people have damage ears because they hear often loud. this stand not in the text but you hear when you hear what he say.