Nothing

And I am actually willing to pay less for a 1-120 speaker. Same as my 2023 4K TV did cost less than my (way worse) 80s PAL TV.
So, I think you need to take your own premise a little more seriously and think it through. The analogy between audio tech and TV / computer tech only works for the digital stuff.
Half of the audio revolution you're talking about already happened in audio a long time ago. The advancements in silicon fabrication that brought faster GPUs and cheaper memory have also helped audio, so we have cheap DACs that can do 384KHz perfectly, etc.
Speakers rely on physical motion, they're a type of motor. Attempts to fundamentally improve on an old-fashioned voice coil have happened, but none has succeeded yet. A better analogy than computer for speakers is internal combustion engines. The engines (and speakers) of 2024 are much better than 1974, but not fundamentally different.
The problem with extending the frequency range either higher or lower is that it depends on physical motion.
...
But it does happen. Almost everywhere but audio.
Computer hardware/software is the poster child of "just make tech better". And to stay with the vision/TVs comparison: a 2024 TV can do xx more pixels than a 70s one, each on of them being xx times better. And it does that for xx less money.
...
That is why audio needs a "revolution". Of the same magnitude as moving from cathode-rays to OLEDs. And OLEDs also do PAL resolution much better than a PAL TV of the 70s. So no, there is notthing to give up in the 20-20 range. On the contrary, that range will be much improved too..
Yes, you will need a fundamentally new transducer technology, but this is not trivial by any means. We're already at the point of using beryllium for diaphragms. Maybe MEMS or highly computed DML panels will be it, who knows?
Guess we are kinda back to one of the initial questions I posted: what keeps audio from evolving like the other domains?
It's just simple physics. And I mean simple, that's why we're not seeing the same revolution in audio as we saw in digital technology.
Digital tech was built on the back of the quantum revolution in physics from the turn of the 20th century. New physics enabled new tech.
Audio is based on classical physics, normal macroscopic wave mechanics. There's nothing new there, nor will there be. If you want more air to move, you just have to move it. The energies and wavelengths in question are fixed.
Maybe there is a more efficient transducer technology out there, but at the end of the day, you simply have to push a lot of air around. For the same reason you can't have a car that runs on a 100w battery, you can't have a 10hz subwoofer in the palm of your hand.
By the same token, there's no easy way around beaming because it's caused by waves interfering with themselves in the air. You can't just have a normal tweeter go up to 50khz and have the dispersion characteristic be good, you need a physically smaller tweeter.
There's no easy way to produce louder, infrasonic bass because you just have to move large volumes of air. That's what low bass IS. It's exponentially more difficult (in terms of energy) to create bass the lower you go in frequency, no matter what. Unlike silicon circuits, there is no straightforward way to get more performance out of the same raw materials. You can't get 1000w more power handling out of the same amount of copper just by being really clever about it.
I hope this answers your question. Audio
has evolved exactly like the other domains (via the digital revolution) but when it comes to transducers, your answer is "physics".
I do think your revolution will happen eventually, though. At least to the extent physics allows it. However, I personally think it will happen from the bottom up. Transducer tech like MEMS or similar will be developed for low-cost / small-size applications and eventually scale up for full loudspeakers when the quality and economics make sense.