I will add that this is mostly down to average speaking distance, which directly impacts the ratio of direct to reflected sound in a major way. If you have a vocal mic tuned for use at 2" (5 cm) and use it at that kind of distance, of course it'll pick up a lot less ambience than a condenser at 8" (20 cm). The difference is going to be biggest in the low end as proximity effect is involved as well. Your average condenser will be tuned to be flat in far-field, so at short distances they'd get pretty wooly. (Conversely, you wouldn't want to be taking ambient recordings with a dynamic, as the lack of low end and uneven und fairly directional high end make them anemic and dull sounding.)
Condenser vocal mics also exist though, mainly for live stage use, and they handle much like their dynamic counterparts. Which is both a blessing and a curse as short distances make getting rid of plosives a lot more tricky. At a larger distance a bit of off-axis placement may well be sufficient, even without a dedicated pop screen in front of your mouth (good enough for ABBA, good enough for me).
If you (OP) are serious about this, look up tutorials on how to make a budget vocal booth (it's not necessarily hard or expensive) and go for a traditional LDC. Audio Technica AT2035, RØDE NT1 (not so much the -A which is the "radio voice" version) and Sennheiser MK4 (+ MKS4) would be typical "relatively neutral" choices with more or less slight treble tilt.
If you want more of a typical streaming setup instead, there's multiple ways of skinning this particular cat... LDCs, pencil condensers or dynamics can all be made to work decently in some fashion.