you forget to talk about the need to have impedances equal to its inputs under penalty of a large OFFSET unless you add link capacitors, which is not an "acceptable solution" in itself.
An OPAMP of the FET or JFET type will not have this problem of OFFSET and their noise level (eN) has become so efficient nowadays that we are almost wondering why bipolar OPAMPS are still produced...
Apart from a factor of ca. 3x regarding input noise voltage density en above 1kHz, the bipolar input stage has the advantage of a usually far lower 1/f corner of en. For the OPA1612 it's typ. ca. 20Hz and for the OPA1656 it's typ. ca. 1kHz.
The corner frequency strongly depends on how "clean" the technology process is when the batch is processed.
In order to get into the 1nV/sqrt(Hz) region, you still have to employ large discrete JFETs - it's design effort and they aren't cheap.
Offset voltage of a bipolar input stage still typically is somewhat smaller although JFET stages have made huge progress in the past.
Bias current (I*R drop at source resistance): The OPA1612 is "bias current compensated" (you can see this in the datasheet, the bias current is specified +/- and it is about the same size as the offset current specification).
Even assuming the max. specification of 250nA together with ca. 250 Ohm source resistance (390 Ohm of ES9039q2m || ca. 750 Ohm feedback resistance in the I/V stage), this ends up at ca. 63uV. Not much compared to the
typical 100uV for the offset voltage of the OPA1612.
And even if there would be significant bias current (not compensated, so same sign for both branches of the I/V stage, this bias-current-induced-offset-voltage would show up in the non-inverting and the inverting branch with similar magnitude. It's common-mode and gets subtracted by the subsequent stage (either in the amplifier connected or in the stage that feeds the RCA output).
SMSL uses a topology where they divide the AVCC that supplies the analog stage of the ES9039q2m and trim this divider (potentiometer) to obtain <1mV offset at the individual pins of the differential outputs.
There is a really clever topology in the ES9039pro datasheet that has a servo (no need to trim manually) and that removes most of the common-mode "dirt" present on the DAC chip outputs.
I would really love to see this topology implemented. I suspect the D50iii might have this topology, but that is mostly speculation until someone reverse-engineers it.