Turn off one speaker and listen.
One speaker of the stereo pair, the 5.1, the 7.1, or the Atmos setup? ;-) I find it amusing that you speak of lack of experience with listening tests.
my collection of test tracks is from what sounds excellent on both stereo headphone and speaker listening...Here is the key thing: on any well designed speaker, all 200 to 300 tracks sound wonderful in mono!
That is hinting to a selection of tracks which are not capable of revealing flaws other than massive ones, and which do not allow a proper judgement of tonality, transparency and all imaging-related aspects.
I can't think of a single scenario where it is the aim of a mix/mastering engineering to produce poor tonality in one speaker as to then compensate with the other.
I can. One is called runtime stereophony and vastly popular among classical recordings of the last 25 years. For example recordings which are based on A/B main mic arrangements, should not sound wonderful in mono, particularly not downmixed to mono. Their whole stereo effect relies on masking by first wavefront preference. If you make that audible in mono, it usually sounds like a garage, not to speak of comb-filtering and cancellation effects.
Another main phenomenon of poor tonality in ´stereo>mono´ manipulated tracks is rooted in the different angles from which the sound is coming in, how it is affected by our own HRTF and how HRTF interaural crosstalk components are masked in stereo (but not in mono). You can easily do an experiment yourself by switching from true center to phantom center in a surround setup.
As mentioned, I am not against mono listening tests and for some very specific purpose it might be useful, as described by Floyd Toole, but then please use material being mixed and mastered in mono.
Then you have to understand what makes a good test track:
Thanks, I have been auditioning and selecting test tracks for various purposes of listening tests on a professional base. It is impossible to calculate but if I count the usual number of tracks being auditioned during the process of making a selection, and how often that happened in my previous jobs, I have auditioned something like several hundred thousands of tracks.
I mostly use female vocals that bring this to the forefront and per above, sound very nice on stereo system as well as single speakers.
In my understanding, test tracks are not meant to ´sound nice on every setup´, but to either reveal flaws, allow distinction of differences or represent a vast variety of mixing styles, tonality differences, difficulties for speakers, or a different mixture of recording techniques.
If you use close-mic´ed solo female vocals like audiophile jazz and folk mostly, you are most probably going to miss aspects which are revealed only for example by a male ´chorus latens´ or a 16-voice mixed choir, a contralto, some problematic 1950s opera live recordings, some Autotune-distorted R´n´B singing, heavy-metal-style falsetto or some guttural growl, and vice versa.
I have test tracks for specific things such as sub-bass performance. My chosen track has this spectrum in both channels so I don't need at all to play in stereo.
Testing sub-bass is definitely possible in mono, and I would not overly worry whether the material is made for that or not. It is more important to have a vast selection of tracks representing different dominating frequency bands and bass transient behavior. So I do not think a number of 300 tracks for bass and bass impulse judgement alone, is sufficient.
Stuff you talk about you is positioning and room dependent. Conveying that to others is useless as no one is going to be able to replicate your setting.
It is not about replicating a setting but about defining properties of the speakers and matching them with the room so everyone would be able to replicate the test results. That is doable in most of cases. If problems occur, for example with judging localization and ambience, go for a nearfield setup, get some constant directivity speakers as a reference or optimize the room.
Further, you don't know those elements for when the music was produced. So you have no idea if what you are determining is right anyway.
Yes, I do. At least for all the recordings I have been involved in the recording or production process and I know the concert hall really well. I have a selection from those tracks for listening tests as well, and I know from many listening tests with recording engineers that everyone has his or her favorites from own oeuvre. As mentioned, I prefer to use those recordings as a reference which I had the chance to witness both from the auditorium and the local broadcast control room.
A dipole speaker surely generates spatial effects that are not real and were never heard in the studio.
With my method and the right track to test depth-of-field, it would take me 10 seconds to identify this. I doubt that it is possible to test that in mono, and as member
@gnarly has mentioned, there are situations in which reverb gets fully masked. Mono testing is one of them.
I am not a fan of dipole speakers, but having read a lot of papers by the late Siegfried Linkwitz, sharing a lot of common goals with him and his successors, I fail to understand why dipole speakers are said to sound ´not real´, but other speakers creating much weirder reverb and tonality effects due to kinked directivity, are not labelled as such. But maybe that is worth a separated thread.
Bottom line is simple: you need to go and perform single speaker testing.
I have done this numerous times, starting with experiments early in my career when reviewing codecs as a successor to MP3 was a big thing. The interesting aspect is, this idea of mono testing was never implemented, as the whole idea of testing and optimizing lossy codecs was found to work much better with headphones.
I do not see any point in testing speakers in mono, except for very specific questions (resonances, distortion, bass quality and alike). So far you did not bring up a valid point why it should be superior in a preference test. I have been explaining numerous reasons why is might lead to misjudgments and does not allow to judge important aspects such as imaging and ambience.
Please stop arguing from what you think happens and spend time learning the "new" science.
´Learning the new science´? Which ´old science´ are you planning to replace? And did I understand it correctly that the idea of mono speaker tests got promoted by Dr. Toole as early as in 1985, never being properly verified by any other scientific institution nor being adopted by anyone in the pro audio, recording community and researcher alike?
What is new about it? And what is scientific about a general technology that was replaced by a superior one as early as 1955, and not without good reason?
Despite its regrettable limitations (I personally prefer surround when it comes to ambience), I am still surprised how good stereo can sound and how well some pioneering recording engineers were using it from the very beginning. Occasionally, I implement the first stereo recording I regard to being well-executed, in a selection of listening test tracks, and people are usually amazed by the high standard of this one. Give it a try in stereo, as it creates good mood as a side effect:
We do mono testing because it works far better and is far cheaper.
Can tell you from quite some experience with Dolby Atmos and Auro3D setups and listening tests, that the difference in cost and effort between mono and stereo is negligible.
And I still did not read a single convincing reason
why mono is better in your eyes. I have layed out numerous explanations what it cannot achieve and why it is prone to producing misjugdments.