@Blumlein 88 , thanks for your comment on the Deruty-Pachet (2015)!
Yes, you are right that there may be hidden biases in the selection of empirical data. This is a researcher's worst nightmare; that his data is faulty and full of biases.
Let me now try and comment your remarks as well as having other ASR readers in mind when I comment.
The authors have a discussion (chapter 4.3) of their findings (on year being more important than genre), and here they write that "This may bring the suspicion that dynamics are only dependent on the trends followed by the most represented genres, such as the subgenres of rock represented in Figure 3, but independent from the trends followed by most other genres, in which case our
conclusion would not stand" (my underlining).
They try and show that their results are year related controlling for genres. And that analysis may implicitly give us an insight into what would have happened with their results if they controlled for total album releases per year. The bars with 5-95 percentiles in figure 1 may also be an indication of how a control for total album releases per year would have fared.
See also this ISMIR poster that accompanies their 2015 article:
http://www.emmanuelderuty.com/pages/publications/2015_ISMIR_poster.pdf
FWIW, the entire dataset of Deruty-Pachet (2015) can be found here:
http://emmanuelderuty.com/pages/dynamics/Corpus7200/Values.xlsx
But you are right: It would have been great if we would have a complete dataset of all recordings ever made since the 1950s, 1960s.
One interesting finding in their paper, is their note on micro vs macro dynamic:
"A notable exception lies in macrodynamics as measured by the EBU3342 Loudness Range, which are more independent from both genre and year of release. In other words, dynamic range in the musical sense (pianissimo tofortissimo) is only marginally dependent on either mainstream genre or trend (...) As an exception, macrodynamics, which have not been significantly influenced by the loudness war, appear to increase since the loudness war’s peak, and are currently reaching very high values".
In other words, the debate on loudness has been little nuanced if it doesn't make a distinction between micro and macro dynamics.
Please note that the paper was presented at International Society for Music Information Retrieval, ISMIR:
http://www.ismir.net/society.php
This perspective, big data in audio, is highly interesting because it replaces opinion and anecdotes with fact. And this perspective may also draw more computer people into audio science. Needless to say, habit and convention will predict that old school audio people are a bit skeptical towards this new breed of audio scientists that are more pattern oriented than case and anecdote oriented.
Many people, including people at ASR, think loudness is a battle lost. So they keep on fighting as guerilla fighters. However, other people, that are in high regard, are of a totally different opinion. Bob Katz, thinks (2013) the war is over, due to normalization features in distributors like iTunes (Katz' original blog post is no longer available):
https://www.soundonsound.com/techniques/end-loudness-war
Lastly, just a couple of words on the authors, Deruty and Pachet. Deruty is a frequent publisher of scientific audio articles:
http://emmanuelderuty.com
Pachet is the better-known name of the two:
https://en.wikipedia.org/wiki/François_Pachet
He is, among other things, a fellow of The European Association for Artificial Intelligence. Which may be an indication of his ability to deal with datasets.