I would argue that these are not independent events as long as the test setup hasn’t changed. Would you consider a one minute break between trial groups something that would make this two events? Five minutes? An hour?
If you're interested in the probability of passing 3 independent tests of 7 guesses each you should be multiplying together P(x >= 4), not P(X == 5).Great example. It intuitively explains why 10 of 14 is easier to get through luck, than 5 of 7 twice. If you pass 5 of 7 twice, your aggregate score can't be lower than 10 of 14. But you can score 10 of 14 without passing 5 of 7 twice (you could get 4 of 7 then 6 of 7, or 3 of 7 then 7 of 7).
Yet this suggests that when computing the overall confidence of a series of tests, we should use Bayes rule, because we know the subject passed each of the shorter tests. He didn't fail some then make it up by doing better on others.
I'm not so sure... When flipping a coin, must you wait in between flips or change the setup in order for the flips to be independent events? Of course not.I would argue that these are not independent events as long as the test setup hasn’t changed. Would you consider a one minute break between trial groups something that would make this two events? Five minutes? An hour?
If there is no audible difference, each guess is a coin flip. That's what we are calculating here: "how likely is this result by chance, assuming the null hypothesis (no audible difference) is true" .I would argue that these are not independent events as long as the test setup hasn’t changed. Would you consider a one minute break between trial groups something that would make this two events? Five minutes? An hour?
In that sense, yes. In the sense of the original question (essentially, are these two separate experiments), no.If there is no audible difference, each guess is a coin flip. That's what we are calculating here: "how likely is this result by chance, assuming the null hypothesis (no audible difference) is true" .
So yes, they are independent events.
Coming back to this one. Just ran a simple minded spreadsheet.Question for the stats folks: consider someone who ABX tests and gets 5 of 7. This is 77.3% confidence (22.7% chance to do that well by guessing). He does this in 3 separate/independent tests, on different days. We could aggregate this as 3*5 = 15 of 3*7 = 21 and compute 15 of 21 is 96.1% confidence (3.9% chance to do that well by guessing).
But we could tackle this differently: There are 3 different tests, each independent and having 23% chance to pass by guessing. If you pass all three, the probability you're guessing should be .23 * .23 * .23 = 1.2%. This would be 98.8% confident.
These numbers are different so they can't both be right. Which is correct?
I'm not so sure... When flipping a coin, must you wait in between flips or change the setup in order for the flips to be independent events? Of course not.
I think the concept of independence is critical to this discussion. But I don't think independence relies on a time interval or changing the test setup.
... So as I've said, both predictions are correct, because they aren't the same prediction. 15 of 21 is one prediction, and 15 of 21 made up of three 5 of 7 results is more specific and less common. There is really no disagreement.
Right. But this restriction represents our knowledge of the actual test results. Suppose we know that somebody scored 5 of 7 on 2 separate tests. Then using 10 of 14 includes other outcomes (like 4 and 6, 3 and 7) that we know did not happen.... This additional restriction lowers the probability of success.
Yes so knowing the results, 5 of 7 on two separate tests, is a subset of all the possible ways to get 10 of 14.Right. But this restriction represents our knowledge of the actual test results. Suppose we know that somebody scored 5 of 7 on 2 separate tests. Then using 10 of 14 includes other outcomes (like 4 and 6, 3 and 7) that we know did not happen.
I just used a random number generator to get the number 681354325677. The chance of getting this exact number is 10^-12 and we know nothing else happened, so I have witnessed a miracle.Right. But this restriction represents our knowledge of the actual test results. Suppose we know that somebody scored 5 of 7 on 2 separate tests. Then using 10 of 14 includes other outcomes (like 4 and 6, 3 and 7) that we know did not happen.
Right. But this restriction represents our knowledge of the actual test results. Suppose we know that somebody scored 5 of 7 on 2 separate tests. Then using 10 of 14 includes other outcomes (like 4 and 6, 3 and 7) that we know did not happen.
There are 3 different tests, each independent and having 23% chance to pass by guessing. If you pass all three, the probability you're guessing should be .23 * .23 * .23 = 1.2%.
Wrong user.@MZKM actually, I think I've worked out why we're at cross purposes.
You wrote:
Note the words I've highlighted in bold. The probability that you passed each of the three tests by guessing is not the same thing (qualitatively or quantitatively) as the probability that you got 15 out of 21 in total and were guessing.
Hopefully this Venn diagram explains the subtle differences fairly well:
View attachment 94158
EDIT: realise the shape and scale are a bit weird, but was working it out in MS paint as I went along, lol...
Wrong user.
Yep, of course it's not. They're clearly 2 different things. One is more specific than the other, and thus lower probability / higher confidence.@MRC01 actually, I think I've worked out why we're at cross purposes. ... ...
It can also make you sick.No, not really.
What it can do as ameliorate effects of the illness that have a large subjective component, i.e., 'feelings' -- pain, stress, fatigue, nausea.
Yep, of course it's not. They're clearly 2 different things. One is more specific than the other, and thus lower probability / higher confidence.
The question I'm struggling with is: which of these 2 approaches represents the correct probabilities?
I can think of rational explanations for each, yet I can also think of conundrums that each leads to.
Alright I think we are in agreement now. But you still have questions.Yep, of course it's not. They're clearly 2 different things. One is more specific than the other, and thus lower probability / higher confidence.
The question I'm struggling with is: which of these 2 approaches represents the correct probabilities?
I can think of rational explanations for each, yet I can also think of conundrums that each leads to.