Hi Anita,
Glad to see that you're exploring some steps towards tightening up your protocol.
You've experimented with covering the cups with paper, this may have failed. I'm quite concerned about that. You've suggested that this may be because paper blocks some types of low energy radiation. Indeed it does. Light is one of the types of radiation that is severely attenuated by a sheet of paper. Although you've positioned yourself so that you can't see the inside rim of the cup this doesn't completely eliminate the possibility that you're picking up the cereal's influence on ambient light. If the cups are on top of a grill then I assume that they're close to a wall. That would make ambient light a concern.
You've suggested that your difficulties in this run might equally have been because of the larger number of potential targets, that would seem to be supported your feedback with procedure 2.
I also note that this first trial was before you hit upon the idea of wetting the active cereal. With regard to wetting the active cereal it might be best to wet all cereals to ensure that it is the bacillus that you're detecting rather than the moisture. That said, skeptics have a time honoured tradition of testing people who think they can detect water under unusual circumstances so that might equally be a paranormal claim. Wet it if that works for you, if it doesn't then you've got a "water dowsing" protocol, and couple of pillocks who think that invalidates your test. Their opinion doens't matter if this works out for you then you should be able to apply for the JREF on the basis of being able to tell wet lactobacillus from empty cups, cups with dry cereal, or cups with wet cereal, so long as there's no outward way of telling you're good for the million. However that's just my opinion which doesn't matter either the JREF would have the final say.
However back to that paper. I do think that a propperly controlled challenge should involve covers. It'd be strange if paper should have such an effect as you first reported noticing your sensitivity to this bacillus when it was obscured within a friends' stomach and later through the cardboard of a cereal box. Given your other attempts to improve your sensitivity, i.e. reducing to three cups and wetting the cereal it might be worth revisiting this issue and seeing if you can try again with the paper in place when it's just three cups and the target cereal is moist. If not then perhaps the cup can be elevated in a location where ambient light would be less of a concern – i.e not near a wall.
Let's look at the suggestion that more than three cups may have been overwhelming your ability. This would certainly be the case if your ability was predominantly based upon receiving feedback and working out how your assistant might place the cups next time.
Let me demonstrate.
Have a look at these replications of your initial test.
Experiment 1
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|A|C|A|B|A|C|A|B|A|C|A|B|C|B|A|B|C
Prediction|A|A|B|B|B|A|B|A|B|C|C|A|B|A|B|A|C|C
Hit|0|1|0|0|1|1|0|1|1|0|1|1|1|0|1|1|0|1
Total Hits|11|
Percentage|0.611111|
Signifcance|0.014434|1 in 69.2826776893141
Trend|0.054528|
Experiment 2
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|A|B|C|A|C|B|A|C|B|C|A|C|B|C|A|C|B
Prediction|B|A|B|A|B|C|B|C|C|B|C|A|B|B|A|A|B|B
Hit|1|1|1|0|0|1|1|0|1|1|1|1|0|1|0|1|0|1
Total Hits|12|
Percentage|0.666667|
Signifcance|0.003919|1 in 255.148967768236
Trend|0.051683|
Experiment 3
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|C|B|A|B|C|A|B|C|B|A|C|B|C|A|B|A|C|A
Prediction|A|B|A|B|C|B|B|C|B|A|B|A|A|A|B|C|C|A
Hit|0|1|1|1|1|0|1|1|1|1|0|0|0|1|1|0|1|1
Total Hits|12|
Percentage|0.666667|
Signifcance|0.003919|1 in 255.148967768236
Trend|0.053106|
Experiment 4
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|A|C|A|B|C|A|B|C|A|B|C|A|B|A|C|A|B|C
Prediction|B|B|A|B|C|B|C|A|A|B|A|A|B|C|B|B|B|A
Hit|0|0|1|1|1|0|0|0|1|1|0|1|1|0|0|0|1|0
Total Hits|8|
Percentage|0.444444|
Signifcance|0.22326|1 in 4.4790794408513
Trend|0.034614|
Experiment 5
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|C|B|A|C|A|B|C|B|A|C|B|C|A|B|A|C|B
Prediction|A|C|A|A|C|A|B|A|A|A|C|A|A|A|C|C|C|B
Hit|0|1|0|1|1|1|1|0|0|1|1|0|0|1|0|0|1|1
Total Hits|10|
Percentage|0.555556|
Signifcance|0.043348|1 in 23.0690798442699
Trend|0.044571|
Experiment 6
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|C|B|A|C|B|C|A|B|C|A|C|B|C|A|C|B|C
Prediction|C|A|B|A|B|A|A|A|C|A|A|B|B|A|A|C|B|C
Hit|0|0|1|1|0|0|0|1|0|0|1|0|1|0|1|1|1|1
Total Hits|9|
Percentage|0.5|
Signifcance|0.107602|1 in 9.29347421757414
Trend|0.049787|
Experiment 7
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|A|C|B|C|A|C|B|C|A|C|B|A|C|A|B|C|A
Prediction|B|C|C|B|C|A|B|B|A|A|C|B|A|C|A|B|A|B
Hit|1|0|1|1|1|1|0|1|0|1|1|1|1|1|1|1|0|0
Total Hits|13|
Percentage|0.722222|
Signifcance|0.000853|1 in 1172.88901435911
Trend|0.055951|
Experiment 8
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|C|A|B|A|C|B|A|B|C|A|B|C|A|B|C|B|A|B
Prediction|C|B|B|C|B|A|A|B|A|B|B|C|B|B|C|A|A|C
Hit|1|0|1|0|0|0|1|1|0|0|1|1|0|1|1|0|1|0
Total Hits|9|
Percentage|0.5|
Signifcance|0.107602|1 in 9.29347421757414
Trend|0.041726|
Experiment 9
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|B|A|C|A|B|C|B|A|C|B|C|A|B|A|C|A|B|A
Prediction|B|C|C|A|B|A|B|A|B|B|A|A|B|A|B|B|B|A
Hit|1|0|1|1|1|0|1|1|0|1|0|1|1|1|0|0|1|1
Total Hits|12|
Percentage|0.666667|
Signifcance|0.003919|1 in 255.148967768236
Trend|0.053106|
Experiment 10
Trial|1|2|3|4|5|6|7|8|9|10|11|12|13|14|15|16|17|18
Target|C|A|B|C|B|A|B|C|B|A|B|C|B|A|C|B|A|B
Prediction|B|A|B|A|B|A|C|A|A|A|B|C|B|A|B|B|A|B
Hit|0|1|1|0|1|1|0|0|0|1|1|1|1|1|0|1|1|1
Total Hits|12|
Percentage|0.666667|
Signifcance|0.003919|1 in 255.148967768236
Trend|0.060218|
Now you may have already noticed these random sequences aren't very random at all. On the face of it they appear to jump about all over the place as random as a random thing and there's not long streches where a value doesn't appear. In fact those are the very factors that make the sequences deceptively structured rather than random at all.
These sequence are extreme examples of common human biases. Joe Soap approaches the roulette tables in his local casino, with the intention of betting on the 2nd Dozen (13 to 24 inclusive) When he sees that on the table to his left the last spin was 22 and on the table to his right none of the numbers from 13 to 24 have come up in the last three spins then of course he places his bet on the table to the right. His bet has just come up on the left hand table so what are the odds of it coming up twice in a row? On the table to the left it's due to come up. In fact his bet is equally likely to come up on either table as the roulette wheel is unaware of the previous spins and makes no attempt to even itself out.
In the sequences above are generated in an attempt to artificially even themselves out. The odds of a repeat should be one in three. In fact in the sequences above there are deliberately no repeats. The predictor knows that repeats are unlikely and so narrows the field of possible guesses from 3 to 2.
Under those circumstances we expect a hit rate of 8 and 5/6 out of 18. The signifcance of getting at least 13 out of 18 drops from around 1 in 1000 to just 1 in 25
This is rule 1 - the application of a rule imposes structure.
But there's another rule. We also have the idea that a value might be "due." If that value hasn't come up in the last three, it's "due" to come up now. The sequences above follow that logic, the predictor follows that logic too. On the fourth trial there's a 50-50 chance that one of the values won't have come up yet.
This is rule 2 it imposes even more structure on our supposedly random sequence. From my trials it appears to raise the expected level of success to around 12 out of 18. Getting 13 or better is distinctly commonplace.
Following these rules is clearly a distinct advantage with 3 options.
Incidently The one thing these simulations don't replicate is any downward bias. They give no advantage at all to the first prediction and the advantage from the second rule doesn't kick in until trial 4. This introduces an upward bias to the trend where in 1000 runs I only got 36 downward trends of greater magnitude to that in your initial test. That make your downward trend significant and in potential need of explanations when before - assuming a constant accuracy rate - no such explanation was necessary. Many such explanations are possible, wondering if you benefitted from warm ups before the test proper to ranging from simply accepting the effects of fatigue. Perhaps your subconcious is triggering fatigue when it knows you're not doing so well and so selction bias means we're more likely to end on a failure.
Back to the advantage of feedback and structured sequences.
With 4 options this advantage is significantly reduced.
In my 3 option scenario the most common result was 12 out 18 such a result been a fair test (with a genuinely random sequence and no feedback) would have had odds of one in 69.
In my 4 option scenario the most common result was 7 out of 18 such a result been a fair test (with a genuinely random sequence and no feedback) would have had odds of one in 7.
Obviously feedback and the two rules being in effect, still give an advantage that means you're expected to beat the odds in the four option scenario but by an amount that is no longer anywhere near as significant.
Obviously my dabblings in excel are a mere caricature of human behaviour, where I've placed hard and fast rules actual human behaviour is influenced by more subtle biases. However the advantage that this gives to the human acting as a random number generator, in being slightly less predictable, may be negated by the fact that the predictor is human too and has access to all the range of subtlety present in the human brain. Your subconscious knows how it might react to a request to eliminate the biases of rule1 and can project that point of view onto another person, especially if you know your assistant quite well.
It's a shame that you don't have the sequence used in your initial test so that we could see if there were some sort of structure that might have been apparent.
It should have been our last chance as there's no really good reason for you to be continuing what is essentially a game of rock – paper – scissors. However the fact that you've given in to the temptation for immediate feedback and somehow found it difficult to obtain a proper randomizer can be construed as circumstantial evidence that your subconscious knows that this is how it's playing its games and is resisting changes to a favourable set-up. As such it would be hardly surprising that you've not presented the sequence to be investigated for any hidden structure.
Given that you've effectively told your assistant to eliminate the bias associated with what I called rule 1 it would be interesting to see if he'd over compensated. Earlier I said that rule 2 kicks in 50% of the time, that's only true with rule 1 in effect. Without rule 1 in effect this changes. 2/3 of the times one number is "due", 1/9 of the time two numbers are "due" so we'd have had opportunity to see if this bias was picking up the slack in providing you with information.
If you're really having difficulty obtaining a die may I suggest drawing lots, making a cardboard spinner or using the computer that you post from to generate a sequence of pseudo random numbers.
http://www.penpaperpixel.org/tools/d20dicebag.htm
If your assistant won't have access to the web at the time of the test they can pre-generate a sequence of random numbers and take pains to ensure that you don't see it.
Asking your assistant to eliminate their bias just won't work. Whatever bias is being eliminated will be replaced by another. Its a bit like asking someone not to think of an elephant. The fact is if it's not generated by a proper randomizer thent he sequence is structured.
Any further test with immediate feedback will merely help demonstrate that it's an integral part of your ability. Only by eliminating this feedback can you demonstrate that your ability is based upon the present location of the target rather than an ability to extrapolate from its previous locations. As such you do need to be recording your predictions and your assistant needs to be recording the target locations. I'm sure all of us would welcome this being data being included in your reports.
I appreciate your efforts in shielding the cups of cereal within second cups. It greatly reduces opacity concerns and goes someway towards addressing the possibility that wear and tear starts to indentify the cups. However the rims of the cups will still be visible, removing and replacing the cups by those rims will surely increase wear and tear.
Perhaps the cups could be placed inside a shoebox? Do you think that would work with your ability?
I glad that you've taken some steps towards eliminating some of our concerns but it is a slight issue that after trying with covers you decided to go without and that you may have replaced one target for ambient light, the inside of the cup, with another the wall behind the grill. Also after trying with a larger number of potential targets you've now reduced back to three. It's of far greater concern to me that you've resisted using random numbers, are still getting immediate feedback and aren't publishing the actual sequences of target locations and predictions.
In fact in your last test, the only concern you've effectively eliminated is the one about the opacity of the cups.
Working from the hypothesis that your successes are related to structured sequences and feedback it would perhaps be interesting to see these eliminated one at a time and find out at what point your ability fails. However surely it'd be far more interesting if you could prove that your successes had nothing to do with these factors and simply eliminate feedback until the end, use a proper randomizer and allow the randomness of your sequence to be analysed.
Also please please please, strictly predifine the test conditions and the number of trials then stick to that even if you feel things aren't going well. By all means limit yourself to ten attempts in a session to avoid the fatigue and nausea you experience but specify this in advance. Otherwsie selection bias will invalidate your test.
Even without these other concerns it could look as if all you were doing was tinkering with the protocol each time you failed waiting for a run of luck then quitting one that lucky streak had expired.
Anyway thanks for looking into this for us it is greatly appreciated.
FYI if test procedure 5 had been a fair test, getting at least 9 out of 12 correct with three options to choose from has odds of 1 in 259.
trial|hit|running totals|significance|odds
1|1|1|0.333333|1 in 3
2|1|2|0.111111|1 in 9
3|1|3|0.037037|1 in 27
4|0|3|0.111111|1 in 9
5|1|4|0.045267|1 in 22.0909090909091
6|1|5|0.017833|1 in 56.076923076923
7|1|6|0.006859|1 in 145.8
8|1|7|0.002591|1 in 385.941176470588
9|1|8|0.000965|1 in 1035.94736842105
10|1|9|0.000356|1 in 2811.85714285714
11|0|9|0.001372|1 in 728.999999999999
12|0|9|0.003856|1 in 259.366032210834
You should have stopped at 10, and that should have been prespecified. You should have used a randomizer. You shouldn't have had feedback. You should have had covers on the cups. Then we'd have had something really special. As it is all we've proved is that this didn't happen by chance, something we pretty much already knew.