And that's the problem. Too many of the psi proponents do not accept that PEAR's protocols are valid (although they would accept the validity of a real effect, if shown). And too many psi opponents also do not accept that PEAR's protocols are valid (although they would accept the validity of a negative effect, if shown).
Agreed. Which is why I never conceived that even a meticulously done experiment would be accepted by everybody.
If you take stuff like the sheep/goat effect seriously, then you can't just handwave it away and pretend that it won't change anything.
I understand. It depends on how strong somebody considers the sheep/goat effect. For example, can a goat somehow reach back though time after the results were published and change all copies of the encrypted recorded data, rewriting the historic record retroactively? If one believes that, then no scientific experimentation of any sort is possible. Let's but that level of sheep/goat effect outside the target testing envelope.
(If if the sheep/goat effect is that strong, I think we'd be living in a rather more magical world, but that's another matter. Why would goats be vastly more powerful than sheep? Unless there was some asymmetry, like "goats are aided by the resilience of an objective universe, while sheep are having to painfully and partially override that objective universe").
So here's what we could offer in this thought experiment: believers can have as many believers as they can round up in the next room, sheeping like crazy while the experiments are going on, with no goats nearby (except some but not all of the subjects one at a time). The schedule of the experiment will not be disclosed to goats before publicatio, so even if a goat could affect the 1's and 0's from Africa, they don't know which way to root for until too late.
I'm quite aware that this wouldn't stop objections from true believers. But it *would* define the limitations. A negative result would be "Even in the presence of many sheep who know what results are being tested for, and without goats having advance knowledge of the timing, we found no effect above 1 bit in N". Those disputing the experiment would be left with either "the effect is less than 1 bit in N", or "the goats have so much power that they could overcome even that". It sets an envelope, pushes the remaining believers to stipulate an especially powerful goat effect as the "minimum" threshold.
And, from a physics standpoint, I'd need to be convinced that any physical white-noise generator can be calibrated to be balanced and stable to one part in 10,000 over an extended period.
Suppose it doesn't have to. Why would you use only one contiguous control period, eg: before all the tests for your baseline? Calibrate your baseline before, after, and interspersed with subject testing. If the interspersed "control" periods are statistically like each other but the "test" periods differ, you have a result. Even if there was a small systematic or random drift.
(By the way, this is similar to what I do to compensate for a drifting clock in data aquisition. I can't get clocks within my power and financial budget which do not drift over the years, but I can record time deltas every few weeks and interpolate to well within my accuracy needs).
Suppose the generator slowly drifted from 49.99% 1's to 50.02% ones over the course of the weeks of the experiment. At the early part of the experiment (when the control periods were giving 49.99% 1's before and after subject testing periods) the subjects could tweak that upward or downward by 1 in 10000. In the later end of the experiment, the subjects were tweaking the results up or down by 1 in 10000 from the adjacent "control" periods before and after. I'm just making up numbers, but the point is that one can scientifically and statistically show a or discredit correlation between cloud cover and sunspot count, without either being a perfect random event.
The better the randomness, the easier it becomes, but perfect randomness isn't required (and is hard to even define), just correlation. A better generator just means fewer trials. No, I'm not assuming linear drift - all the interspersed control periods would detect nonlinear drifting as well.
The only reason to do a lengthy calibration period beforehand would be if one wanted to estimate in advance how many trials would be needed to overcome the instability of the random generator - but this "pre experiment" would not be used in the primary analysis of the results - the time interspersed control periods would be.
Zeph