If you only test 3 devices but over a range of conditions...how do you justify that the results from the 3 devices will generalize to all devices produced? i.e. manufactured devices will meet there specification over the conditions with a certain confidence and reliability?
Thanks
The short answer is conditional probability.
But there a few things to clarify. First when I say 3 units, it's 3 units at a min or max of the designed tolerances run or used at the worst case use conditions. So there may actually be more than simply 3 units depending on the complexity of the thing and the nature of the use.
I always think about this in terms of Development, OQ and PQ validation. In development we should be screening experiments, full factorials and response surface experiments to determine the critical (input) characteristics and what tolerances are required on those critical characteristics to guarantee that the output meets the requirements. If we utilize physics and good industrial experimentation we can design product that will have no or very few defects. OQ is used to confirm (or validate) that at the worst case tolerance levels and under worst case use conditions we have no or very few defects/failures. If you can meet requirements at the worst case conditions the nominal will certainly meet requirements. This is just physics. of course if you have defects at the worst case conditions you can use conditional probability to determine the actual failure rate. PQ is typically run at nominal. this is where you can and should use the standard sample size calculations as you can are creating things at manufacturing volumes.
I have found that the use of sample size calculations is needed when we haven't done a good job of design. The downside of these sample sizes is that they REQUIRE a uniformly distributed population from which to sample such that the sample is representative of the population. Of course if we haven't done a good job in development our knowledge of the physics is meager and our knowledge of the nature of the distribution of the population is also meager.
All of that said I know that many regulatory groups cling to the old fashioned random statistics and there may be little one can do to change their minds.