# Shewhart Constants vs Central Limit Theorem in calculating Control Limits

I

#### Inspector-71

I am confused by the use of shewart constants instead of +/- 3 standard deviations in calculating control limits

Say for an xbar-r chart I have my data set and will need to calculate the control limits via the relevant D3 and D4 constants. I believe I have to use these instead of +/- 3 standard deviations as the constants allow for the effects of sample sizes and different distribution types (if anyone could expand on that, it would be appreciated). +/-3 standard devitions for control limits could only be used if the distribution was normal?

However, in CLT if I am sampling any consistent distribution shape with say a sample of 5 parts per hour, then the mean of these samples when plotted should produce a normal distribution once there are a decent amount of means to plot. Since the distribution is normal, the empirical rule can be applied and 99.7% of data should be within +/- 3 standard deviations.

so why do xbar-r charts not use +/-3 standard deviations?

To add further confusion, when calculating Pp/Ppk we do use +/- 3 standard deviations when and compare this to the specification limits. Why use +/- standard deviations here and not in the control charts?

Thanks for any help.

#### reynald

##### Quite Involved in Discussions
Re: shewart constants vs central limit theorem

so why do xbar-r charts not use +/-3 standard deviations?
---Actually it does use +/-3 standard deviations, but the standard deviations are estimated from the range-bar. To gest the estimated value of this standard deviation the range is multipled by a certain constant which depends on the sample size. The factor +/-3 is already incorporated in that constant multiplier.

To add further confusion, when calculating Pp/Ppk we do use +/- 3 standard deviations when and compare this to the specification limits. Why use +/- standard deviations here and not in the control charts?
--Ppk assumes that you use the overall data and not in its subgrouped form. You use the same contants as in the control charts when computing for the Cp/Cpk.

I

#### Inspector-71

Re: shewart constants vs central limit theorem

Thankyou. I had talked myself into that direction but it's great to have it confirmed.

#### Steve Prevette

##### Deming Disciple
Staff member
Super Moderator
A lot of the basis for the use of range calculations is that one cannot calculate the standard deviation of a sample very easily with an adding machine and a slide rule. Thus, the reliance (prior to computers) on the use of range to estimate standard deviation.

Today, most authors (especially Dr. Wheeler) are still proponents of the range versus simply typing in "STDEV" in Excel spreadsheet. One thing that does happen is that if your data has an outlier, the STDEV will "blow up" to a larger quantity (due to the squared difference in the formula) rather than the range. Moving range also takes into account the sequence of the data, while the standard deviation calculation does not.

I believe an argument can be made for using the standard deviation estimator (even Shewhart documents it is more statistically "powerfull" than the moving range), but it has not been accepted.

I've done a lot of comparisons, and the moving range estimate usually comes out quite close to the sigma standard deviation calculation anyways.

C

#### Curtis317

"I've done a lot of comparisons, and the moving range estimate usually comes out quite close to the sigma standard deviation calculation anyways."

The estimate only comes close to the sigma standard deviation if the data is normal. The futher it diverges from normal the bigger the difference will be.

#### Bev D

##### Heretical Statistician
Staff member
Super Moderator
I'm sorry but this is a common misperception. the standard deviation is not based on Normality. and estimates of the total standard deviation from the within subgroup variation is not based on Normality either. It is based on the homogeneity of the process stream. IF the I, MR chart is sampled in time order and the process stream is homogenous and the movign range is calculated on the time ordered data the moving range estimate will be very close to the total standard deviation calculated from all of the individual values...close enough for SPC. Remember that control charts are not precise statistical estimators...

If the process stream is NOT homogenous the within subgroup variation will NOT provide an accurate estimate of the overall standard deviation or the variation of the subgroup averages. This is exactly why Shewhart used the within subgroup variation to estimate the between subgroup variation. If it does, the process is in statistical control; if it doesn't it is out of statistical control. (this part is a bit more complicated and involves rational subgrouping and understanding that non-homogenous processes can be stabel and predictable...)

C

#### Curtis317

I'm sorry but this is a common misperception. the standard deviation is not based on Normality. and estimates of the total standard deviation from the within subgroup variation is not based on Normality either. It is based on the homogeneity of the process stream. IF the I, MR chart is sampled in time order and the process stream is homogenous and the movign range is calculated on the time ordered data the moving range estimate will be very close to the total standard deviation calculated from all of the individual values...close enough for SPC. Remember that control charts are not precise statistical estimators...

If the process stream is NOT homogenous the within subgroup variation will NOT provide an accurate estimate of the overall standard deviation or the variation of the subgroup averages. This is exactly why Shewhart used the within subgroup variation to estimate the between subgroup variation. If it does, the process is in statistical control; if it doesn't it is out of statistical control. (this part is a bit more complicated and involves rational subgrouping and understanding that non-homogenous processes can be stabel and predictable...)
If you calculate the Ppk and Cpk for the same set of data they can be much different. The closer they are to each other the more "Normal" the data will be. I have no issue with your comments about the control charts. The standard deviation is just a statistic generated from the data and "normality" has nothing to do with the number.

#### Steve Prevette

##### Deming Disciple
Staff member
Super Moderator
The estimate only comes close to the sigma standard deviation if the data is normal. The futher it diverges from normal the bigger the difference will be.
The standard deviation is the standard deviation! The statistical definition of the standard deviation of a set of data is sum(Xi - Xbar)^2 / N.

For a sample, N-1 is used in the denominator in order to give an unbiased estimator of the population.

Shewhart even states in Economic Control of Quality of Manufactured Product (page 289) "It appears, therefore, that there is good reason to choose the standard deviations sigma of the sample as the basis for estimate of the standard deviation sigma of the universe to detect a change delta sigma.

#### Bev D

##### Heretical Statistician
Staff member
Super Moderator
If you calculate the Ppk and Cpk for the same set of data they can be much different. The closer they are to each other the more "Normal" the data will be. I have no issue with your comments about the control charts. The standard deviation is just a statistic generated from the data and "normality" has nothing to do with the number.
actually again Normality still has nothing to do with the situation you describe.
the difference between Cpk and Ppk (in the traditional formulas) is where the SD comes from and the centering of the process within the spec limits. teh difference between Cpk and Ppk is the exactly like control charts. Cpk uses within subgroup variation - IF the process is homogenous the within subgroup variation will provide an accurate calculation of the total variation because the between sample variation is just sample error; in oether words teh process steam is homogenous.

The accuracy of the Cpk or Ppk index to the actual spread of real vaules vs the sepc limits and the resutling defect rate IS dependent on the normality of the process. but within any given process the closeness of the Cpk and Ppk index to each other is due to the homogeneity of the process and the centering.

#### Steve Prevette

##### Deming Disciple
Staff member
Super Moderator
I would just suggest that Cpk and Ppk are off-topic for the original question.

Personally, I am no fan of either number. If I really want a good estimate for percent defective, then I would say I need to know the distribution of the source data.

S used excel formula calculated the ARL's with Rule1&Rule2 for Shewhart control chart Statistical Analysis Tools, Techniques and SPC 0
M Shewhart Control Chart for SPC In Excel - Explain the yellow and red marked column Statistical Analysis Tools, Techniques and SPC 2
G Why for a Shewhart chart the minimum feasible value for APL is 2401 units? Statistical Analysis Tools, Techniques and SPC 8
D Are Four Sigma Control Limits based on Shewhart's work Statistical Analysis Tools, Techniques and SPC 3
Shewhart Control Chart - ISO/TS 13530:2009(E), Page 22. General Measurement Device and Calibration Topics 3
U Shewhart, Deming and Data - a thought provoking article Statistical Analysis Tools, Techniques and SPC 3
Deming and Shewhart information for Power Point slides related to P-D-C-A Philosophy, Gurus, Innovation and Evolution 8
How to calculate the A4 constants in Median/Range control charts? Statistical Analysis Tools, Techniques and SPC 2
R Control Chart Constants and Confidence Interval Statistical Analysis Tools, Techniques and SPC 7
S Help with Table of Constants and Formulas for Control Charts Statistical Analysis Tools, Techniques and SPC 5
S Are the Values for Constants determined by Sample Size or Sub-Group Size Statistical Analysis Tools, Techniques and SPC 14
J Gage R&R studies with 5 appraisers and K constants Gage R&R (GR&R) and MSA (Measurement Systems Analysis) 6
V MSA - Which edition constants are these? What is definition of Tolerance? Gage R&R (GR&R) and MSA (Measurement Systems Analysis) 4
A Name for the Control Chart Constants (A2, A3, d2, D3 etc) Statistical Analysis Tools, Techniques and SPC 8
R How to choose constants when sample size larger than 25? Statistical Analysis Tools, Techniques and SPC 7
E Formulas and Constants - Control Limits for Individual and Moving Range Control Chart Statistical Analysis Tools, Techniques and SPC 2
D Tolerance Limit Constants - Calculating tolerance limits using JMP Statistical Analysis Tools, Techniques and SPC 1
V Gage R&R constants - Appraisers vs K2 for 1 to 5 Appriasers - Seeking table in a file Gage R&R (GR&R) and MSA (Measurement Systems Analysis) 8
B Formulas for calculating control chart constants Statistical Analysis Tools, Techniques and SPC 9
R Sources for chemical constants Misc. Quality Assurance and Business Systems Related Topics 1
M MSA - d4 values above 3 trials - statistical constants tables Gage R&R (GR&R) and MSA (Measurement Systems Analysis) 2
Y AIAG's MSA (Measurement Systems Analysis) 3rd Edition - K1, K2, K3 constants Gage R&R (GR&R) and MSA (Measurement Systems Analysis) 6
I Partcertall v6.15 & R&R worksheet with the correct constants for the 3rd edition APQP and PPAP 26
Y Table of Constants for Control Charts Statistical Analysis Tools, Techniques and SPC 6
N More Central Limit Theorem Questions Six Sigma 4
K Half-Fractional vs. Full Factorial Central Composite Design Using Minitab Software 6
Temperature Controlled Biopharmaceutical Logistics in Central America [infographic] Pharmaceuticals (21 CFR Part 210, 21 CFR Part 211 and related Regulations) 3
Locating Medical Device Regulations in Central America Other Medical Device Regulations World-Wide 4
C X-Ray Producing Product Regulations in Mexico, Central America, and South America Other Medical Device Regulations World-Wide 2
A Central/South America Regulatory Approval Other Medical Device Regulations World-Wide 8
SPC for Precision Machining Presentation March 25, 2010 ASQ Central Kentucky Section ASQ - American Society for Quality 1
ISO 13485 in Hospital Central Sterilization and Clinical Engineering Services ISO 13485:2016 - Medical Device Quality Management Systems 27
S Distributions - You can't approximate everything with central limit theorem? Six Sigma 12
Time Warner Cable may Lose Viacom: MTV, Nickelodeon, Comedy Central MAY Go Dark After Work and Weekend Discussion Topics 3
E Using Median for Central Location - Analysis of Nonparametric Data Quality Tools, Improvement and Analysis 7
Central European Automotive Manufacturers and TS 16949 IATF 16949 - Automotive Quality Systems Standard 3
D T tests - To test for central measure (means) - Steel supplier Statistical Analysis Tools, Techniques and SPC 8