Process capability in a single batch

Eliud Kipchoge

Registered
Hello,

Excuse me for asking, but I have an issue that is driving me crazy. I have a single batch, with 110k datapoints on it. These datapoints are automatically recorded and have both good and bad products. Bad products get discarded inmediately afterwards, and therefore, do not continue in the process.

We usually calculate process capability parameters for each batch, filtering only for the good products, using the reasoning that a bad product is 'only' a yield problem. Is this approach correct?

To calculate the capability, we use Cp and Cpk. It is only one batch, but with over 100k datapoints (around 9k are gone after removing the bad products). I understand that Cp and Cpk both use sigma (in other words, population standard deviation, or the one with N in the denominator). I also know that Pp and Ppk (which we usually do not calculate) use s (i.e, sample standard deviation, the one with N-1 in the denominator).

So far, for each batch, we have always used Cp and Cpk. Therefore, we calculated the parameters with sigma (also known as OVERALL on Minitab software). Is it acceptable to use Cp and Cpk in our case (for single batch, but with a lot of datapoints) or should we use Pp and Ppk for each individually batch (and Cp/cpk when?). If Pp and Ppk, what would be my subgroup size?

There is a final twist (that is the bane of my existance...). The machine prints out a report for the batch. This report states both specification limits, mean of the datapoints, standard deviation, RSD% (relative standard deviation %) and Cp and Cpk (named exactly like this, Cp and Cpk, not Pp(k)). Specs limits and mean are ok. On "standard deviation" they calculate sigma (population), which I agree with (let's suppose the value is 4).

But for RSD% calculation, they do not use 4 (population std dev), they use 3.26, which is the value Minitab calculates for sample standard deviation (taking subgroup size as 1, which I do not understand...). And for Cp and Cpk they also use again the sample standard deviation. If Pp and Ppk should be calculated for this, then the report should show s and not sigma. But if Cp and Cpk should be calculated, then the parameters are plain wrong! Which situation is it?

Sorry for the sea of doubts. Any help or insights would be highly appreciated. Thanks a lot!!
 

Bev D

Heretical Statistician
Leader
Super Moderator
There are so many incorrect things in this post that i don’t know where to begin. No, actually I do. You are doing the right thing by coming here to ask questions. Let me start with the most important things:
1. Process capability indices are all but useless. They should be avoided at all costs. They may be used by a lot of people but so is THIS IS SPAM - PLEASE REPORT THIS POST.
2. It is completely stupid to calculate a process capability number for a single batch. Capability indices are intended for whole processes where 100% data is not available.
3. You NEVER exclude the out of spec parts in the calculation. If you do you are only reporting the distribution of the shipped batch of material which is not the capability of the process at all…
4. The concern with which formula to use is misplaced. With your sample size there is no real difference.
5. In general Cpk is intended to use the within subgroup sample standard deviation which should have n-1 in the denominator. Ppk is the overall capability index which uses all of the data in the (global) SD. Sometimes software - and a misguided automotive approach for pre-production calculations reversed these indices. In the very beginning there was only Cpk where the global SD was used BUT there was no prediction of yield or translation to defect rate or reliability of a Normal distribution.
6. I will let Miner or someone else untangle the math in the end of your post but it is irrelevant in the end because of 1, 2 3, 4 and 5 above.

For references and a longer explanation see my article on “statistical Alchemy“ in the Resources section. There is simply no way to have a valuable discussion regarding this topic in a simple thread…
 

Eliud Kipchoge

Registered
First and foremost, thank you for your speedy reply.

1. Process capability indices are all but useless. They should be avoided at all costs. They may be used by a lot of people but so is THIS IS SPAM - PLEASE REPORT THIS POST.

I do not like them due to the fact that the number itself does not allow me to improve my process, but due to my current job situation I'm forced to calculate and report them. This is why I'm trying to learn more.

2. It is completely stupid to calculate a process capability number for a single batch. Capability indices are intended for whole processes where 100% data is not available.

Please allow me to rephrase. I calculate process capability for a single operation (in this case, filling) but not for the whole process itself (i.e., not combining mixing + filling steps). Only that operation. Is it bad to use process capability in this case?

3. You NEVER exclude the out of spec parts in the calculation. If you do you are only reporting the distribution of the shipped batch of material which is not the capability of the process at all…

In the original 110k datapoints, I have several that are machine errors during set-up (values that are 999999 artificially). These I remove because they are noise and fake data.

There are others, not many, that happen to be 0 or -1. Those are really not filled (scale before and after the filler measures the same). These get automatically rejected and I also reject those. If I calculate any process parameter with those values, I get literally 0.05 and -0.05 values for my process. Also, mean is way lower than it should be.

Lastly, there are datapoints that have the correct amount of filled liquid, but are missing stoppers, which is a "subproduct" of another operation (stoppers loading). I also discarded this, because it is not related to the filling process itself. This point I think can be controversial or might need further explanation. Do you think that, with the out of spec parts, should stay in the data?

4. The concern with which formula to use is misplaced. With your sample size there is no real difference.

I feel like this is true indeed. Also sample size is why I don't bother with normality (data is not normal, which worries me much more than anything else..)

5. In general Cpk is intended to use the within subgroup sample standard deviation which should have n-1 in the denominator. Ppk is the overall capability index which uses all of the data in the (global) SD. Sometimes software - and a misguided automotive approach for pre-production calculations reversed these indices. In the very beginning there was only Cpk where the global SD was used BUT there was no prediction of yield or translation to defect rate or reliability of a Normal distribution.

I must have been missguided then. I thought Cp(k) used population SD, not the other way around. Perhaps I read information about the beginnings of the parameters... In this line, is using MR/d2 a better alternative than using sample/global SD?

For references and a longer explanation see my article on “statistical Alchemy“ in the Resources section. There is simply no way to have a valuable discussion regarding this topic in a simple thread…

This one, right?
elsmar.com/elsmarqualityforum/resources/statistical-alchemy.70/

Will look at it thoroughly. Again thank you for your knowledge!!
 

Bev D

Heretical Statistician
Leader
Super Moderator
I will only comment on the Normality statement. Normal distributions are not normal…in a filing process I would expect it to be a truncated distribution.
 
Thread starter Similar threads Forum Replies Date
T New process capability analysis required when changing a product on the same line? IATF 16949 - Automotive Quality Systems Standard 6
J Process capability analysis in ISO 9001:2015 ISO 9000, ISO 9001, and ISO 9004 Quality Management Systems Standards 3
J Process Capability - Determining the process capability of certain equipment Lean in Manufacturing and Service Industries 6
B Two excellent examples of process capability analysis from Quality Magazine Capability, Accuracy and Stability - Processes, Machines, etc. 5
M Measuring Capability of Process with Multiple Specifications Capability, Accuracy and Stability - Processes, Machines, etc. 6
L How to evaluate the process capability of a data set that is non-normal (cannot be transformed and does not fit any known distribution)? Capability, Accuracy and Stability - Processes, Machines, etc. 14
T Final process capability results - What I am supposed to present? Cp and CpK? APQP and PPAP 11
Q How to perform Process Capability for true position Statistical Analysis Tools, Techniques and SPC 11
L IATF 16949 Cl. 8.3.3.2 Manufacturing process design input - Process capability IATF 16949 - Automotive Quality Systems Standard 1
Y Process Capability for Non-Normal Data - Philosophical Questions Capability, Accuracy and Stability - Processes, Machines, etc. 6
M Process Capability in a High Precision Environment Capability, Accuracy and Stability - Processes, Machines, etc. 7
V Product Development - When to start calculating Process Capability Capability, Accuracy and Stability - Processes, Machines, etc. 3
S Proving Process Capability of a Packing Machine Capability, Accuracy and Stability - Processes, Machines, etc. 6
N Process Capability Study - Aggregate vs. Individual Processes Statistical Analysis Tools, Techniques and SPC 27
V Process Capability for parameters with non-normal distribution. Capability, Accuracy and Stability - Processes, Machines, etc. 16
J Process Capability - How much percentage of tolerance? VDA Standards - Germany's Automotive Standards 3
S Regression Control Chart - Process Capability Statistical Analysis Tools, Techniques and SPC 18
M How to determine Process Capability using Minitab 17 Capability, Accuracy and Stability - Processes, Machines, etc. 6
B Process Capability - Changing Limits to Improve Cpk or find Root Causes? Capability, Accuracy and Stability - Processes, Machines, etc. 8
B Long Term Process Capability from miltiple Machines and Fixtures Capability, Accuracy and Stability - Processes, Machines, etc. 3
E What Data do we need for Welding and Repair Process Capability? Six Sigma 1
K Calculating Capability of a process when data is skewed in the upper range Capability, Accuracy and Stability - Processes, Machines, etc. 4
S What is the difference between Process Capability and Machine Capabilty Capability, Accuracy and Stability - Processes, Machines, etc. 13
V How to Assess Process Capability at the Development or Optimization Phase Capability, Accuracy and Stability - Processes, Machines, etc. 6
C Minitab Binomial Process Capability Analysis Chart Explanation Using Minitab Software 1
kedarg6500 Sample Size for Short Term & Long Term Process Capability Capability, Accuracy and Stability - Processes, Machines, etc. 2
J Is it the same process capability or performance in 8.2.3.1 and SPC? IATF 16949 - Automotive Quality Systems Standard 1
E Binomial Process Capability Sample Size - Variables Data Capability, Accuracy and Stability - Processes, Machines, etc. 4
M On Subgroups in Control Charts and Process Capability Statistical Analysis Tools, Techniques and SPC 3
N SPC and Process Capability Using a Romer Arm Statistical Analysis Tools, Techniques and SPC 6
J Interpreting Process Capability results and ratios Capability, Accuracy and Stability - Processes, Machines, etc. 1
J Go or No-Go Process Capability Quality Tools, Improvement and Analysis 8
S Process Capability with Negative Exponential Distribution Capability, Accuracy and Stability - Processes, Machines, etc. 4
C Process Capability - Attribute Data Analysis (Template) Document Control Systems, Procedures, Forms and Templates 1
C Process Capability - Variables Data Analysis (Template) Document Control Systems, Procedures, Forms and Templates 0
A Cnpk (Non-Parametric Capability Analysis) to assess whether the Process is Stable Capability, Accuracy and Stability - Processes, Machines, etc. 7
V Process Capability to reconfirm the Inspection Frequency Manufacturing and Related Processes 3
B Sample Size based on Process Capability Indices (Pp/Ppk)? Capability, Accuracy and Stability - Processes, Machines, etc. 20
P Calculating Process Capability from Data of attached spreadsheet Statistical Analysis Tools, Techniques and SPC 4
D How to measure Process Capability for Homogeneous Batch Manufacturing Capability, Accuracy and Stability - Processes, Machines, etc. 4
A Burrs affecting measurements and Process Capability Capability, Accuracy and Stability - Processes, Machines, etc. 7
G Assessing Process Capability on Variation (Hardware Adjustment Mean Shift) Capability, Accuracy and Stability - Processes, Machines, etc. 4
D Calculating Upper-CL/Lower-CL and Process Capability Statistical Analysis Tools, Techniques and SPC 8
V Defining Subgroups and Importance of Subgroups in SPC and Process Capability Statistical Analysis Tools, Techniques and SPC 6
T Process Capability - Understanding what Process Capability Indices mean Capability, Accuracy and Stability - Processes, Machines, etc. 17
Q TS 16949 Process Capability Requirements - Cpk ≥ 1.67 on All Key Parameters? IATF 16949 - Automotive Quality Systems Standard 8
S Process Capability questions - Calculating Cpu using Minitab Six Sigma 6
C Sample Size Calculation Formula for Process Capability Statistical Analysis Tools, Techniques and SPC 25
D Capability Analysis for Packing Process in Food Manufacturing Capability, Accuracy and Stability - Processes, Machines, etc. 11
Q When to use Cpk or Ppk for a Process Capability Study? Capability, Accuracy and Stability - Processes, Machines, etc. 14

Similar threads

Top Bottom