Hi Frank
Thanks for the informative link. It was very useful.
Though I have another question to ask. When I was doing some research on Type IV SS. It said the following:
"
Type IV sums of squares are not recommended for testing hypotheses for lower-order effects in ANOVA designs with missing cells, even though this is the purpose for which they were developed. This is because Type IV sum-of-squares are invariant to some but not all g2 inverses of X'X that could be used to solve the normal equations. Specifically, Type IV sums of squares are invariant to the choice of a g2 inverse of X'X given a particular ordering of the levels of the categorical predictor variables, but are not invariant to different orderings of levels
"
My question is that what is meant by the following line - "Type IV sums of squares are invariant to the choice of a g2 inverse of X'X given a particular ordering of the levels of the categorical predictor variables, but are not invariant to different orderings of levels"??
Also
Type V sums of squares can be illustrated by using a simple example. Suppose that the effects considered are A, B, and A by B, in that order, and that A and B are both categorical predictors with, say, 3 and 2 levels, respectively. The intercept is first entered into the model. Then A is entered into the model, and its degrees of freedom are determined (i.e., the number of non-redundant columns for A in X'X, given the intercept). If A's degrees of freedom are less than 2 (i.e., its number of levels minus 1), it is eligible to be dropped. Then B is entered into the model, and its degrees of freedom are determined (i.e., the number of non-redundant columns for B in X'X, given the intercept and A). If B's degrees of freedom are less than 1 (i.e., its number of levels minus 1), it is eligible to be dropped. Finally, A by B is entered into the model, and its degrees of freedom are determined (i.e., the number of non-redundant columns for A by B in X'X, given the intercept, A, and B). If B's degrees of freedom are less than 2 (i.e., the product of the degrees of freedom for its factors if there were no missing cells), it is eligible to be dropped.
Q: why did they say in effect ‘A’ that if the df is less than 2. How did they come up with this statement? I mean why did they choose 2? Similarly for Effect ‘B’ and ‘A by B’?
Q: In Effect ‘A by B’ will that be dropped or not. Since it has a df of 2, and the rule say that its eligible to drop if less than 2?
Note that Type V sums of squares involve determining a reduced model for which all effects remaining in the model have at least as many degrees of freedom as they would have if there were no missing cells
Q: so the total df of the complete model is dfA + dfB + dfA by B = 2+1+2 = 5
And the df of the model after the effects are dropped is dfA by B = 2
So how does that equal. Or am I being really stupid and interpreting this completely wrong.
PS: Does anyone know where I can find more information on Type V and Type VI SS?
Thanks