If I was reviewing your study, I would ask how were the 20 parts selected. The legitimacy of the result doesn't matter whether calculations were done correctly if the samples were not chosen appropriately.
AIAG (2nd edition section 6) says in a short study of 20 parts, some of the parts should be slightly below and some slightly above acceptance limits. You have not explained whether both directions are represented in this study, and how far out of limits are the NoGo parts. It is recommended that this determination (how far out of limits) be made using variable measurements.
Making Sense of Attribute Gage R&R Calculations - iSixSigma
"Practically speaking, if 'clearly good' parts and 'clearly bad' parts are chosen, the ability of the measurement system to accurately categorize the ones in between will not be tested."
A better approach (the long method study) involves 8 sample parts which are known to reflect the full range of variation, equally spaced. The 8 parts are measured 20 times, and what you want to see are one part accepted by the attribute gauge 20 times out of 20 and one part rejected 20 out of 20. The other six parts should be accepted 1 < a < 19.
The Minitab support website has good explanations of how the long or default study methods are conducted. You don't have to be a user of the software to learn from their website.