AI/ML training dataset

shimonv

Trusted Information Resource
Hello everyone,

I'd appreciate your feedback on AI/ML training dataset:
1. Do we need a board-certified physician for labeling of training dataset?
2. Are there other requirements / consideration for preparing the training datasets, based on your experience with FDA?



Thank you!

Shimon
 

Ronen E

Problem Solver
Moderator
1. The quality of the training (and thus the AI) is only as good as the quality of the dataset expert grading - GIGO

2. Don't forget to leave a fresh dataset (not used in initial training) for validation of the AI model after it's locked.

^ FWIW
 

QuinnM

Involved In Discussions
I can't speak to what will work with the FDA, and I need to read the read the new FDA PCCP guidance document, but I can tell you what we do. We have an annotation process and form that covers all aspects of the annotation/labeling activity. This includes requirements for individuals who are performing the annotation/labeling.
 

shimonv

Trusted Information Resource
I can't speak to what will work with the FDA, and I need to read the read the new FDA PCCP guidance document, but I can tell you what we do. We have an annotation process and form that covers all aspects of the annotation/labeling activity. This includes requirements for individuals who are performing the annotation/labeling.
Cool. So, what determines who does the annotation?
 

Ed Panek

QA RA Small Med Dev Company
Leader
Super Moderator
Try to present the results based on typical FDA requirements: Sex, Age, Disease states, Etc
Make sure you positive predictive and negative predictive numbers meet your stated specification.
Be prepared with expert opinion to defend the values you've chosen for the model
 

Ed Panek

QA RA Small Med Dev Company
Leader
Super Moderator

QuinnM

Involved In Discussions
Cool. So, what determines who does the annotation?
We consider real world activities, that is who is doing this activity now with out AI? What are their qualifications and experience. We balance between who performs the annotation and QA on the annotation. Most, about 90%, are MDs, while others are certified in their field but may not be an MD. We also define years of experience in the specification too. The QA process has been executed by MDs.
 

Ronen E

Problem Solver
Moderator

Yikes
Exactly my point - GIGO

Probably obtaining large-enough, high quality annotated datasets for training (and validation) is the biggest obstacle for developing really good AI expert systems.
 
Top Bottom