Determine applicability: Consider this question if you are collecting and labeling datasets to develop AI algorithms or models in the public sector, and determine if the requirement has been satisfied.
• A supervised learning model requires labeling of training data. However, there may be biases in labeling due to the reflection of labeler’s specific intentions, mistaken omission of feature information, and unconscious judgment.
• Prevent such biases by training and providing labelers with clear standards and work guidelines in advance for labelers to be able to recognize issues during data labeling and avoid other problems in the future.
• Perform tasks to prevent bias and minimize errors and biases from labelers by recruiting a diverse group of labelers and reviewers qualified with background knowledge and requirements for labeling based on thorough situation analysis and their understanding of source data (e.g. text, audio) used in the training.