Determine applicability: Consider this question if creating training data for the AI model or if normality and errors in training data for an pre-trained AI model are unclear, and determine if the requirement has been satisfied.
• Abnormal data include various possible errors in the collection and processing of datasets and outliers that significantly deviate from the general scope of data. Hence, you cannot ensure the performance and robustness of the AI model if no inspection and management of abnormal data are carried out.
• Particularly, you must inspect and manage bias in training data in advance to ensure trustworthiness from the perspective of respect for diversity in the public sector. Using pre-trained and validated models can reduce the impact due to abnormal data during additional training and minimize the efforts required for revalidation.
• Prepare a separate technique for the identification of abnormal data in data pre-processing when using unstructured data for training.