Determine applicability: Consider this question if there is a possibility of creating a dataset for developing AI algorithms or models for the healthcare sector or of collecting data in addition to the dataset in the future, and determine if the requirement has been satisfied.
• It is difficult to verify the source of the data for medical data due to the de-identification of personal data. Metadata should be provided to identify the features of raw data if you need to reuse data or collect additional data in the same format.
• Also, information about the data, including the training data, metadata, and labeling work instructions, must be obtained to assist developers and stakeholders associated with the AI system in understanding the collected data and preventing potential biases or errors.
• Examples of information to be provided to stakeholders are the provenance and format of collected data; the collection, cleansing, and processing methods for data; data licenses; and protected attributes with potential biases.