AI 신뢰성 센터 (AI Trustworthiness Center)

04-1a Have you explained the data attributes before and after cleansing?

• Data cleansing is the stage in which data are selected and processed to create training data before labeling. Users who work only with cleansed data cannot accurately identify the attributes of the raw data. Therefore, explain the data attributes before and after cleansing, together with any related information about the cleansing, in case additional data need to be collected in the future.

• Generally, data cleansing can be performed by excluding or converting parts of the data according to predefined rules using open-source tools, or by visual inspection. You can analyze data attributes by visualizing the cleansed data.

• If you have collected the raw data yourself, provide information about the purpose of building the data, the type of data, the criteria for cleansing (e.g., domain characteristics), and the cleansing tool. The following are examples of data cleansing criteria for each data type.

✔ Image data: Image size, aspect ratio, resolution, imaging equipment, personal data processing, copyright, etc.
✔ Text data: Amount of text, grammatical accuracy, appropriateness of the content, relevance to the topic, etc.
✔ Audio data: Volume, accuracy of pronunciation, noise and static, inaudible segments (based on the acceptable range), personal data, copyright, etc.
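
The rule-based cleansing and the before/after attribute report described above could be sketched roughly as follows for text data. This is only a minimal illustration using the Python standard library: the names `AttributeSummary`, `summarize`, and `cleanse`, the `min_chars` threshold, and the sample records are all hypothetical and are not prescribed by the guideline.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class AttributeSummary:
    """Simple attributes of a text dataset (hypothetical report format)."""
    record_count: int
    mean_length: float
    min_length: int
    max_length: int


def summarize(texts):
    """Compute basic attributes so they can be reported before and after cleansing."""
    lengths = [len(t) for t in texts]
    if not lengths:
        return AttributeSummary(0, 0.0, 0, 0)
    return AttributeSummary(len(lengths), float(mean(lengths)), min(lengths), max(lengths))


def cleanse(texts, min_chars=10):
    """Apply predefined rules: convert first, then exclude.

    min_chars is an assumed threshold for the "amount of text" criterion.
    """
    seen = set()
    cleansed = []
    for text in texts:
        normalized = " ".join(text.split())  # conversion rule: collapse whitespace
        if len(normalized) < min_chars:      # exclusion rule: too little text
            continue
        if normalized in seen:               # exclusion rule: exact duplicate
            continue
        seen.add(normalized)
        cleansed.append(normalized)
    return cleansed


# Made-up sample records, for illustration only.
raw = [
    "The   model was trained on  public data.",
    "ok",
    "The model was trained on public data.",
    "Noise levels were measured for each audio clip.",
]

cleansed = cleanse(raw)
before = summarize(raw)
after = summarize(cleansed)

print("before cleansing:", before)
print("after cleansing: ", after)
```

Publishing both summaries alongside the rules themselves (here, whitespace normalization, the minimum length, and duplicate removal) gives users of the cleansed data a way to judge how it differs from the raw data.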