바로가기 메뉴 본문 바로가기 주메뉴 바로가기
  • 14-2cHave you managed the versions during data change?
    • In case of changes in data, such as updating the training data or relabeling due to error, while developing an AI model, the model—the output of the training—changes as well. Also, additional training may be needed if the entire dataset is replaced or the dataset used in previous training is completely different from the features.

    • When changing the training data, the version of the used training data, along with the AI model trained with the version, should be managed. If you must change the training data by adding new data, record the percentage of the new data used in the training or testing, and the resulting changes in the model’s performance should be trackable.

    • Consider using data version control tools (e.g. Data Version Control or DVC) based on open sources for machine learning projects, or build a training data version management system to manage the versions of the training data and the model.