바로가기 메뉴 본문 바로가기 주메뉴 바로가기
  • 06-3bHave you made an effort to recruit diverse data labelers?
    • Having multiple data labelers is prioritized to reduce human bias in the data labeling phase. It is also ideal to include labelers with diverse and evenly distributed demographic attributes and background knowledge. The major factors to take into account for even distribution are as follows:
    Race, religion, gender, ethnicity, disabilities, linguistics, nationality, and economic conditions.

    • The following two things must be examined to ensure the diversity and even distribution of labelers: the implementation of methods such as crowdsourcing; and data labelers’ demographic attributes and background knowledge, which must undergo inspections and analyses.
    Crowdsourcing refers to outsourcing data labeling to enhance the involvement of participants trained in labeling. This can increase the diversity in labelers than the existing group of labelers.