• Having multiple data labelers is prioritized to reduce human bias in the data labeling phase. It is also ideal to include labelers with diverse and evenly distributed demographic attributes and background knowledge. The major factors to take into account for even distribution are as follows:
✓ Race, religion, gender, ethnicity, disabilities, linguistics, nationality, and economic conditions.
• The following two things must be examined to ensure the diversity and even distribution of labelers: the implementation of methods such as crowdsourcing; and data labelers’ demographic attributes and background knowledge, which must undergo inspections and analyses.
✓ Crowdsourcing refers to outsourcing data labeling to enhance the involvement of participants trained in labeling. This can increase the diversity in labelers than the existing group of labelers.
• Labelers must be recruited from healthcare professionals when medical expertise is need for the data labeling process. Consider the following experts from numerous medical disciplines for the role of labelers.
✓ Healthcare professionals: Physicians (e.g. internal medicine, orthopedics), dentists, etc.
✓ Pharmaceutical and medical physics professionals: Pharmacists, medical physicists, etc.
✓ Medical assistants: Nursing assistants, child care assistants, ambulance drivers, dental assistants, nurses, kinesiologists, chiropractors, psychomotor therapists, speech therapists, radiation operators, medical laboratory technicians, hearing prosthesists, optometrists, orthotists, etc.