logo
ResearchBunny Logo
On responsible machine learning datasets emphasizing fairness, privacy and regulatory norms with examples in biometrics and healthcare

Computer Science

On responsible machine learning datasets emphasizing fairness, privacy and regulatory norms with examples in biometrics and healthcare

S. Mittal, K. Thakral, et al.

This study highlights the critical need for responsible machine learning datasets, focusing on fairness, privacy, and compliance with regulations. Conducted by Surbhi Mittal, Kartik Thakral, Richa Singh, Mayank Vatsa, Tamar Glaser, Cristian Canton Ferrer, and Tal Hassner, it uncovers widespread issues within computer vision datasets, especially in biometrics and healthcare, and proposes a new evaluation rubric.

00:00
00:00
Playback language: English
Abstract
This study emphasizes the importance of responsible machine learning datasets by focusing on fairness, privacy, and regulatory compliance. A large audit of computer vision datasets, particularly in biometrics and healthcare, reveals widespread issues in these areas. The authors propose a rubric for evaluating dataset responsibility and, after analyzing 60 datasets, demonstrate the urgent need for improved dataset creation methodologies to address fairness-privacy paradoxes and comply with global data protection legislation.
Publisher
Nature Machine Intelligence
Published On
Aug 12, 2024
Authors
Surbhi Mittal, Kartik Thakral, Richa Singh, Mayank Vatsa, Tamar Glaser, Cristian Canton Ferrer, Tal Hassner
Tags
machine learning
fairness
privacy
data protection
dataset responsibility
computer vision
healthcare
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny