Timnit Gebru, Jamie Morgenstern, Briana Vecchione, Jennifer Wortman Vaughan, Hanna Wallach, Hal Daumé III, & Kate Crawford (2021)
Communications of the ACM, 64(12), 86-92.
DOI: https://doi.org/10.1145/3458723
Abstract. Proposes 'datasheets for datasets', structured documentation of a dataset's motivation, composition, collection process, recommended uses, and potential harms, to promote accountability and mitigate bias in machine learning pipelines.
Tags: ethics transparency datasets