About | HeinOnline Law Journal Library | HeinOnline Law Journal Library | HeinOnline

104 Calif. L. Rev. 671 (2016)
Big Data's Disparate Impact

handle is hein.journals/calr104 and id is 695 raw text is: Big Data's Disparate Impact
Solon Barocas* & Andrew D. Selbst* *
Advocates of algorithmic techniques like data mining argue that
these techniques eliminate human biases from the decision-making
process. But an algorithm is only as good as the data it works with.
Data is frequently imperfect in ways that allow these algorithms to
inherit the prejudices of prior decision makers. In other cases, data
may simply reflect the widespread biases that persist in society at
large. In still others, data mining can discover surprisingly useful
regularities that are really just preexisting patterns of exclusion and
inequality. Unthinking reliance on data mining can deny historically
disadvantaged and vulnerable groups full participation in society.
Worse still, because the resulting discrimination is almost always an
unintentional emergent property of the algorithm's use rather than a
conscious choice by its programmers, it can be unusually hard to
identify the source of the problem or to explain it to a court.
This Essay examines these concerns through the lens of
American antidiscrimination law-more particularly, through Title
DOI: http://dx.doi.org/10.15779/Z38BG31
California Law Review, Inc. (CLR) is a California nonprofit corporation. CLR and the
authors are solely responsible for the content of their publications.
*  Postdoctoral Research Associate, Center for Information Technology Policy, Princeton
University; Ph.D. 2014, New York University, Department of Media, Culture, and Communication.
This research was supported in part by the Center for Information Technology Policy at Princeton
University.
** Scholar in Residence, Electronic Privacy Information Center; Visiting Researcher,
Georgetown University Law Center; Visiting Fellow, Yale Information Society Project; J.D. 2011,
University of Michigan Law School. The authors would like to thank Jane Bambauer, Alvaro Bedoya,
Marjory Blumenthal, Danielle Citron, James Grimmelmann, Moritz Hardt, Don Herzog, Janine Hiller,
Chris Hoofnagle, Joanna Huey, Patrick Ishizuka, Michael Kirkpatrick, Aaron Konopasky, Joshua
Kroll, Mark MacCarthy, Arvind Narayanan, Helen Norton, Paul Ohm, Scott Peppet, Joel Reidenberg,
David Robinson, Kathy Strandburg, David Vladeck, members of the Privacy Research Group at New
York University, and the participants of the 2014 Privacy Law Scholars Conference for their helpful
comments. Special thanks also to Helen Nissenbaum and the Information Law Institute at New York
University for giving us an interdisciplinary space to share ideas, allowing this paper to come about.
Copyright © 2016 by Solon Barocas and Andrew Selbst. This Essay is available for reuse under the
Creative     Commons       Attribution-ShareAlike  4.0      International   License,
http://creativecommons.org/licenses/by-sa/4.0/. The required attribution notice under the license must
include the article's full citation information, e.g., Solon Barocas & Andrew D. Selbst, Big Data's
Disparate Impact, 104 CALIF. L. REV. 671 (2016).

671

What Is HeinOnline?

HeinOnline is a subscription-based resource containing thousands of academic and legal journals from inception; complete coverage of government documents such as U.S. Statutes at Large, U.S. Code, Federal Register, Code of Federal Regulations, U.S. Reports, and much more. Documents are image-based, fully searchable PDFs with the authority of print combined with the accessibility of a user-friendly and powerful database. For more information, request a quote or trial for your organization below.



Short-term subscription options include 24 hours, 48 hours, or 1 week to HeinOnline.

Contact us for annual subscription options:

Already a HeinOnline Subscriber?

profiles profiles most