Fairness in Clustering with Multiple Sensitive Attributes

Savitha Sam Abraham, Deepak Padmanabhan, Sowmya S. Sundaram

Research output: Chapter in Book/Report/Conference proceedingConference contribution

24 Citations (Scopus)
245 Downloads (Pure)

Abstract

A clustering may be considered as fair on pre-specified sensitive attributes if the proportions of sensitive attribute groups
in each cluster reflect that in the dataset. In this paper, we consider the task of fair clustering for scenarios involving multiple
multi-valued or numeric sensitive attributes. We propose a fair
clustering method, FairKM (Fair K-Means), that is inspired by
the popular K-Means clustering formulation. We outline a computational notion of fairness which is used along with a cluster
coherence objective, to yield the FairKM clustering method. We
empirically evaluate our approach, wherein we quantify both
the quality and fairness of clusters, over real-world datasets. Our
experimental evaluation illustrates that the clusters generated by
FairKM fare significantly better on both clustering quality and
fair representation of sensitive attribute groups compared to the
clusters from a state-of-the-art baseline fair clustering method.
Original languageEnglish
Title of host publicationInternational Conference on Extending Database Technology: Proceedings
Pages287-298
Number of pages12
ISBN (Electronic)978-3-89318-083-7
Publication statusPublished - 31 Mar 2020
EventEDBT/ICDT 2020 Joint Conference - Copenhagen, Copenhagen, Denmark
Duration: 30 Mar 202002 Apr 2020
https://diku-dk.github.io/edbticdt2020/

Publication series

NameAdvances in Database Technology: Proceedings
ISSN (Electronic)2367-2005

Conference

ConferenceEDBT/ICDT 2020 Joint Conference
Country/TerritoryDenmark
CityCopenhagen
Period30/03/202002/04/2020
Internet address

Fingerprint

Dive into the research topics of 'Fairness in Clustering with Multiple Sensitive Attributes'. Together they form a unique fingerprint.

Cite this