Balancing Fairness and Accuracy in Data-Restricted Binary Classification (2403.07724v1)

Published 12 Mar 2024 in cs.LG, cs.AI, cs.CY, and stat.ML

Abstract: Applications that deal with sensitive information may have restrictions placed on the data available to an ML classifier. For example, in some applications, a classifier may not have direct access to sensitive attributes, affecting its ability to produce accurate and fair decisions. This paper proposes a framework that models the trade-off between accuracy and fairness under four practical scenarios that dictate the type of data available for analysis. Prior works examine this trade-off by analyzing the outputs of a scoring function that has been trained to implicitly learn the underlying distribution of the feature vector, class label, and sensitive attribute of a dataset. In contrast, our framework directly analyzes the behavior of the optimal Bayesian classifier on this underlying distribution by constructing a discrete approximation of it from the dataset itself. This approach enables us to formulate multiple convex optimization problems, which allow us to answer the question: How is the accuracy of a Bayesian classifier affected in different data-restricting scenarios when constrained to be fair? Analysis is performed on a set of fairness definitions that include group and individual fairness. Experiments on three datasets demonstrate the utility of the proposed framework as a tool for quantifying the trade-offs among different fairness notions and their distributional dependencies.
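To make the abstract's central idea concrete, the following is a minimal sketch, not the paper's implementation. It assumes the feature vector has already been quantized into a small number of cells, builds an empirical joint distribution over (cell, label, sensitive attribute), and evaluates the unconstrained Bayes-optimal decision rule on that distribution with and without access to the sensitive attribute. The toy samples, the function names, and the choice of demographic parity as the reported fairness gap are all illustrative assumptions. The fairness-constrained convex optimization described in the abstract is not reproduced here.

```python
from collections import Counter

def discrete_joint(samples):
    """Empirical joint pmf over (cell, y, a) triples."""
    counts = Counter(samples)
    n = len(samples)
    return {key: c / n for key, c in counts.items()}

def decision_key(cell, a, use_sensitive):
    """What the classifier is allowed to see when deciding."""
    return (cell, a) if use_sensitive else (cell,)

def bayes_rule(pmf, use_sensitive=True):
    """Pick, for each visible key, the label with the larger joint mass."""
    mass = Counter()
    for (cell, y, a), p in pmf.items():
        mass[(decision_key(cell, a, use_sensitive), y)] += p
    keys = {k for (k, _) in mass}
    # Ties are broken toward label 0 (an arbitrary choice for this sketch).
    return {k: int(mass[(k, 1)] > mass[(k, 0)]) for k in keys}

def evaluate(pmf, rule, use_sensitive=True):
    """Return (accuracy, demographic parity gap) of a rule on the pmf."""
    acc = 0.0
    pos = Counter()  # P(yhat = 1, A = a)
    grp = Counter()  # P(A = a)
    for (cell, y, a), p in pmf.items():
        yhat = rule[decision_key(cell, a, use_sensitive)]
        acc += p * (yhat == y)
        pos[a] += p * yhat
        grp[a] += p
    rates = {a: pos[a] / grp[a] for a in grp}
    return acc, max(rates.values()) - min(rates.values())

# Hypothetical toy data: (cell, label, sensitive attribute) with counts.
toy_counts = {
    (0, 0, 0): 3, (0, 1, 0): 1, (1, 0, 0): 1, (1, 1, 0): 3,
    (0, 0, 1): 1, (0, 1, 1): 3, (1, 1, 1): 4,
}
samples = [key for key, c in toy_counts.items() for _ in range(c)]
pmf = discrete_joint(samples)

# Scenario with access to the sensitive attribute.
acc_aware, gap_aware = evaluate(pmf, bayes_rule(pmf, True), True)
# Scenario without access to the sensitive attribute.
acc_blind, gap_blind = evaluate(pmf, bayes_rule(pmf, False), False)

print(f"aware: accuracy={acc_aware:.4f}, parity gap={gap_aware:.4f}")
print(f"blind: accuracy={acc_blind:.4f}, parity gap={gap_blind:.4f}")
```

On this toy distribution, removing the sensitive attribute lowers the accuracy of the Bayes rule while also shrinking the demographic parity gap. This is only an illustration of the kind of trade-off the framework is meant to quantify, not a result from the paper.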
