Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fairness in Streaming Submodular Maximization: Algorithms and Hardness (2010.07431v2)

Published 14 Oct 2020 in cs.LG and cs.DS

Abstract: Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data. However, if datapoints have sensitive attributes such as gender or age, such machine learning algorithms, left unchecked, are known to exhibit bias: under- or over-representation of particular groups. This has made the design of fair machine learning algorithms increasingly important. In this work we address the question: Is it possible to create fair summaries for massive datasets? To this end, we develop the first streaming approximation algorithms for submodular maximization under fairness constraints, for both monotone and non-monotone functions. We validate our findings empirically on exemplar-based clustering, movie recommendation, DPP-based summarization, and maximum coverage in social networks, showing that fairness constraints do not significantly impact utility.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Marwa El Halabi (13 papers)
  2. Slobodan Mitrović (35 papers)
  3. Ashkan Norouzi-Fard (24 papers)
  4. Jakab Tardos (11 papers)
  5. Jakub Tarnawski (25 papers)
Citations (37)

Summary

We haven't generated a summary for this paper yet.