Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Self-Supervised Learning of Face Representations for Video Face Clustering (1903.01000v1)

Published 3 Mar 2019 in cs.CV and cs.LG

Abstract: Analyzing the story behind TV series and movies often requires understanding who the characters are and what they are doing. With improving deep face models, this may seem like a solved problem. However, as face detectors get better, clustering/identification needs to be revisited to address increasing diversity in facial appearance. In this paper, we address video face clustering using unsupervised methods. Our emphasis is on distilling the essential information, identity, from the representations obtained using deep pre-trained face networks. We propose a self-supervised Siamese network that can be trained without the need for video/track based supervision, and thus can also be applied to image collections. We evaluate our proposed method on three video face clustering datasets. The experiments show that our methods outperform current state-of-the-art methods on all datasets. Video face clustering is lacking a common benchmark as current works are often evaluated with different metrics and/or different sets of face tracks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Vivek Sharma (54 papers)
  2. Makarand Tapaswi (41 papers)
  3. M. Saquib Sarfraz (30 papers)
  4. Rainer Stiefelhagen (155 papers)
Citations (49)

Summary

We haven't generated a summary for this paper yet.