- The paper proposes Hierarchical Clustering with Hard-batch Triplet Loss (HCT), a novel method for fully unsupervised person re-identification that improves pseudo label quality.
- Empirical results demonstrate HCT achieves significantly improved performance, including 56.4% mAP on Market-1501, surpassing prior unsupervised methods.
- Because HCT requires no labeled source data, it offers a more scalable route to deploying person re-identification in real-world settings.
An Examination of Hierarchical Clustering with Hard-batch Triplet Loss for Person Re-identification
The paper presents a novel approach, Hierarchical Clustering with Hard-batch Triplet Loss (HCT), aimed at improving person re-identification (re-ID) performance in the fully unsupervised setting. Building on existing methodologies, the work focuses on raising the quality of the pseudo labels generated during unsupervised training, a factor that largely determines final model performance.
Key Contributions and Methodology
- Hierarchical Clustering Integration: The authors use a hierarchical clustering procedure to assign pseudo labels to the unlabeled target data, with the aim of exploiting latent sample similarities within the target dataset more effectively (see the first sketch after this list).
- Hard-batch Triplet Loss Optimization: A hard-batch triplet loss fine-tunes the model to better separate hard examples, mitigating the convergence problems that arise when closely related samples from different identities are difficult to tell apart and that traditional clustering methods handle poorly (see the second sketch after this list).
- Iterative PK Sampling: At each training cycle, PK sampling rebuilds the training batches from the freshly generated pseudo labels, drawing a fixed number of pseudo identities (P) and instances per identity (K). This keeps hard positive and hard negative samples available within every batch, supporting the hard-batch triplet loss and improving model robustness over successive iterations (see the third sketch after this list).
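
The pseudo-label assignment step can be illustrated with a short, hedged sketch. The code below is a minimal illustration rather than the authors' implementation: it assumes CNN features have already been extracted, uses scikit-learn's AgglomerativeClustering as a stand-in for the paper's hierarchical clustering procedure, and treats the target number of clusters as a hypothetical hyperparameter.

```python
# Minimal sketch: pseudo-label assignment via hierarchical clustering.
# Assumes `features` is an (N, D) array of CNN embeddings extracted from the
# unlabeled target images; AgglomerativeClustering is an illustrative stand-in,
# and `n_clusters` is a hypothetical hyperparameter.
import numpy as np
from sklearn.cluster import AgglomerativeClustering

def assign_pseudo_labels(features: np.ndarray, n_clusters: int) -> np.ndarray:
    """Group feature vectors into pseudo identities and return one label per sample."""
    features = features / np.linalg.norm(features, axis=1, keepdims=True)  # L2-normalize
    clusterer = AgglomerativeClustering(n_clusters=n_clusters, linkage="average")
    return clusterer.fit_predict(features)

# Toy usage: 1,000 samples with 2,048-dim features grouped into 500 pseudo identities.
features = np.random.randn(1000, 2048).astype(np.float32)
pseudo_labels = assign_pseudo_labels(features, n_clusters=500)
```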
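The hard-batch triplet loss can be sketched with a generic batch-hard formulation, assuming PyTorch: for each anchor in a PK batch, the farthest sample sharing its pseudo label and the closest sample with a different pseudo label are mined. This is not necessarily identical to the authors' exact implementation, and the margin value is a hypothetical default.

```python
# Minimal sketch: batch-hard ("hard-batch") triplet loss in PyTorch.
# For each anchor, mine the hardest positive (farthest same-label sample) and
# the hardest negative (closest different-label sample) within the batch.
import torch
import torch.nn.functional as F

def batch_hard_triplet_loss(embeddings: torch.Tensor,
                            labels: torch.Tensor,
                            margin: float = 0.3) -> torch.Tensor:
    dist = torch.cdist(embeddings, embeddings, p=2)        # pairwise Euclidean distances
    same = labels.unsqueeze(0) == labels.unsqueeze(1)      # True where pseudo labels match
    # Hardest positive: largest distance to a same-label sample
    # (self-distance is 0, so it never wins the max).
    hardest_pos = dist.masked_fill(~same, 0.0).max(dim=1).values
    # Hardest negative: smallest distance to a different-label sample.
    hardest_neg = dist.masked_fill(same, float("inf")).min(dim=1).values
    return F.relu(hardest_pos - hardest_neg + margin).mean()

# Toy usage on a PK batch of P=16 pseudo identities with K=4 instances each.
embeddings = F.normalize(torch.randn(64, 128), dim=1)
labels = torch.arange(16).repeat_interleave(4)
loss = batch_hard_triplet_loss(embeddings, labels)
```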
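Finally, PK sampling can be sketched as a simple batch generator that is rebuilt each training cycle from the current pseudo labels: every batch draws P pseudo identities and K instances per identity, so batch-hard mining always finds in-batch positives and negatives. The helper below is hypothetical and uses plain Python.

```python
# Minimal sketch: PK sampling over the current pseudo labels. Helper names are
# hypothetical; the generator is rebuilt whenever the pseudo labels are refreshed.
import random
from collections import defaultdict

def pk_batches(pseudo_labels, P=16, K=4, seed=0):
    """Yield lists of sample indices, each forming one P*K mini-batch."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for idx, label in enumerate(pseudo_labels):
        by_label[label].append(idx)
    # Keep only clusters with at least K samples so every anchor has positives.
    usable = [label for label, idxs in by_label.items() if len(idxs) >= K]
    rng.shuffle(usable)
    for start in range(0, len(usable) - P + 1, P):
        batch = []
        for label in usable[start:start + P]:
            batch.extend(rng.sample(by_label[label], K))
        yield batch

# Example: iterate over PK batches built from the latest clustering result.
# for batch_indices in pk_batches(pseudo_labels, P=16, K=4): ...
```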
Results and Analysis
HCT is validated on two prominent re-ID datasets, Market-1501 and DukeMTMC-reID, where it achieves a mean average precision (mAP) of 56.4% and 50.7%, respectively. These results substantially exceed existing state-of-the-art methods for fully unsupervised re-ID and even surpass many Unsupervised Domain Adaptation (UDA) methods that rely on labeled source data.
The comparative analysis underscores HCT's efficacy, particularly against Bottom-Up Clustering (BUC), which had previously dominated this line of research. HCT addresses BUC's main shortcoming, its inability to handle hard instances correctly, which tends to degrade model quality as training progresses.
Implications and Future Directions
The implications of HCT extend beyond its immediate application to fully unsupervised re-ID. By refining the generation of pseudo labels without requiring labeled source data, the HCT method provides a more scalable solution for deploying re-ID systems in practical, real-world environments where manual annotation is prohibitively expensive.
Theoretically, the use of hierarchical clustering and hard-batch triplet loss could inspire similar unsupervised learning techniques in other domains where sample similarity structure is a pivotal consideration. Future research could explore the integration of HCT with more advanced deep learning models or alternative feature extraction pipelines, potentially enhancing its scalability and adaptability to other complex identification tasks.
Continued development in this area may focus on further improving pseudo label quality, exploring more sophisticated sampling strategies, or incorporating learning-rate schedules that adapt to clustering convergence, extending unsupervised learning paradigms to a wider range of applications.