Improving CNN-based Person Re-identification using score Normalization (2307.00397v2)
Abstract: Person re-identification (PRe-ID) is a crucial task in security, surveillance, and retail analysis: it involves identifying an individual across multiple cameras and views. The task is challenging because illumination, background, and viewpoint change between cameras, so efficient feature extraction and metric learning algorithms are essential for a successful PRe-ID system. This paper proposes a novel approach for PRe-ID that combines Convolutional Neural Network (CNN) based feature extraction with Cross-view Quadratic Discriminant Analysis (XQDA) for metric learning. In addition, a matching algorithm is implemented that employs the Mahalanobis distance together with a score normalization process to address inconsistencies between camera scores. The proposed approach is tested on four challenging datasets: VIPeR, GRID, CUHK01, and PRID450S. Without normalization, the rank-20 accuracies on GRID, CUHK01, VIPeR, and PRID450S were 61.92%, 83.90%, 92.03%, and 96.22%, respectively; with score normalization, they increased to 64.64%, 89.30%, 92.78%, and 98.76%. These results indicate the effectiveness of the proposed approach.
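The matching stage described in the abstract (Mahalanobis distance followed by score normalization, evaluated by rank-k accuracy) can be illustrated with a minimal sketch. The abstract does not say which normalization is used, so z-score normalization is an assumption here. The metric matrix `M` is simply passed in (in the paper's pipeline it would come from XQDA, which is not implemented below), and the function names `mahalanobis_scores`, `zscore_normalize`, and `rank_k_accuracy` are hypothetical.

```python
import numpy as np

def mahalanobis_scores(probe, gallery, M):
    """Pairwise squared Mahalanobis distances d(x, y) = (x - y)^T M (x - y).

    probe:   (P, D) feature vectors from one camera view
    gallery: (G, D) feature vectors from the other camera view
    M:       (D, D) positive semi-definite metric matrix (assumed given)
    """
    diff = probe[:, None, :] - gallery[None, :, :]          # shape (P, G, D)
    return np.einsum("pgd,de,pge->pg", diff, M, diff)        # shape (P, G)

def zscore_normalize(scores, axis=0, eps=1e-12):
    """Z-score normalization of a score matrix along one axis.

    Assumption: z-score is only one possible choice of normalization.
    axis=0 normalizes each gallery column over all probes;
    axis=1 normalizes each probe row over all gallery items.
    """
    mean = scores.mean(axis=axis, keepdims=True)
    std = scores.std(axis=axis, keepdims=True)
    return (scores - mean) / (std + eps)

def rank_k_accuracy(scores, probe_ids, gallery_ids, k):
    """Fraction of probes whose true identity is among the k smallest distances."""
    top_k = np.argsort(scores, axis=1)[:, :k]                # (P, k) gallery indices
    hits = (gallery_ids[top_k] == probe_ids[:, None]).any(axis=1)
    return hits.mean()
```

One design point worth noting: normalizing each probe row on its own (`axis=1`) is a positive affine rescaling of that row, so it leaves every probe's ranking, and therefore the rank-k accuracy, unchanged. Normalizing along the gallery axis (`axis=0`) rescales each gallery item's scores across probes and can reorder a probe's candidate list, which is why the sketch defaults to it.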