Papers
Topics
Authors
Recent
2000 character limit reached

DSFD: Dual Shot Face Detector (1810.10220v3)

Published 24 Oct 2018 in cs.CV

Abstract: In this paper, we propose a novel face detection network with three novel contributions that address three key aspects of face detection, including better feature learning, progressive loss design and anchor assign based data augmentation, respectively. First, we propose a Feature Enhance Module (FEM) for enhancing the original feature maps to extend the single shot detector to dual shot detector. Second, we adopt Progressive Anchor Loss (PAL) computed by two different sets of anchors to effectively facilitate the features. Third, we use an Improved Anchor Matching (IAM) by integrating novel anchor assign strategy into data augmentation to provide better initialization for the regressor. Since these techniques are all related to the two-stream design, we name the proposed network as Dual Shot Face Detector (DSFD). Extensive experiments on popular benchmarks, WIDER FACE and FDDB, demonstrate the superiority of DSFD over the state-of-the-art face detectors.

Citations (383)

Summary

  • The paper presents DSFD, a dual shot face detection framework that enhances feature representation using a Feature Enhance Module.
  • It employs Progressive Anchor Loss to optimize small face detection and uses Improved Anchor Matching to boost detection of occluded faces.
  • Empirical results on WIDER FACE and FDDB show high AP scores, indicating significant advancements in real-world face detection performance.

Dual Shot Face Detector: An Expert Review

Introduction to DSFD

The "DSFD: Dual Shot Face Detector" introduces a robust framework for face detection, addressing challenges in feature learning, loss design, and anchor matching. The Dual Shot Face Detector (DSFD) leverages a unique two-stage architecture that significantly enhances the ability of neural networks to detect faces with high variability in real-world scenarios. By employing a Feature Enhance Module (FEM), Progressive Anchor Loss (PAL), and Improved Anchor Matching (IAM), DSFD showcases substantial improvements over traditional face detection methods.

Core Components of DSFD

Feature Enhance Module (FEM)

FEM is designed to produce more discriminative and robust features by extending the capabilities of a Single Shot Detector (SSD) to incorporate dual-shot detection. The module effectively captures complex semantic information through the interaction of feature map cells, enhancing the network's performance without altering its computational efficiency. Figure 1

Figure 1: Illustration on Feature Enhance Module, in which the current feature map cell interacts with neighbors in current and up feature maps.

Progressive Anchor Loss (PAL)

The PAL mechanism involves computing auxiliary supervision loss with two distinct sets of anchors. This allows the network to adjust dynamically and optimize the detection of smaller faces using a progressive strategy. This dual-stage approach provides a nuanced method for improving classification accuracy through progressively staged learning objectives, tailored for various scales of anchor sizes.

Improved Anchor Matching (IAM)

In DSFD, the IAM strategy integrates an advanced anchor assignment methodology which enhances the positive anchor match rate for small and occluded faces. By enabling better initialization for the regression network, IAM leads to improvements in detection accuracy. This method effectively increases the recall of challenging faces through enhanced anchor matching fidelity.

Empirical Evaluation

The DSFD model is empirically validated against several leading facial detection benchmarks, notably WIDER FACE and FDDB. The empirical results highlight DSFD's dominance in precision metrics across various difficulty levels. Figure 2

Figure 2: Precision-recall curves on WIDER FACE validation and testing subset.

  • WIDER FACE Dataset: DSFD achieved AP scores of 96.6% (Easy), 95.7% (Medium), and 90.4% (Hard) on the validation set.
  • FDDB Dataset: Demonstrated cutting-edge accuracy, achieving 99.1% and 86.2% precision in respective tests.

Implications and Future Prospects

DSFD's architectural innovations push the boundaries of robust face detection, offering a framework that adapts to dynamic scales and occlusions. The dual shot strategy combined with feature enhancement modules positions DSFD as a competitive tool in real-time face detection applications, where variance in illumination, pose, and size is prevalent. Future research could explore the integration of DSFD with real-time systems or other domains requiring precise object localization such as autonomous navigation. Figure 3

Figure 3: Illustration of our DSFD's robustness to various large variations on scale, pose, occlusion, blurry, makeup, illumination, modality, and reflection.

Conclusion

DSFD establishes a prominent methodological advancement in face detection, underpinned by its innovative use of feature enhancement and anchor strategy. The documented improvements in detection accuracy and computational efficiency underscore its significance and broad applicability within computer vision fields. The culmination of techniques within DSFD offers new avenues for enhancing deep learning-based object detection frameworks, with implications that extend beyond facial recognition alone.

Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.