DL-Droid: Deep learning based android malware detection using real devices (1911.10113v1)

Published 22 Nov 2019 in cs.CR, cs.LG, cs.NE, and stat.ML

Abstract: The Android operating system has been the most popular for smartphones and tablets since 2012. This popularity has led to a rapid raise of Android malware in recent years. The sophistication of Android malware obfuscation and detection avoidance methods have significantly improved, making many traditional malware detection methods obsolete. In this paper, we propose DL-Droid, a deep learning system to detect malicious Android applications through dynamic analysis using stateful input generation. Experiments performed with over 30,000 applications (benign and malware) on real devices are presented. Furthermore, experiments were also conducted to compare the detection performance and code coverage of the stateful input generation method with the commonly used stateless approach using the deep learning system. Our study reveals that DL-Droid can achieve up to 97.8% detection rate (with dynamic features only) and 99.6% detection rate (with dynamic + static features) respectively which outperforms traditional machine learning techniques. Furthermore, the results highlight the significance of enhanced input generation for dynamic analysis as DL-Droid with the state-based input generation is shown to outperform the existing state-of-the-art approaches.

Citations (299)

View on Semantic Scholar

Summary

The paper proposes a novel deep learning system that leverages dynamic, stateful input generation on real devices to enhance Android malware detection.
It extracts a comprehensive set of 420 features and achieves up to 97.8% accuracy with dynamic features, reaching 99.6% when combined with static features.
The comparative study shows DL-Droid outperforms seven popular machine learning classifiers, demonstrating its practical impact in real-world cybersecurity.

DL-Droid: A Deep Learning Based System for Android Malware Detection

The paper "DL-Droid: Deep learning based android malware detection using real devices" addresses the significant challenge posed by Android malware, exacerbated by the rapid proliferation of Android devices and the sophistication of obfuscation techniques. The authors propose a novel approach leveraging deep learning to improve the detection of malicious Android applications through dynamic analysis executed on real devices.

The proposed DL-Droid system advances existing frameworks by integrating dynamic stateful input generation, markedly enhancing the efficiency of malware detection. The paper is grounded in comprehensive experimentation with over 30,000 applications, comprising both benign and malware applications, utilizing real devices to foster a realistic environment. This methodological choice is a departure from previous studies predominantly based on emulators and underlines the robustness of the proposed system against detection evasion techniques that target emulated environments.

Key results demonstrate that DL-Droid achieves a detection accuracy of up to 97.8% using solely dynamic features and an impressive 99.6% when combining dynamic and static features. These results underscore the superiority of DL-Droid over traditional machine learning techniques and highlight the efficacy of the stateful input generation approach in ensuring improved code coverage compared to stateless methods.

This research contributes significantly to the body of knowledge in Android malware detection in several ways:

Dynamic and Stateful Analysis: By employing dynamism and stateful input generation, the authors overcome the limitations of traditional static and stateless dynamic analysis techniques, which are often susceptible to obfuscation and code hiding strategies by malware developers.
Comprehensive Feature Extraction: The work extracts and utilizes 420 features, including application attributes, actions/events, and permissions. This comprehensive dataset facilitates better learning and detection performance.
Comparison and Superiority: DL-Droid's performance is benchmarked against seven popular machine learning classifiers. Notably, DL-Droid surpasses these classifiers, with the Random Forest algorithm being the closest competitor in terms of detection performance.
Real-World Application: The use of real devices for testing rather than emulators is a pragmatic approach that accounts for real-world conditions, improving the robustness and applicability of the findings.
Enhancements Over Existing Systems: The paper conducts an extensive comparative paper with existing deep learning solutions, demonstrating significant improvements in accuracy metrics, reinforcing DL-Droid's place as a leading framework in this domain.

From a theoretical perspective, the integration of deep learning within dynamic analysis frameworks showcases the evolving role of artificial intelligence in cybersecurity. Practically, the deployment of such systems can markedly enhance the security posture of Android ecosystems by preempting and mitigating emerging malicious threats.

Future research may explore the extension of DL-Droid's capabilities to encompass even broader datasets across varied Android device implementations and configurations. Additionally, continuous updates and refinements of the deep learning models are crucial to addressing the evolving tactics of malware developers, ensuring sustained efficacy and reliability in dynamic threat landscapes.

DL-Droid stands as a testament to the potential of combining deep learning with adept input generation techniques, illustrating a proficient approach to addressing pressing cybersecurity challenges in the prevalent Android domain. As the Android operating system continues to dominate, the necessity for sophisticated and resilient malware detection mechanisms like DL-Droid becomes increasingly critical.

PDF Markdown

DL-Droid: Deep learning based android malware detection using real devices (1911.10113v1)

Summary

DL-Droid: A Deep Learning Based System for Android Malware Detection

Related Papers