Semantics-Aware Android Malware Classification

The drastic increase of Android malware has led to a strong interest in developing methods to automate the malware analysis process. Existing automated Android malware detection and classification methods fall into two general categories: (1) signature-based and (2) machine learning-based. Signature...

Full description

Saved in:

Bibliographic Details
Published in	Android Application Security pp. 19 - 43
Main Authors	Zhang, Mu, Yin, Heng
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2016 Springer International Publishing
Series	SpringerBriefs in Computer Science
Subjects	Android Malware Anomaly Detection Behavioral Graph Communications engineering / telecommunications Computer security Malware Instances Malware Samples Network hardware
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The drastic increase of Android malware has led to a strong interest in developing methods to automate the malware analysis process. Existing automated Android malware detection and classification methods fall into two general categories: (1) signature-based and (2) machine learning-based. Signature-based approaches can be easily evaded by bytecode-level transformation attacks. Prior learning-based works extract features from application syntax, rather than program semantics, and are also subject to evasion. In this paper, we propose a novel semantic-based approach that classifies Android malware via dependency graphs. To battle transformation attacks, we extract a weighted contextual API dependency graph as program semantics to construct feature sets. To fight against malware variants and zero-day malware, we introduce graph similarity metrics to uncover homogeneous application behaviors while tolerating minor implementation differences. We implement a prototype system, DroidSIFT, in 23 thousand lines of Java code. We evaluate our system using 2200 malware samples and 13,500 benign samples. Experiments show that our signature detection can correctly label 93 % of malware instances; our anomaly detector is capable of detecting zero-day malware with a low false negative rate (2 %) and an acceptable false positive rate (5.15 %) for a vetting purpose.
ISBN:	9783319478111 3319478117
ISSN:	2191-5768 2191-5776
DOI:	10.1007/978-3-319-47812-8_3