CFP last date
15 May 2024
Reseach Article

Feature Mining from APK Files for Malware Detection

by Prerna Agrawal, Bhushan Trivedi
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 12 - Number 32
Year of Publication: 2020
Authors: Prerna Agrawal, Bhushan Trivedi
10.5120/ijais2020451874

Prerna Agrawal, Bhushan Trivedi . Feature Mining from APK Files for Malware Detection. International Journal of Applied Information Systems. 12, 32 ( August 2020), 6-10. DOI=10.5120/ijais2020451874

@article{ 10.5120/ijais2020451874,
author = { Prerna Agrawal, Bhushan Trivedi },
title = { Feature Mining from APK Files for Malware Detection },
journal = { International Journal of Applied Information Systems },
issue_date = { August 2020 },
volume = { 12 },
number = { 32 },
month = { August },
year = { 2020 },
issn = { 2249-0868 },
pages = { 6-10 },
numpages = {9},
url = { https://www.ijais.org/archives/volume12/number32/1094-2020451874/ },
doi = { 10.5120/ijais2020451874 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T19:10:36.972364+05:30
%A Prerna Agrawal
%A Bhushan Trivedi
%T Feature Mining from APK Files for Malware Detection
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 12
%N 32
%P 6-10
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The practice of using Machine Learning Methods in detecting Malware is growing massively. The prerequisite for implementing Machine Learning methods is the input of the dataset to it. A researcher needs to create a dataset of its own for performing Malware Detection using Machine Learning. Our dataset generation process includes Android File Collection, Decompilation, and Feature Mining Phases. We have already collected 15508 Malware Files and 4000 benign files in our Android File Collection phase and decompiled them in the Decompilation phase. Here we are discussing our Feature Mining Phase. So our goal in this paper is to select appropriate features for dataset generation. For the selection of proper features, we have also performed a Static Analysis process using online Malware Scanners. By using our static Analysis process we have selected a total of 215 features. Here we also propose the process of automating the Feature Mining from the APK files. We also have developed and implemented a Feature Mining Script in Python. Using the automated Feature Mining Script we have generated a final dataset of 16300 files. We have also discussed the working flow of feature mining script and in this paper.

References
  1. Prerna Agrawal, Bhushan Trivedi, “Unstructured Data Collection from APK files for Malware Detection”, International Journal of Computer Applications (IJCA), Vol 176, Issue 28, June 2020, pp. 42-45, ISBN 973-93-80901-12-5, ISSN 0975 – 8887, DOI 10.5120/ijca2020920308
  2. Prerna Agrawal, Bhushan Trivedi, "Automating the process of browsing and downloading APK Files as a prerequisite for the Malware Detection process ", International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), Vol 9, Issue 2, March - April 2020, pp. 013-017, ISSN 2278-685.
  3. Prerna Agrawal, Bhushan Trivedi, “Machine Learning Classifiers for Android Malware Detection”, 4th International Conference on Data Management, Analytics and Innovation (ICDMAI) Springer AISC Series, New Delhi, Jan 2020.
  4. Prerna Agrawal, Bhushan Trivedi, “Analysis of Android Malware Scanning Tools”, International Journal of Computer Sciences and Engineering, Vol.7, Issue.3, pp.807-810, Mar 2019.
  5. Prerna Agrawal, Bhushan Trivedi, “A Survey on Android Malware and their Detection Techniques”, Third International Conference on Electrical, Computer and Communication Technologies (ICECCT) IEEE, Feb 2019.
  6. “AVC UnDroid Online Scanner”, Online Link: https://undroid.av-comparatives.org
  7. “AndroTotal: Scan Android Application”, Online Link: http://andrototal.org.
  8. “VirusTotal: Analyse suspicious files”, Online Link: https://www.virustotal.com
  9. “NVISO ApkScan: Scan Android Applications for Malware”, Online Link: https://apkscan.nviso.be/
  10. “VirSCAN.org: Submit and scan your file”, Online Link: http://www.virscan.org
  11. “Hybrid Analysis Online Scanner”, Online Link: https://www.hybrid- analysis.com
  12. “Sandroid: Android Application Analysis System”, Online Link: http://sanddroid.xjtu.edu.cn/#overview
  13. “Machine Learning Datasets”, Online Link: https://figshare.com/articles/Android_malware_dataset_for_machine_learning_1/5854590/1
Index Terms

Computer Science
Information Sciences

Keywords

APK file Malware Dataset Android Machine Learning Feature Mining and Malware Detection