Google scholar arxiv informatics ads IJAIS publications are indexed with Google Scholar, NASA ADS, Informatics et. al.

Call for Paper


March Edition 2023

International Journal of Applied Information Systems solicits high quality original research papers for the March 2023 Edition of the journal. The last date of research paper submission is February 15, 2023.

Feature Mining from APK Files for Malware Detection

Prerna Agrawal, Bhushan Trivedi in Security

International Journal of Applied Information Systems
Year of Publication:2020
Publisher: Foundation of Computer Science (FCS), NY, USA
Authors:Prerna Agrawal, Bhushan Trivedi
Download full text
  1. Prerna Agrawal and Bhushan Trivedi. Feature Mining from APK Files for Malware Detection. International Journal of Applied Information Systems 12(32):6-10, August 2020. URL, DOI BibTeX

    	author = "Prerna Agrawal and Bhushan Trivedi",
    	title = "Feature Mining from APK Files for Malware Detection",
    	journal = "International Journal of Applied Information Systems",
    	issue_date = "August 2020",
    	volume = 12,
    	number = 32,
    	month = "August",
    	year = 2020,
    	issn = "2249-0868",
    	pages = "6-10",
    	url = "",
    	doi = "10.5120/ijais2020451874",
    	publisher = "Foundation of Computer Science (FCS), NY, USA",
    	address = "New York, USA"


The practice of using Machine Learning Methods in detecting Malware is growing massively. The prerequisite for implementing Machine Learning methods is the input of the dataset to it. A researcher needs to create a dataset of its own for performing Malware Detection using Machine Learning. Our dataset generation process includes Android File Collection, Decompilation, and Feature Mining Phases. We have already collected 15508 Malware Files and 4000 benign files in our Android File Collection phase and decompiled them in the Decompilation phase. Here we are discussing our Feature Mining Phase. So our goal in this paper is to select appropriate features for dataset generation. For the selection of proper features, we have also performed a Static Analysis process using online Malware Scanners. By using our static Analysis process we have selected a total of 215 features. Here we also propose the process of automating the Feature Mining from the APK files. We also have developed and implemented a Feature Mining Script in Python. Using the automated Feature Mining Script we have generated a final dataset of 16300 files. We have also discussed the working flow of feature mining script and in this paper.


  1. Prerna Agrawal, Bhushan Trivedi, “Unstructured Data Collection from APK files for Malware Detection”, International Journal of Computer Applications (IJCA), Vol 176, Issue 28, June 2020, pp. 42-45, ISBN 973-93-80901-12-5, ISSN 0975 – 8887, DOI 10.5120/ijca2020920308
  2. Prerna Agrawal, Bhushan Trivedi, "Automating the process of browsing and downloading APK Files as a prerequisite for the Malware Detection process ", International Journal of Emerging Trends & Technology in Computer Science (IJETTCS), Vol 9, Issue 2, March - April 2020, pp. 013-017, ISSN 2278-685.
  3. Prerna Agrawal, Bhushan Trivedi, “Machine Learning Classifiers for Android Malware Detection”, 4th International Conference on Data Management, Analytics and Innovation (ICDMAI) Springer AISC Series, New Delhi, Jan 2020.
  4. Prerna Agrawal, Bhushan Trivedi, “Analysis of Android Malware Scanning Tools”, International Journal of Computer Sciences and Engineering, Vol.7, Issue.3, pp.807-810, Mar 2019.
  5. Prerna Agrawal, Bhushan Trivedi, “A Survey on Android Malware and their Detection Techniques”, Third International Conference on Electrical, Computer and Communication Technologies (ICECCT) IEEE, Feb 2019.
  6. “AVC UnDroid Online Scanner”, Online Link:
  7. “AndroTotal: Scan Android Application”, Online Link:
  8. “VirusTotal: Analyse suspicious files”, Online Link:
  9. “NVISO ApkScan: Scan Android Applications for Malware”, Online Link:
  10. “ Submit and scan your file”, Online Link:
  11. “Hybrid Analysis Online Scanner”, Online Link: https://www.hybrid-
  12. “Sandroid: Android Application Analysis System”, Online Link:
  13. “Machine Learning Datasets”, Online Link:


APK file, Malware, Dataset, Android, Machine Learning, Feature Mining, and Malware Detection