CFP last date
15 December 2023
Call for Paper
January Edition
IJAIS solicits high quality original research papers for the upcoming January edition of the journal. The last date of research paper submission is 15 December 2023

Reseach Article

Data Mining Techniques for Predicting Immunize-able Diseases: Nigeria as a Case Study

by Adebayo Peter Idowu, Bernard Ijesunor Akhigbe, Olajide Olusegun Adeosun, Aderonke Anthonia Kayode, Adekemi Faidat Osungbade
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 5 - Number 7
Year of Publication: 2013
Authors: Adebayo Peter Idowu, Bernard Ijesunor Akhigbe, Olajide Olusegun Adeosun, Aderonke Anthonia Kayode, Adekemi Faidat Osungbade

Adebayo Peter Idowu, Bernard Ijesunor Akhigbe, Olajide Olusegun Adeosun, Aderonke Anthonia Kayode, Adekemi Faidat Osungbade . Data Mining Techniques for Predicting Immunize-able Diseases: Nigeria as a Case Study. International Journal of Applied Information Systems. 5, 7 ( May 2013), 5-15. DOI=10.5120/ijais12-450882

@article{ 10.5120/ijais12-450882,
author = { Adebayo Peter Idowu, Bernard Ijesunor Akhigbe, Olajide Olusegun Adeosun, Aderonke Anthonia Kayode, Adekemi Faidat Osungbade },
title = { Data Mining Techniques for Predicting Immunize-able Diseases: Nigeria as a Case Study },
journal = { International Journal of Applied Information Systems },
issue_date = { May 2013 },
volume = { 5 },
number = { 7 },
month = { May },
year = { 2013 },
issn = { 2249-0868 },
pages = { 5-15 },
numpages = {9},
url = { },
doi = { 10.5120/ijais12-450882 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
%0 Journal Article
%1 2023-07-05T17:58:45.579874+05:30
%A Adebayo Peter Idowu
%A Bernard Ijesunor Akhigbe
%A Olajide Olusegun Adeosun
%A Aderonke Anthonia Kayode
%A Adekemi Faidat Osungbade
%T Data Mining Techniques for Predicting Immunize-able Diseases: Nigeria as a Case Study
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 5
%N 7
%P 5-15
%D 2013
%I Foundation of Computer Science (FCS), NY, USA

Disease rates vary between different locations particularly in the rural areas. While a database of diseases occurrence could be easily found, studies have been limited to descriptive statistical analysis, and are mostly restricted to diseases affecting adults. This paper therefore presents a Mathematical Model (MM) for predicting immunize-able diseases that affect children between ages 0 - 5 years. The model was adapted and deployed for use in six (6) selected localized areas within Osun State in Nigeria. Using the MATLAB's ANN toolbox, the Statistics toolbox for classification and regression, and the Naïve Bayesian classifier the MM was developed. The MM is robust in that it takes advantage of three (3) data mining techniques: ANN, Decision Tree Algorithm and Naïve Bayes Classifier. These data mining techniques provided the means by which hidden information were discovered for detecting trends within databases, and thus facilitate the prediction of future disease occurrence in the tested locations. Results obtained showed that diseases have peak periods depending on their epidemicity, hence the need to adequately administer immunization to the right places at the right time. Therefore, this paper argues that using this model would enhance the effectiveness of routine immunization in Nigeria.

  1. Kumar, P. , and Clark, M. L. (2007). Kumar and Clark's clinical medicine, 7th Edition, Saunders Ltd, England
  2. McWhinney, I. R. (1987). Health and disease: problems of definition. CMAJ: Canadian Medical Association Journ -al, 136(8), 815.
  3. Stedmans Medical Dictionary, (2000) 28th Ed,Wolters Kluwer Health Company.
  4. Laxminarayan R. , Mills A. J. , and Breman J. G. , (2006). Advancement of global health: key messages from the Disease Control Priorities Project. Lancet, 367:1193-208. 2006.
  5. World Health Organization (2004a), "The global burden of disease": Public health programme, American journal of public Health 90,707-710.
  6. World Health Organization (2007), ''Annual report and statistics on immunization and diseases'' (http://www. who. int/ countries/nga/immunization/en/). Accessed May 13, 2010.
  7. World Health Organization (2004b), "Deaths by cause, sex and mortality stratum in WHO regions, estimates for 2002". http://www. who. int/research/en/). Accessed April20,2010.
  8. Fayyad, U. M. , Piatetsky-Shapiro, G. , Smyth, P. , and Uthurusamy, R. (1996). Advances in Knowledge Discovery and Data Mining, 4th Ed. Cambridge, MA: MIT Press.
  9. Thuraisingham, B. (2003). Web Data Mining Technologies and Their Applications in Business Intelligence and Counter-terrorism. CRC Press.
  10. Dunham, M. H. (2003). Classification. Data mining Introductory and Advanced Topics. Upper Saddle River, New Jersey, Prentice Hall: 93.
  11. Breiman, L. , Friedman, J. , Olsen, R. , & Stone, C. , (1984). Classification and Regression trees, Pacific Grove: Wadsworth.
  12. Smith, M. (1993). "Neural Networks for Statistical Modeling". Van Nostrand Reinhold Press.
  13. CRISP-DM. (2000). "CRoss Inductry Standard Process for Data Mining. " Retrieved 1 Mar, 2009, from http://www. crisp-dm. org/Process/index. html
  14. Giudici, P. (2003) Applied Data Mining: Statistical Methods for Business and Industry, 2nd Ed. New York: John Wiley.
  15. Kotsiantis, S. , and Pintelas, P. , (2005). Logitboost of Simple Bayesian Classifier, Computational Intelligence in Data mining". Special Issue of the Informatica Journal, Vol. 29 (1), 2005: 53–59.
  16. Verduijn, M. , Sacchi, L. , Peek, N. , Bellazzi, R. , de Jonge, E. , & de Mol, B. A. (2007). Temporal abstraction for feature extraction: A comparative case study in prediction from intensive care monitoring data. Artificial Intelligence in Medicine, 41(1), 1-12.
  17. Silva, Á. , Cortez, P. et al. (2008). "Rating organ failure via adverse events using data mining in the intensive care unit. " Artificial Intelligence in Medicine 43(3): 179-193.
  18. Bellazzi, R. , Larizza, C. , Magni, P. , Bellazzi, R. (2005). Temporal data mining for the quality assessment of hemodialysis services, Artificial Intelligence in Medicine 34, pgs 25–39.
  19. Adlassnig, K. P. , Combi, C. , Das, A. K. , Keravnou, E. T. , Pozzi, G. (2006). Temporal representation and reasoning in medicine: research directions and challenges, Artificial Intelligence in Medicine, 38, pgs 101–113.
  20. Stacey, M. , and McGregor, C. (2007). Temporal abstraction in intelligent clinical data analysis: A survey. Artificial Intelligence in Medicine, 39 1–24.
  21. Bellazzi, R. and B. Zupan (2008). "Predictive data mining in clinical medicine: Current issues and guidelines. " International Journal of Medical Informatics 77(2): 81-97.
  22. Pitt, E. (2009). Application of Data Mining Techniques in the Prediction of Coronary Artery Disease: Use of Anaesthesia Time-series and Patient Risk Factor Data. Unpublished Master's Thesis, Submitted to the School of Information Technology, School of Information Systems, Faculty of Science and Technology, Queensland University of Technology.
  23. Yeh, J. -Y. , Wu, T. -H. , and Tsao, C. -W. (2011). Using data mining techniques to predict hospitalization of hemodialysis patients. Decision Support Systems 50, 439–448 (Elsevier)
  24. Sundar, N. A. , Latha, P. , and Chandra, M. R. (2012). Performance Analysis of Classification Data Mining Techniques Over Heart Disease Data base. International journal of Engineering Science & Advanced Technology, Vol. 2, No. 3, pgs 470 – 478.
  25. Rajkumar, R. , Shim, K. J. , & Srivastava, J. (2010). Data Mining Based Predictive Models for Overall Health Indices. Technical Report submitted to the Department of Computer Science and Engineering, University of Minnesota, Minneapolis, USA.
  26. Rao, C. , Adair, T. , and Kinfu, Y. (2011). Using Historical Vital Statistics to Predict the Distribution of Under-Five Mortality by Cause Clinical Medicine & Research, Vol. 9, No. 2, pgs 66-74
  27. Delen, D. , Walker, G. , and Kadam, A. (2005). Predict breast cancer survivability: a comparison of three data mining methods, Artificial Intelligence in Medicine, Vol. 34 (2), pages 113-127.
  28. Cruz, J. A. , and Wishart, D. S. (2006). Applications of machine learning in cancer prediction and prognosis, Cancer Informatics, 2006(2):59–78
  29. Burke, H. , Rosen, D. , and Goodman, P. (1995), Comparing the Prediction Accuracy of Artificial Neural Networks and Other Statistical Models for Breast Cancer Survival, Advances in Neural Information Processing Systems, Vol. 7, pp. 1063--1067.
  30. Romeo, M. , F. Burden, M. Quinn, B. Wood and D. McNaughton. (1998),"Infrared Microspectroscopy And Artificial Neural Networks In The Diagnosis Of Cervical Cancer. ". U. S. National Library of Medicine National Institutes of Health ,Vol. 44(1),pp179-87.
  31. Ifeachor, E. , Sperduti, A. ,and Starita, A. , "Making the Distinction between Crohn's Disease and Ulcerative Colitis by Histopathological Examination: A Comparison of Human Performance, Logistic Regression and Adaptive Resonance Theory Mapping Neural Networks (ARTMAP)", In 3rd International Conference on Neural Networks and Expert Systems in Medicine and Healthcare, pp. 116--124.
  32. Hemalatha, M. , and Megala, S. (2011). Mining Techniques in Health Care: A Survey of Immunization. Journal of Theoretical and Applied Information Technology, Vol. 25, No2, pgs 63-70.
  33. Assamnew, S. (2011). Predicting the Outbreak of Measles Occurrence in Ethiopia using Data Mining Technology. Unpublished M. Sc Thesis submitted to School of Public Health and School of Information Science, School of Graduate Studies, Addis Ababa University, Ethiopia.
  34. Viaene, S. , Derrig, R. & Dedene, G. (2004). A Case Study of Applying Boosting Naive Bayes to Claim Fraud Diagnosis. IEEE Transactions on Knowledge and Data Engineering, 16(5):12-620.
  35. Haykin, S. (1994). Neural Networks: A Comprehensive Foundation, 2nd Ed. New York: Macmillan Publishing.
  36. Bishop, C. M. (1995). Neural Networks for Pattern Recognition, Oxford: Oxford University Press.
  37. Han, Jiawei and Micheline Kamber (2007). Data Mining: Concepts and Techniques, 5th Ed. San Francisco, CA: Morgan Kaufmann publishers.
  38. Clarence, N. W. T (2002). 'An Artificial Neural networks primer with Financial applications. Examples in financial Distress Predictions and Foreign Exchange Hybrid Trading System', Bond University, Gold coast.
  39. Kantardzic, M. (2011). Data Mining: Concepts, Models, Methods, and Algorithms. Wiley-IEEE Press.
  40. Witten, I. H. , Frank, E. , and Hall, M. A. (2011). Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. Morgan Kaufmann.
  41. Baradwaj, B. K. , and Pal, S. (2011). Mining Educational Data to Analyze Students? Performance. International Journal of Advanced Computer Science and Applications, Vol. 2, No. 6, pgs 63-69.
  42. Quinlan, J. R. (1986). "Introduction of decision tree", Machine learn, 1: pp. 86-106.
  43. Olsen, W. (2004). Triangulation in Social Research: Qualitative and Quantitative Methods Can Really Be Mixed. Developments in Sociology, ed. M. Holborn, Ormskirk: Causeway Press.
  44. Sikander, M. , and Mirza (2003) Introduction to Matlab: Resource book for students,1st Ed. , Pieas Publisher.
Index Terms

Computer Science
Information Sciences


Data mining techniques Immunize-able diseases MATLAB Databases Decision tree algorithm and Predictive model