Heart Disease Prediction Based on an Optimal Feature Selection Method using Autoencoder
DOI:
https://doi.org/10.32628/IJSRST20748Keywords:
Data Mining, Autoencoder, Hybrid, Classification Model, Dynamic Integration AlgorithmAbstract
Heart Failure is one of the common diseases that can lead to dangerous situations. There are several data available within the healthcare systems. However, there was an absence of successful analysis methods to find connections and patterns in health care data. Some Machine learning methods can help us remedy this circumstance. This helps in getting a better insight into the concept of a classification problem. In many classification problems, it is difficult to learn good classifiers before removing these unwanted features due to the huge size of the data. In my work, we have used an artificial neural network-based autoencoder for effective feature selection The aim of feature selection is improving prediction performance and providing a better understanding of the process data. Hybrid Classification method with a dynamic integration algorithm for classification that aims at finding optimal features by applying machine learning techniques resulting in improving the performance in the prediction of cardiovascular disease.
References
- Jabbar MA, Chandra P, Deekshatulu BL. Cluster-based association rule mining for heart attack prediction. Journal of Theoretical and Applied Information Technology. 2011; 32(2):197–201.
- Sudha A, Gayathiri P, Jaisankar N. Effective analysis and predictive model of stroke disease using classification methods. International Journal of Computer Applications. 2012; 43(14):26–31.
- Amin SU, Agarwal K, Beg R. Genetic neural network-based data mining in the prediction of heart disease using risk factor. Proceeding of IEEE Conference on Information and Communication Technologies (ICT); 2013 Apr. p. 1227– 31.
- Deepika N, Chandrashekar K. Association rule for classification of Heart Attack Patients. International Journal of Advanced Engineering Science and Technologies. 2011; 11(2):253–57.
- Sellappan Palaniappan and Rafiah Awang (2008): Intelligent Heart Disease Prediction System Using Data Mining Techniques, 978-1-4244- 1968- 5/08/ IEEE.
- Enhanced Prediction of Heart Disease with Feature Subset Selection using Genetic Algorithm M. ANBARASI, E. ANUPRIYA,N.CH.S.N.IYENGAR.-2012
- Feature Selection using Artificial Bee Colony for Cardiovascular Disease Classification B.Subanya, Dr.R.R.Rajalaxmi-2014
- “Analysis of data mining techniques for heart disease prediction,” 2016-2017, M. Sultana, A. Haider, and M. S. Uddin
- D. K. Srivastava and L. Bhambhu, “Data classification using support vector machine,” J.Theor. Appl. Inf. Technol., 2009
- N. Bhatia and C. Author, “Survey of Nearest Neighbor Techniques,” IJCSIS) Int. J. Comput. Sci. Inf. Secur., vol. 8, no. 2, pp. 302–305, 2010.
- T. M. Lakshmi, A. Martin, R. M. Begum, and V. P.Venkatesan, “An Analysis on Performance of Decision Tree Algorithms using Student’s Qualitative Data,” Int. J. Mod. Educ. Comput. Sci., vol. 5, no. 5, pp. 18–27, 2013.
- Accelerated PSO Swarm Search Feature Selection for Data Stream Mining Big Data Simon Fong, Raymond Wong, and Athanasios V. Vasilakos, Senior Member, IEEE
- T.Peter and K.Sonausundaram “An empirical study on prediction of heart disease using classification data minings techniques” in IEEE International Conference and Management-2012
- A. Khemphila and V.Boonjing “Comparing Performance of logistic regression decision trees and neural networks for classifying heart disease patients “in International Conference on Computer Information systems and Industrial Management Applications”
- Data Mining Techniques on Risk Prediction: Heart Disease, G. Purusothaman* and P. Krishnakumari, Indian Journal of Science and Technology-2015
- A Fuzzy Rule-based Approach to Predict Risk Level of Heart Disease, By Kantesh Kumar Oad & Xu Delhi-2014
- The Best Two Independent Measurements Are Not the Two Best”-1974, THOMAS. M. COVER
- Enhanced Prediction of Heart Disease by Genetic Algorithm and RBF Network-2015, A. Durga Devi
- E. J. Benjamin, P. Muntner, and et al. Alonso, Alvaro, ―Heart Disease and Stroke Statistics—2019 Update: A Report From the American Heart Association,‖ Circulation, vol. 139, no. 10, 2019.
- M. Ramaraj and T. A. Selvadoss, ―A Comparative Study of CN2 Rule and SVM Algorithm and Prediction of Heart Disease Datasets Using Clustering Algorithms,‖ Netw. Complex Syst., vol. 3, no. 10, pp. 1–6, 2013.
- A. Gavhane, G. Kokkula, I. Pandya, and P. K. Devadkar, ―Prediction of Heart Disease Using Machine Learning,‖ in Proceedings of the 2nd International Conference on Electronics, Communication and Aerospace Technology, ICECA 2018, 2018, pp. 1275–1278.
- C.-S. Lee and M.-H. Wang, ―A fuzzy expert system for diabetes decision support application.,‖ IEEE Trans. Syst. MAN, Cybern. B Cybern., vol. 41, no. 1, pp. 139–153, 2011.
- C. B. Rjeily, G. Badr, E. Hassani, A. H., and E. Andres, ―Medical Data Mining for Heart Diseases and the Future of Sequential Mining in Medical Field,‖ in Machine Learning Paradigms, 2019, pp. 71–99.
- K. Shameer, K. W. Johnson, B. S. Glicksberg, J. T. Dudley, and P. P. Sengupta, ―Machine learning in cardiovascular medicine: are we there yet? ‖ Heart, vol. 104, no. 14, pp. 1156–1164, 2018.
- A. M. Shahiri and W. Husain, "A review on predicting student's performance using data mining techniques," Procedia Computer Science, vol. 72, pp. 414-422, 2015.
- C. Romero and S. Ventura, "Educational data mining: a review of the state of the art," IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 40, pp. 601-618, 2010.
- M. Ramaswami and R. Bhaskaran, "A study on feature selection techniques in educational data mining," arXiv preprint ar X iv:0912.3924, 2009.
- A. Mueen, B. Zafar, and U. Manzoor, "Modeling and Predicting Students' Academic Performance Using Data Mining Techniques," International Journal of Modern Education and Computer Science, vol. 8, p. 36, 2016.
- W. Punlumjeak and N. Rachburee, "A comparative study of feature selection techniques for classify student performance," in Information Technology and Electrical Engineering (ICITEE), 2015 7th International Conference on, 2015, pp. 425-429.
- D. Koller and M. Sahami, "Toward optimal feature selection," Stanford InfoLab1996.
- P. Mitra, C. Murthy, and S. K. Pal, "Unsupervised feature selection using feature similarity," IEEE transactions on pattern analysis and machine intelligence, vol. 24, pp. 301-312, 2002.
- H. M. Harb and M. A. Moustafa, "Selecting an optimal subset of features for student performance model," Int J Comput Sci, p. 5,2012.
- A. Figueira, "Predicting Grades by Principal Component Analysis: A Data Mining Approach to Learning Analytics," in Advanced Learning Technologies (ICALT), 2016 IEEE 16th International Conference on, 2016, pp. 465-467.
- E. A. Amrieh, T. Hamtini, and I. Aljarah, "Mining Educational Data to Predict Student’s academic Performance using Ensemble Methods," International Journal of Database Theory and Application, vol. 9, pp. 119-136, 2016.
- K. Patel, J. Vala, and J. Pandya, "Comparison of various classification algorithms on iris datasets using WEKA," Int. J.Adv. Eng. Res. Dev. (IJAERD), vol. 1, 2014.
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, "The WEKA data mining software: an update,"ACM SIGKDD explorations newsletter, vol. 11, pp. 10-18,2009.
Webliography
- https://en.wikipedia.org/wiki/Feature_selection#/media/File: created on 7 march 2019 and accessed on 13 September 2019
- https://web.stanford.edu › class › handouts › CS276: Information Retrieval and Web Search Christopher Manning and Pandu Nayak accessed on 26 September 2019
- https://images.app.goo.gl/7SGi2Y6sHwwiFGNa9 created on 13 march 2029 and accessed on 10 April 2020
Downloads
Published
Issue
Section
License
Copyright (c) IJSRST

This work is licensed under a Creative Commons Attribution 4.0 International License.