Enhancing Cyber Security : A Study of Data Preprocessing Techniques for Cyber Security Datasets

Authors

  • Gauri Dhongade School of Information Technology, MATS University, Raipur, Chattisgarh, India Author
  • Dr. Omprakash Chandrakar School of Information Technology, MATS University, Raipur, Chhattisgarh, India Author
  • Dr. Rajeshree Khande Department of Computer Science and Applications, Dr.Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India Author

DOI:

https://doi.org/10.32628/IJSRST2411427

Keywords:

Cyber Security, Feature Selction, Outliers, Normalization

Abstract

In today's fast-changing digital world, cybersecurity is a critical concern because of heightened frequency and sophistication of cyber threats. As a result, the need for effective data preprocessing techniques has become increasingly essential for processing and analyzing cybersecurity datasets in order to identify and mitigate potential risks. The study begins by outlining the unique characteristics of cybersecurity datasets, including their high dimensionality, imbalanced class distribution, and presence of noise and outliers. Subsequently, it examines a range of preprocessing techniques such as data cleaning, transformation, normalization, and feature selection, highlighting their applicability and effectiveness in the context of cybersecurity. It gives systematic analysis of different preprocessing detection, feature selection, and normalization. (Brightwood & Seraphina Brightwood, 2024) By implementing appropriate data preprocessing techniques, cybersecurity professionals can enhance the accuracy and effectiveness of their predictive models, intrusion detection systems, and other cybersecurity methods such as data cleaning, outlier solutions.

Downloads

Download data is not yet available.

References

Alshaibi, A., Al-Ani, M., Al-Azzawi, A., Konev, A., & Shelupanov, A. (2022). The Comparison of Cybersecurity Datasets. In Data (Vol. 7, Issue 2). MDPI. https://doi.org/10.3390/data7020022

Brightwood, S., & Seraphina Brightwood, A. (2024). Data Preprocessing and Feature Engineering for Cyber Threat Detection. https://www.researchgate.net/publication/379078896

Srivastava, D., Singh, R., Chakraborty, C., Maakar, S. K., Makkar, A., & Sinwar, D. (2024). A framework for detection of cyber attacks by the classification of intrusion detection datasets. Microprocessors and Microsystems, 105. https://doi.org/10.1016/j.micpro.2023.104964

Werner de Vargas, V., Schneider Aranda, J. A., dos Santos Costa, R., da Silva Pereira, P. R., & Victória Barbosa, J. L. (2023). Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowledge and Information Systems, 65(1), 31–57. https://doi.org/10.1007/s10115-022-01772-8

Wei,L.,Fang,Q.,” A Data Preprocessing Algorithim for ClassificationModel Based On Rough Sets”, International Conference on Solid State Devices and Material Science, ELSEVIER, pp. 25-29,2012.

Downloads

Published

07-09-2024

Issue

Section

Research Articles