Enhancing Cyber Security : A Study of Data Preprocessing Techniques for Cyber Security Datasets
DOI:
https://doi.org/10.32628/IJSRST2411427Keywords:
Cyber Security, Feature Selction, Outliers, NormalizationAbstract
In today's fast-changing digital world, cybersecurity is a critical concern because of heightened frequency and sophistication of cyber threats. As a result, the need for effective data preprocessing techniques has become increasingly essential for processing and analyzing cybersecurity datasets in order to identify and mitigate potential risks. The study begins by outlining the unique characteristics of cybersecurity datasets, including their high dimensionality, imbalanced class distribution, and presence of noise and outliers. Subsequently, it examines a range of preprocessing techniques such as data cleaning, transformation, normalization, and feature selection, highlighting their applicability and effectiveness in the context of cybersecurity. It gives systematic analysis of different preprocessing detection, feature selection, and normalization. (Brightwood & Seraphina Brightwood, 2024) By implementing appropriate data preprocessing techniques, cybersecurity professionals can enhance the accuracy and effectiveness of their predictive models, intrusion detection systems, and other cybersecurity methods such as data cleaning, outlier solutions.
Downloads
References
Alshaibi, A., Al-Ani, M., Al-Azzawi, A., Konev, A., & Shelupanov, A. (2022). The Comparison of Cybersecurity Datasets. In Data (Vol. 7, Issue 2). MDPI. https://doi.org/10.3390/data7020022
Brightwood, S., & Seraphina Brightwood, A. (2024). Data Preprocessing and Feature Engineering for Cyber Threat Detection. https://www.researchgate.net/publication/379078896
Srivastava, D., Singh, R., Chakraborty, C., Maakar, S. K., Makkar, A., & Sinwar, D. (2024). A framework for detection of cyber attacks by the classification of intrusion detection datasets. Microprocessors and Microsystems, 105. https://doi.org/10.1016/j.micpro.2023.104964
Werner de Vargas, V., Schneider Aranda, J. A., dos Santos Costa, R., da Silva Pereira, P. R., & Victória Barbosa, J. L. (2023). Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowledge and Information Systems, 65(1), 31–57. https://doi.org/10.1007/s10115-022-01772-8
Wei,L.,Fang,Q.,” A Data Preprocessing Algorithim for ClassificationModel Based On Rough Sets”, International Conference on Solid State Devices and Material Science, ELSEVIER, pp. 25-29,2012.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 International Journal of Scientific Research in Science and Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.