A Meta-Stacked Software Bug Prognosticator Classifier

Authors

  • Ajay Kumar Shrivastava  M.Tech Research Scholar, Jharkhand Rai University, Ranchi, Jharkhand, India
  • Dr. Ekbal Rashid  Department of CSE, Aurora’s Technological and Research Institute, Uppal, Hyderabad, Telangana, India

Keywords:

Stacked Regression, Bug Prediction, Cost Estimation, Rayleigh defect density, Software Project Bugs

Abstract

Predicting defects defines the proactive process of classifying the defects that can be found in entire software’s content, within and cross-project codes for producing high quality product with optimized cost. Error prediction in open source software is more crucial due to its inherent complexity and the large repository of contributors. In this paper we present the meta-stacked regression model (MSRM) which improvises the Rayleigh Probabilistic distribution for feature selection estimates. Firstly, a heuristic bug mining approach is adopted to mine the parameters reported by developers and contributors of various Open source projects (Bugzilla, Eclipse, Mozilla) activity logs. In the second part, Stacked Regression is compared to Neural Networks and Linear Support Vector Machine models in terms of the bug prediction performance with Feature importance and Correlation amongst parameters. The results show that the ensemble based Stacked regression has better precision and F-measure compared to simple machine learning models. The MSRM model accurately predicts and classifies bugs with accuracy of 96.8% and reduces the impact of false positives by recall of 71.2%.

References

  1. X. Huo, M. Li, and Z.-H. Zhou, "Learning unified features from natural and programming languages for locating buggy source code," in Proceedings of IJCAI'2016
  2. S. Wang, T. Liu, and L. Tan, "Automatically learning semantic features for defect prediction," in ICSE'16: Proc. of the International Conference on Software Engineering, 2016
  3. V. Raychev, M. Vechev, and E. Yahav, "Code completion with statistical language models," in ACM SIGPLAN Notices, vol. 49, no. 6. ACM, 2014, pp. 419-428.
  4. Jian Li, Pinjia He, Jieming Zhu, and Michael R. Lyu.2017. "Software Defect Prediction via Convolutional Neural Network" at IEEE International Conference on Software Quality, Reliability and Security, 2017
  5. Z. He, F. Peters, T. Menzies, and Y. Yang, "Learning from open-source projects: An empirical study on defect prediction," in ESEM'13: Proc. of the International Symposium on Empirical Software Engineering and Measurement, 2013.
  6. N. A. Abou-Elheggag, Estimation for Rayleigh distribution using progressive first-failure censored data, Journal of Statistics Applications and Probability 2(2) (2013) 171-182.
  7. J. Wang, B. Shen, and Y. Chen, "Compressed c4. 5 models for software defect prediction," in QSIC'12: Proc. of the International Conference on Quality Software, 2012.
  8. T. Gyimothy, R. Ferenc, I. Siket, "Empirical Validation of Object-Oriented Metrics on Open Source Software for Fault Prediction", IEEE Transactions on Software Engineering, vol. 31, no.10, pp. 897-910, 2005.
  9. S.W. Haider, J.W. Cangussu, K.M.L. Cooper, R. Dantu, "Estimation of Defects Based on Defect Decay Model: ED3M", IEEE Transactions on Software Engineering, vol. 34, no. 3, pp. 336-356, 2008.
  10. R. M. El-Sagheer, Inferences using type-II progressively censored data with binomial removals, Arabian Journal of Mathematics 4 (1) (2015) 127-139.
  11. K. Herzig, S. Just, and A. Zeller. It's not a bug, it's a feature: how misclassification impacts bug prediction. In ICSE'13, pages 392-401.

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
Ajay Kumar Shrivastava, Dr. Ekbal Rashid, " A Meta-Stacked Software Bug Prognosticator Classifier, International Journal of Scientific Research in Science and Technology(IJSRST), Online ISSN : 2395-602X, Print ISSN : 2395-6011, Volume 4, Issue 5, pp.111-117, March-April-2018.