A Novel Developed Supervised Machine Learning System For Classification And Prediction of Software Faults Using NASA Dataset

Main Article Content

Nikita Gupta
Ripu Ranjan Sinha

Abstract

The software systems of modern computers are extremely complex and versatile. Therefore, it is essential to regularly detect and correct software design faults. In order to devote resources effectively towards the creation of trustworthy software, software companies are increasingly engaging in the practise of predicting fault-prone modules in advance of testing. These software fault prediction methods rely on the thoroughness with which prior software versions' fault as well as related code has been retrievedTime, energy, and money are all saved as a result. Increases the company's initial success and bottom line greatly by satisfying its clientele. Numerous academics have poured into this area throughout the years in an effort to raise the bar for all software. Nowadays, The most often used approaches in this field are those based on machine learning (ML). The field of ML seeks to perfect software capable of evolving as well as adapting in response to fresh data. This paper introduces a fresh approach for doing ML by bringing together a number of different expert systems. In order to reach agreement on which aspects of a software system need to be tested, the proposed multi-classifier model pools the strengths of the most effective classifiers. Several top-performing classifiers for defect prediction are put through their paces in an experiential evaluation. We test our method on 16 publicly available datasets from the NASA Metric Data Programme (MDP) repository at the promise repository. Parameters of confusion, recall, precision, recognition accuracy, etc., are evaluated and contrasted with existing schemes in a software analysis performed with the help of the python simulation tool with findings. The experimental outcomes demonstrate that by combining LGBM, XGBoost, and Voting classifiers, using a multi classifier approach, we are capable to significantly improve software fault prediction performance. The results of the investigation show that the suggested method will lead to better practical outcomes in the prediction of device failures.

Article Details

How to Cite
Gupta, N. ., & Sinha, R. R. . (2023). A Novel Developed Supervised Machine Learning System For Classification And Prediction of Software Faults Using NASA Dataset. International Journal on Recent and Innovation Trends in Computing and Communication, 11(10s), 715–729. https://doi.org/10.17762/ijritcc.v11i10s.7710
Section
Articles

References

J. Xu, D. Ho, and L. F. Capretz, “An Empirical Study on the Procedure to Derive Software Quality Estimation Models,” Int. J. Comput. Sci. Inf. Technol., 2010, doi: 10.5121/ijcsit.2010.2401.

S. Aleem, L. F. Capretz, and F. Ahmed, “Benchmarking Machine Learning Techniques for Software Defect Detection,” Int. J. Softw. Eng. Appl., 2015, doi: 10.5121/ijsea.2015.6302.

M. W. Thant and N. T. T. Aung, “Software Defect Prediction using Hybrid Approach,” in 2019 International Conference on Advanced Information Technologies (ICAIT), 2019, pp. 262–267. doi: 10.1109/AITC.2019.8921374.

A. Alsaeedi and M. Z. Khan, “Software Defect Prediction Using Supervised Machine Learning and Ensemble Techniques: A Comparative Study,” J. Softw. Eng. Appl., 2019, doi: 10.4236/jsea.2019.125007.

M. Caulo and G. Scanniello, “A Taxonomy of Metrics for Software Fault Prediction,” in 2020 46th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), 2020, pp. 429–436. doi: 10.1109/SEAA51224.2020.00075.

C. Catal, “Software fault prediction: A literature review and current trends,” Expert Systems with Applications. 2011. doi: 10.1016/j.eswa.2010.10.024.

T. Hall, S. Beecham, D. Bowes, D. Gray, and S. Counsell, “A systematic literature review on fault prediction performance in software engineering,” IEEE Transactions on Software Engineering. 2012. doi: 10.1109/TSE.2011.103.

K. Tumer and J. Ghosh, “Analysis of decision boundaries in linearly combined neural classifiers,” Pattern Recognit., vol. 29, no. 2, pp. 341–348, 1996, doi: https://doi.org/10.1016/0031-3203(95)00085-2.

P. Singh and S. Verma, “Multi-classifier model for software fault prediction,” Int. Arab J. Inf. Technol., 2018.

M. Z. Mohammed and I. A. Saleh, “Predicted of Software Fault Based on Random Forest and K-Nearest Neighbor,” in 2022 4th International Conference on Advanced Science and Engineering (ICOASE), 2022, pp. 43–48. doi: 10.1109/ICOASE56293.2022.10075596.

T. M. Phuong Ha, D. Hung Tran, L. E. T. My Hanh, and N. Thanh Binh, “Experimental Study on Software Fault Prediction Using Machine Learning Model,” in 2019 11th International Conference on Knowledge and Systems Engineering (KSE), 2019, pp. 1–5. doi: 10.1109/KSE.2019.8919429.

K. Phung, E. Ogunshile, and M. Aydin, “A Novel Software Fault Prediction Approach To Predict Error-type Proneness in the Java Programs Using Stream X-Machine and Machine Learning,” in 2021 9th International Conference in Software Engineering Research and Innovation (CONISOFT), 2021, pp. 168–179. doi: 10.1109/CONISOFT52520.2021.00032.

C. Anjali, J. P. M. Dhas, and J. A. P. Singh, “A Study on Predicting Software Defects with Machine Learning Algorithms,” in 2022 International Conference on Intelligent Innovations in Engineering and Technology (ICIIET), 2022, pp. 195–198. doi: 10.1109/ICIIET55458.2022.9967593.

E. Elahi, S. Kanwal, and A. N. Asif, “A new Ensemble approach for Software Fault Prediction,” in 2020 17th International Bhurban Conference on Applied Sciences and Technology (IBCAST), 2020, pp. 407–412. doi: 10.1109/IBCAST47879.2020.9044596.

A. Joon, R. Kumar Tyagi, and K. Kumar, “Noise Filtering and Imbalance Class Distribution Removal for Optimizing Software Fault Prediction using Best Software Metrics Suite,” in 2020 5th International Conference on Communication and Electronics Systems (ICCES), 2020, pp. 1381–1389. doi: 10.1109/ICCES48766.2020.9137899.

S. R. Aziz, T. Khan, and A. Nadeem, “Experimental validation of inheritance metrics’ impact on software fault prediction,” IEEE Access, 2019, doi: 10.1109/ACCESS.2019.2924040.

K. Yang, H. Yu, G. Fan, X. Yang, S. Zheng, and C. Leng, “Software Defect Prediction Based on Fourier Learning,” 2018. doi: 10.1109/PIC.2018.8706304.

H. He, Y. Bai, E. A. Garcia, and S. Li, “ADASYN: Adaptive synthetic sampling approach for imbalanced learning,” 2008. doi: 10.1109/IJCNN.2008.4633969.

M. S. Zulfiker, N. Kabir, A. A. Biswas, T. Nazneen, and M. S. Uddin, “An in-depth analysis of machine learning approaches to predict depression,” Curr. Res. Behav. Sci., 2021, doi: 10.1016/j.crbeha.2021.100044.

D. Barber, Bayesian Reasoning and Machine Learning. 2012. doi: 10.1017/cbo9780511804779.

S. Geva and J. Sitte, “Adaptive Nearest Neighbor Pattern Classification,” IEEE Trans. Neural Networks, 1991, doi: 10.1109/72.80344.

S. J. Hong, “R-MINI: An iterative approach for generating minimal rules from examples,” IEEE Trans. Knowl. Data Eng., 1997, doi: 10.1109/69.634750.

E. W. T. Ngai, L. Xiu, and D. C. K. Chau, “Application of data mining techniques in customer relationship management: A literature review and classification,” Expert Syst. Appl., vol. 36, no. 2, Part 2, pp. 2592–2602, 2009, doi: https://doi.org/10.1016/j.eswa.2008.02.021.

J. R. Quinlan, “Generating production rules from decision trees,” Proc. Tenth Int. Jt. Conf. Artif. Intell., 1987.

Y. Ren, L. Zhang, and P. N. Suganthan, “Ensemble Classification and Regression-Recent Developments, Applications and Future Directions [Review Article],” IEEE Computational Intelligence Magazine. 2016. doi: 10.1109/MCI.2015.2471235.

A. M. S. Sathya Bama, “Identification of Default Payments of Credit Card Clients using Boosting Techniques,” Int. J. Recent Technol. Eng., 2020, doi: 10.35940/ijrte.f8897.038620.

M. Fatima and M. Pasha, “Survey of Machine Learning Algorithms for Disease Diagnostic,” J. Intell. Learn. Syst. Appl., 2017, doi: 10.4236/jilsa.2017.91001.

A. Gupta, S. Sharma, S. Goyal, and M. Rashid, “Novel XGBoost Tuned Machine Learning Model for Software Bug Prediction,” 2020. doi: 10.1109/ICIEM48762.2020.9160152.

and R. R. Purvika Bajaj, “SALES PREDICTION USING MACHINE LEARNING ALGORITHMS,” Int. Res. J. Eng. Technol., vol. 07, no. 06, pp. 1–7, 2020.

E. K. Ampomah, Z. Qin, and G. Nyame, “Evaluation of tree-based ensemble machine learning models in predicting stock price direction of movement,” Inf., 2020, doi: 10.3390/info11060332.

S. Tong, Haonan; Liu, Bin; Wang, “Benchmark data sets,” Mendeley Data, vol. V1, 2017, doi: 10.17632/923xvkk5mm.1.

T. J. Sayyad Shirabad, J. and Menzies, “The PROMISE Repository of Software Engineering Databases,” University of Ottawa, 2005.

Bhawana Verma, S. K.A. (2019). Design & Analysis of Cost Estimation for New Mobile-COCOMO Tool for Mobile Application. International Journal on Recent and Innovation Trends in Computing and Communication, 7(1), 27–34. https://doi.org/10.17762/ijritcc.v7i1.5222

S. K.A., Raj, A. ., Sharma, V., & Kumar, V. (2022). Simulation and Analysis of Hand Gesture Recognition for Indian Sign Language using CNN. International Journal on Recent and Innovation Trends in Computing and Communication, 10(4), 10–14. https://doi.org/10.17762/ijritcc.v10i4.5556