Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorTahir, Touseef
dc.contributor.authorGencel, Cigdem
dc.contributor.authorRasool, Ghulam
dc.contributor.authorUmer, Tariq
dc.contributor.authorRasheed, Jawad
dc.contributor.authorYeo, Sook Fern
dc.contributor.authorCevik, Taner
dc.date.accessioned2024-01-25T05:09:38Z
dc.date.available2024-01-25T05:09:38Z
dc.date.issued2023en_US
dc.identifier.citationTahir, T., Gencel, C., Rasool, G., Umer, T., Rasheed, J., Yeo, S. F., & Cevik, T. (2023). Early Software Defects Density Prediction: Training the International Software Benchmarking Cross Projects Data Using Supervised Learning. IEEE Access, 11, 141965-141986.en_US
dc.identifier.issn21693536
dc.identifier.urihttps://doi.org/10.1109/ACCESS.2023.3339994
dc.identifier.urihttps://hdl.handle.net/20.500.12294/4034
dc.description.abstractRecent reviews of the literature indicate the need for empirical studies on cross-project defect prediction (CPDP) that would allow aggregation of the evidence and improve predictive performance. Most empirical studies predict defects at granularity levels of method, class, file, and module/package during the coding phase, and thereby avoid external failure costs. The main goal of this study is to perform an empirical study on early defect prediction at the beginning of a project at the product level of granularity for using it as input in planning quality activities of the project. Hence, both internal and external failure costs could be avoided as much as possible through proper planning of quality. We first made a systematic mapping study (SMS) on secondary studies (literature reviews) on defect prediction to identify the most used datasets, the project attributes and metrics utilized as estimators, and the supervised learning methods employed for training the data. Then, we made an empirical study on defect density prediction using cross-project data. We collected 760 project data from the International Software Benchmarking (ISBSG) dataset version 11, which reported both defects and functional size attributes. We trained the prediction models using: i) the complete set of project attributes, ii) the individual attributes, and iii) multiple subsets of attributes. We employed classification and regression approaches of machine learning. The machine learning models are trained using original values of the dataset, and z-score and logged transformations of original values to explore the effects of data normalization on prediction. Most machine learning models trained on the z-score transformation of the dataset performed best for classifying defects. The Multilayer-Perceptron (Neural Network) model trained on the z-score transformation of complete dataset predicted defects with the highest F1-score of 0.89 using binary classification. The logged transformation and feature selection methods improved the results for multivariable regression. The multivariable regression predicted defects with the highest Root Mean Squared Error (RMSE) and R2 (r-squared) values of 0.4 and 0.9, respectively, with a subset of 11 features using logged transformation. The results of classification and regression approaches indicate that defects can be predicted with reasonable accuracy at the software product level using cross-project data. © 2013 IEEE.en_US
dc.language.isoengen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.ispartofIEEE Accessen_US
dc.identifier.doi10.1109/ACCESS.2023.3339994en_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectCross Projects Dataseten_US
dc.subjectDefect Predictionen_US
dc.subjectFault Predictionen_US
dc.subjectFeature Selectionen_US
dc.subjectISBSG Dataseten_US
dc.subjectMachine Learningen_US
dc.titleEarly Software Defects Density Prediction: Training the International Software Benchmarking Cross Projects Data Using Supervised Learningen_US
dc.typearticleen_US
dc.departmentMühendislik ve Mimarlık Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.authorid0000-0001-9653-5832en_US
dc.identifier.volume11en_US
dc.identifier.startpage141965en_US
dc.identifier.endpage141986en_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.institutionauthorCevik, Taner
dc.authorwosidAAD-9997-2022en_US
dc.authorscopusid37017221000en_US
dc.identifier.wosqualityQ1en_US
dc.identifier.wosWOS:001131617800001en_US
dc.identifier.scopus2-s2.0-85179821066en_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster