Layered Defect Prediction Model for Software Product Lines Using Feature Fusion and Ensemble Classification
Subject Areas : Journal of Computer & Robotics
Mehdi Habibzadeh-khameneh
1
,
Akbar Nabiollahi Najafabadi
2
,
Reza Tavoli
3
,
Hamid Rastegari
4
1 -
2 -
3 -
4 -
Keywords: Software Product lines, Defect Prediction, Metaheuristic Algorithms, Ensemble Learning, Feature Fusion,
Abstract :
Software defect prediction plays a critical role in ensuring the quality and reliability of modern software systems, particularly within Software Product Lines (SPLs), where numerous feature combinations and collaborative development efforts often result in flawed feature selections and increased risk of defects. This study presents a novel three-layered architecture designed to enhance defect prediction accuracy. The architecture consists of: (1) dataset preprocessing and optimal feature extraction using metaheuristic algorithms, (2) feature fusion to enrich the input representation, and (3) final classification through a stacking-based ensemble learning strategy. The proposed method was rigorously evaluated on benchmark datasets from NASA (CM1, JM1, KC1, PC1) and the Linux Variable Analysis Tools (LVAT) repository (LTS1, LTM2, LTL3, LTV4). It achieved accuracy rates of 98.62%, 97.81%, 98.59%, and 98.71% on the NASA datasets, and 95.1%, 94.2%, 97.3%, and 99.4% on the LVAT datasets, respectively. These results demonstrate that the proposed approach consistently outperforms existing methods across diverse SPL scenarios.
1. Díaz, O., et al., Visualizing the customization endeavor in product-based-evolving software product lines: a case of action design research. Empirical Software Engineering, 2022. 27(3): p. 75.
2. Bhushan, M., S. Goel, and A. Kumar, Improving quality of software product line by analysing inconsistencies in feature models using an ontological rule‐based approach. Expert Systems, 2018. 35(3): p. e12256.
3. Raatikainen, M., J. Tiihonen, and T. Männistö, Software product lines and variability modeling: A tertiary study. Journal of Systems and Software, 2019. 149: p. 485-510.
4. Horcas, J.-M., M. Pinto, and L. Fuentes, A modular metamodel and refactoring rules to achieve software product line interoperability. Journal of Systems and Software, 2023. 197: p. 111579.
5. Henard, C., et al., Combining Multi-Objective Search and Constraint Solving for Configuring Large Software Product Lines. 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, 2015. 1: p. 517-528.
6. Jain, S. and A. Saha, Improving performance with hybrid feature selection and ensemble machine learning techniques for code smell detection. Science of Computer Programming, 2021. 212: p. 102713.
7. Mehta, S. and K.S. Patnaik, Improved prediction of software defects using ensemble machine learning techniques. Neural Computing and Applications, 2021. 33(16): p. 10551-10562.
8. Maâzoun, J., H. Ben-Abdallah, and N. Bouassida. Clustering techniques for software product line feature identification. in 2022 IEEE/ACS 19th International Conference on Computer Systems and Applications (AICCSA). 2022. IEEE.
9. Batool, I. and T.A. Khan, Software fault prediction using deep learning techniques. Software Quality Journal, 2023. 31(4): p. 1241-1280.
10. Azzeh, M., et al., Software Defect Prediction Using Non-Dominated Sorting Genetic Algorithm and k-Nearest Neighbour Classifier. e-Informatica Software Engineering Journal, 2024. 18(1).
11. Liu, G., et al., A feature selection method based on the Golden Jackal-Grey Wolf Hybrid Optimization Algorithm. Plos one, 2024. 19(1): p. e0295579.
12. Shanbhag, A., et al., Leveraging metaheuristics for feature selection with machine learning classification for malicious packet detection in computer networks. IEEE Access, 2024.
13. Potharlanka, J.L., Feature importance feedback with Deep Q process in ensemble-based metaheuristic feature selection algorithms. Scientific Reports, 2024. 14(1): p. 2923.
14. Nemati, Z., et al., Metaheuristic and Data Mining Algorithms-based Feature Selection Approach for Anomaly Detection. IETE Journal of Research, 2024: p. 1-15.
15. Ali, M., et al., Software defect prediction using an intelligent ensemble-based model. IEEE Access, 2024.
16. Bakro, M., et al., Building a cloud-IDS by hybrid bio-inspired feature selection algorithms along with random forest model. IEEE Access, 2024.
17. Ali, M., et al., Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning. PeerJ Computer Science, 2024. 10: p. e1860.
18. Singh, M. and J.K. Chhabra, Improved software fault prediction using new code metrics and machine learning algorithms. Journal of Computer Languages, 2024. 78: p. 101253.
19. Tang, Y., et al., A software defect prediction method based on learnable three-line hybrid feature fusion. Expert Systems with Applications, 2024. 239: p. 122409.
20. Siddiqui, T. and M. Mustaqeem, Performance evaluation of software defect prediction with NASA dataset using machine learning techniques. International Journal of Information Technology, 2023. 15(8): p. 4131-4139.
21. Guo, J., et al., SMTIBEA: a hybrid multi-objective optimization algorithm for configuring large constrained software product lines. Software & Systems Modeling, 2019. 18: p. 1447-1466.
22. Afzal, U., T. Mahmood, and S. Usmani, Evolutionary Computing to solve product inconsistencies in Software Product Lines. Science of Computer Programming, 2022. 224: p. 102875.
23. Canaparo, M., E. Ronchieri, and G. Bertaccini, Software defect prediction: A study on software metrics using statistical and machine learning methods. 2022. 020.
24. Czibula, G., Z. Onet-Marian, and I. Czibula, Software defect prediction using relational association rule mining. Information Sciences: an International Journal, 2014. 264: p. 260-278.
25. Singh, L.K., et al., Emperor penguin optimization algorithm-and bacterial foraging optimization algorithm-based novel feature selection approach for glaucoma classification from fundus images. Soft Computing, 2023: p. 1-37.
26. Oueslati, R. and G. Manita. Software Defect Prediction Using Integrated Logistic Regression and Fractional Chaotic Grey Wolf Optimizer. in ENASE. 2024.
27. SaiSindhuTheja, R. and G.K. Shyam, An efficient metaheuristic algorithm based feature selection and recurrent neural network for DoS attack detection in cloud computing environment. Applied Soft Computing, 2021. 100: p. 106997.