Enhancing the accuracy of quality-based Web Services classification through dataset feature enhancement
mehdi nozad bonab
1
(
Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia, Iran
)
jafar tanha
2
(
Electrical and Computer Engineering Department, University of Tabriz, Tabriz, Iran
)
mohammad masdari
3
(
Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia, Iran
)
Keywords: Web services, classification, quality, feature engineering, machine learning,
Abstract :
Web services, which facilitate machine-to-machine communication over the Internet, require accurate classification for reliable and efficient service delivery. The classification significantly affects service discovery, recommendation systems, and service composition. Web service brokers assist users in selecting the most suitable service based on quality parameters. Currently, only a limited number of datasets focusing on web service quality are available. The QWS dataset, with nine quality features for services, is one of the most prominent datasets in this field. However, this dataset overlooks non-functional attributes such as Security, Interoperability, Scalability, and Robustness, which are essential for discovering web services. This paper proposes enhancing the QWS dataset by using feature engineering to create new features from existing ones. The experimental results on the SSL-WSC algorithm demonstrate that the proposed approach significantly improves the classification of web services. This is evidenced by the 5.05% increase in F1-Score, 5.69% increase in accuracy, and 6.92% increase in precision evaluation criteria.
[1] M. M. Amin, A. Sutrisman, D. Stiawan, E. Ermatita, M. Y. Alzahrani, and R. Budiarto, "Interoperability framework for integrated e-health services," Bulletin of Electrical Engineering and Informatics, vol. 9, no. 1, pp. 354-361, 2020.
[2] M. Masdari, M. Nozad Bonab, and S. Ozdemir, "QoS-driven metaheuristic service composition schemes: a comprehensive overview," Artificial Intelligence Review, vol. 54, pp. 3749-3816, 2021.
[3] H. Ye, B. Cao, Z. Peng, T. Chen, Y. Wen, and J. Liu, "Web services classification based on wide & Bi-LSTM model," IEEE Access, vol. 7, pp. 43697-43706, 2019.
[4] M. N. Bonab, J. Tanha, and M. Masdari, "A Semi-supervised Learning Approach to Quality-based Web Service Classification," IEEE Access, 2024.
[5] B. Al-Shargabi, S. Al-Jawarneh, and S. Hayajneh, "A cloudlet based security and trust model for e-government web services," Journal of Theoretical and Applied Information Technology, vol. 98, no. 1, pp. 27-37, 2020.
[6] G. Moritz, F. Golatowski, and D. Timmermann, "A lightweight SOAP over CoAP transport binding for resource constraint networks," in 2011 IEEE eighth international conference on mobile ad-hoc and sensor systems, 2011: IEEE, pp. 861-866.
[7] M. S. Das, A. Govardhan, and D. V. Lakshmi, "Classification of web services using data mining algorithms and improved learning model," TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 17, no. 6, pp. 3191-3202, 2019.
[8] E. Al-Masri and Q. H. Mahmoud, "Qos-based discovery and ranking of web services," in 2007 16th international conference on computer communications and networks, 2007: IEEE, pp. 529-534.
[9] S. L. Brunton, B. R. Noack, and P. Koumoutsakos, "Machine learning for fluid mechanics," Annual review of fluid mechanics, vol. 52, pp. 477-508, 2020.
[10] M. J. Kaur, V. P. Mishra, and P. Maheshwari, "The convergence of digital twin, IoT, and machine learning: transforming data into action," Digital twin technologies and smart cities, pp. 3-17, 2020.
[11] M. Hasnain, I. Ghani, M. F. Pasha, and S. R. Jeong, "Machine learning methods for trust-based selection of web services," KSII Transactions on Internet and Information Systems (TIIS), vol. 16, no. 1, pp. 38-59, 2022.
[12] J. Tanha, M. Van Someren, and H. Afsarmanesh, "Semi-supervised self-training for decision tree classifiers," International Journal of Machine Learning and Cybernetics, vol. 8, pp. 355-370, 2017.
[13] F. Nargesian, H. Samulowitz, U. Khurana, E. B. Khalil, and D. S. Turaga, "Learning Feature Engineering for Classification," in Ijcai, 2017, vol. 17, pp. 2529-2535.
[14] I. Guyon and A. Elisseeff, "An introduction to feature extraction," in Feature extraction: foundations and applications: Springer, 2006, pp. 1-25.
[15] V. Kumar and S. Minz, "Feature selection," SmartCR, vol. 4, no. 3, pp. 211-229, 2014.
[16] P. Rodriguez-Mier, C. Pedrinaci, M. Lama, and M. Mucientes, "An integrated semantic web service discovery and composition framework," IEEE transactions on services computing, vol. 9, no. 4, pp. 537-550, 2015.
[17] J. M. Kanter and K. Veeramachaneni, "Deep feature synthesis: Towards automating data science endeavors," in 2015 IEEE international conference on data science and advanced analytics (DSAA), 2015: IEEE, pp. 1-10.
[18] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Dropout: a simple way to prevent neural networks from overfitting," The journal of machine learning research, vol. 15, no. 1, pp. 1929-1958, 2014.
[19] Z. L. Chia, M. Ptaszynski, F. Masui, G. Leliwa, and M. Wroczynski, "Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection," Information Processing & Management, vol. 58, no. 4, p. 102600, 2021.
[20] I. Sommerville, Engineering software products. Pearson London, 2020.
[21] D. Thomas and A. Hunt, The Pragmatic Programmer: your journey to mastery. Addison-Wesley Professional, 2019.
[22] B. Burns, Designing distributed systems: patterns and paradigms for scalable, reliable services. " O'Reilly Media, Inc.", 2018.
[23] M.-T. Wu, "Confusion matrix and minimum cross-entropy metrics based motion recognition system in the classroom," Scientific Reports, vol. 12, no. 1, p. 3095, 2022.
[24] M. Hasnain, M. F. Pasha, I. Ghani, M. Imran, M. Y. Alzahrani, and R. Budiarto, "Evaluating trust prediction and confusion matrix measures for web services ranking," Ieee Access, vol. 8, pp. 90847-90861, 2020.
[25] M. Grandini, E. Bagli, and G. Visani, "Metrics for multi-class classification: an overview," arXiv preprint arXiv:2008.05756, 2020.
[26] S. Ruuska, W. Hämäläinen, S. Kajava, M. Mughal, P. Matilainen, and J. Mononen, "Evaluation of the confusion matrix method in the validation of an automated system for measuring feeding behaviour of cattle," Behavioural processes, vol. 148, pp. 56-62, 2018.