Six Fuzzy Morphology Methods for Roles of ‎Combines Standard Sentences Persian language

Rezaei, Haniye; Motameni, Homayun; Barzegar, Behnam

doi:https://doi.org/10.71503/JADSC.2025.1190869

Manuscript ID : 140308281190869 Visit : 108 Page: 44 - 60

https://doi.org/10.71503/JADSC.2025.1190869

Article Type: Original Research

Six Fuzzy Morphology Methods for Roles of ‎Combines Standard Sentences Persian language

Subject Areas : Computer Engineering

Haniye Rezaei ¹ , Homayun Motameni ² , Behnam Barzegar ³

1 -
2 -
3 -

Received: 2024-11-18 Accepted : 2025-02-02 Published : 2025-04-07

Keywords: Fuzzy System, Bi-gram, difuzzifire, independent roles, dependent roles‎,

Abstract :

Ability to remove ambiguities is one of the main characteristics of the fuzzy systems in resolving the problems like NLP and morphology. On the other hand, most of the conducted studies in the field of morphology of Farsi language are dealing with analysis of words by means of HMM statistical method. Therefore, this paper has conducted the statistical morphology in the role of words in a sentence in the two sets of independent roles like: “Subject, adverb, possess, possessive, subject, subject header, appositive, conjunct, conjunctive, exclamation and proclaimed” using Fuzzy system. Also, in this Fuzzy system, the Max (Product) Fuzzifier by Bi-gram labeling was used. In addition, regarding the importance of defuzzifier of step one, in all fuzzy systems, for the first time, 6 types of defuzzifiers of ‘Maximum membership, center of gravity, weighted average, mean of maximum, smallest of maximum, largest of maximum, center average’ were implemented and obtained results have shown that the Center of gravity defuzzifier method with the mean of 63.698% had better results in comparison with other defuzzifer methods.

References:

‎[1] ‎ H. Motameni and A. Peykar, "Morphology of ‎Compounds as Standard Words in Persian through ‎Hidden Markov Model and Fuzzy Method," Journal ‎of Intelligent & Fuzzy Systems, vol. 30, no. 3, pp. ‎‎1567-1580, 2016. ‎
‎[2] ‎ T. Chadza, K. G. Kyriakopoulos, S. Lambotharan, ‎Analysis of hidden markov model learning algo- ‎rithms for the detection and prediction of multi-stage ‎network attacks, Future generation computer systems ‎‎108 (2020) 636–649.‎
‎[3] ‎ J. Wettig, S. Hiltunen and R. Yangarber, "Hidden ‎Markov Models for induction of morphological ‎structure of natural language," Department of ‎Computer Science,University of Helsinki, Finland, ‎Helsinki, 2010.‎
‎[4] ‎ S. Naderi Parizi, "Implementation of hidden Markov ‎models associated with the ability to apply the ‎language, grammar, search methods and the ‎applicability of the model," Amirkabir University of ‎Technology, Department of Computer Engineering ‎and Information Technology, Tehran, 2007.‎
‎[5] ‎ C. C. Aggarwal, Data Mining, Switzerland: Springer ‎International, 2015. ‎
‎[6] ‎ M. Mohseni and B. Minaei-bidgoli, "A Persian Part-Of-‎Speech Tagger Based on Morphological Analysis," in ‎Proceedings of the Seventh International Conference ‎on Language Resources and Evaluation (LREC'10), ‎Valletta, Malta, 2010. ‎
‎[7] ‎ W. Khan, A. Daud, K. Khan, J. A. Nasir, M. Basheri, N. ‎Aljohani, F. S. Alotaibi, Part of speech tagging in urdu: ‎Comparison of machine and deep learning ‎approaches, IEEE Access 7 (2019) 38918–38936.‎
‎[8] ‎ P. Koehn, Statistical Machine Translation, New York: ‎United States of America by Cambridge University ‎Press, 2010, pp. 181-212.‎
‎[9] ‎ M. Shrivastava, "Hindi POS Tagger Using Naive ‎Stemming : Harnessing Morphological Information ‎Without Extensive Linguistic Knowledge," in ICON-‎‎2008:6th International Conference on Natural ‎Language Processing,Macmillan Publishers., India, ‎‎2008. ‎
‎[10] ‎ M. Bahrani, H. Sameti, N. Hafezi and S. Momtazi, ‎‎"A New Word Clustering Method for Building N-Gram ‎Language Models in Continuous Speech Recognition ‎Systems," in New Frontiers in Applied Artificial ‎Intelligence, 21st International Conference on ‎Industrial, Engineering and Other Applications of ‎Applied Intelligent Systems, IEA/AIE, Wroclaw, ‎Poland, 2008. ‎
‎[11] ‎ A. F. Alajmi, E. M. Saad and M. H. Awadalla, ‎‎"Hidden markov model based Arabic morphological ‎analyzer," International Journal of Computer ‎Engineering Research, vol. 2, no. 2, pp. 28-33, 2011. ‎
‎[12] D. Dutta, S. Halder, T. Gayen, Intelligent part of ‎speech tagger for hindi, Procedia Computer Science ‎‎218 (2023) 604–611‎
‎[13] T.-L. Tseng, F. Jiang, Y. Kwon, Hybrid type ii fuzzy ‎system & data mining approach for surface finish, ‎Journal of Computational Design and Engineering 2 (3) ‎‎(2015) 137–147.‎
‎[14] A. Chiche, B. Yitagesu, Part of speech tagging: a ‎systematic review of deep learning and machine ‎learning approaches, Journal of Big Data 9 (1) (2022) ‎‎1–25.‎
‎[15] ‎ D. Modi and N. Nain, "Part-of-Speech Tagging of ‎Hindi Corpus Using Rule-Based Method," Proceedings ‎of the International Conference on Recent ‎Cognizance in Wireless Communication & Image ‎Processing, Vols. 10.1007/978-81-322-2638-3, no. 28, ‎pp. 241-247, 2016. ‎
‎[16] ‎ J. S. Rohl, "A note on Backus Naur form," ‎Department of Computer Science, The University, ‎Manchester 13, 2010.‎
‎[17] ‎ S. Poria, E. Cambria, G. Winterstein and G.-B. ‎Huang, "Sentic patterns: Dependency based rules for ‎concept-level sentiment analysis.," Knowledge-Based ‎Systems., 2014.‎
‎[18] ‎ M. R. Costa-Juss`a, M. Farr´us, J. B. Mari˜no and J. ‎A. Fonollosa, "Study and comparison of rule-based ‎and statistical catalan-spanish machine translation ‎systems," Computing and Informatics, vol. 31, pp. ‎‎245-270, 2012. ‎
‎[19] A. Alnaied, M. Elbendak, A. Bulbul, An intelligent use ‎of stemmer and morphology analysis for arabic ‎information retrieval, Egyptian Informatics Journal 21 ‎‎(4) (2020) 209–217.‎
‎[20] ‎ K. Darwish, "Building a shallow morphological ‎analyzer in one day.," in ACL-02 Workshop on ‎Computational Approaches to Semitic Languages, ‎Philadelphia, PA, 2002. ‎
‎[21] ‎ K. Taghva, R. Elkhoury and J. S Coombs, "Arabic ‎stemming without a root dictionary.," in Information ‎Technology: Coding and Computing.ITCC 2005, ‎‎2005. ‎
‎[22] ‎ A. Mohamad, A.-S. Riyad and K. Ghass, "Building ‎an Effective Rule-Based Light Stemmer for Arabic ‎Language to Improve Search Effectiveness.," ‎International Arab Journal of Information Technology ‎‎(IAJIT), vol. 9, no. 4, pp. 368-372, 2012. ‎
‎[23] ‎ T. Buckwalter, "Buckwalter Arabic Morphological ‎Analyzer.," the Linguistic Data Consortium,, ‎Pennsylvania, 2002.‎
‎[24] ‎ H. K. Al Ameed, S. O. Al Ketbi, A. A. Al Kaabi, K. S. ‎Al Shebli, N. F. Al Shamsi, N. H. Al Nuaimi and S. S. Al ‎Muhairi, "Arabic light stemmer: anew enhanced ‎approach," in The Second International Conference ‎on Innovations in Information Technology (IIT’05), ‎‎2005. ‎
‎[25] ‎ L. Larkey, L. Ballesteros and M. Connel, "Improving ‎Stemming for Arabic Information Retrieval: Light ‎Stemming and Co-occurrence Analysis.," in 25th ‎annual international ACM SIGIR conference on ‎Research and development in information retrieval, ‎‎2002. ‎
‎[26] ‎ A. El-Hajar, M. Hajar and K. Zreik, "A System for ‎Evaluation of Arabic Root Extraction Methods.," in ‎fifth international Conference on Internet and Web ‎Applications and Services., 2010. ‎
‎[27] H.Alshalabi,S.Tiun,N.Omar,F.N.AL-‎Aswadi,K.A.Alezabi,Arabiclight-basedstemmer ‎usingnewrules,JournalofKingSaudUniversity-‎ComputerandInformationSciences34(9)(2022) 6635–‎‎6642‎
‎[28] M. F. Kabir, K. Abdullah-Al-Mamun, M. N. Huda, ‎Deep learning based parts of speech tagger for bengali, ‎in: 2016 5th International Conference on Informatics, ‎Electronics and Vision (ICIEV), IEEE, 2016, pp. 26–‎‎29.‎
‎[29] K. K. Zin, N. Thein, Hidden markov model with rule ‎based approach for part of speech tagging of ‎myanmar language, in: Proceedings of 3rd ‎International Conference on Communications and ‎Information, 2009, pp. 123–128.‎
‎[30] F. Pisceldo, M. Adriani, R. Manurung, Probabilistic part ‎of speech tagging for bahasa indonesia, in: Third ‎international MALINDO workshop, 2009, pp. 1–6.‎
‎[31] M. Attia, Y. Samih, A. Elkahky, H. Mubarak, A. ‎Abdelali, K. Darwish, Pos tagging for improving code-‎switching identification in arabic, in: Proceedings of ‎the Fourth Arabic Natural Language Processing ‎Workshop, 2019, pp. 18–29.‎
‎[32] F. Pisceldo, M. Adriani, R. Manurung, Probabilistic part ‎of speech tagging for bahasa indonesia, in: Third ‎international MALINDO workshop, 2009, pp. 1–6.‎
‎[33] M. Febryanto, I. Sulyaningsih, A. A. Zhafirah, Analysis ‎of translation techniques and quality of translated ‎terms of mechanical engineering in accredited ‎national journals, Professional Journal of English ‎Education 1 (2021) 116–119.‎

‎[34] A. Xv, Russian-english bidirectional machine ‎translation system, in: Proceedings of the Fifth Con- ‎ference on Machine Translation, 2020, pp. 320–325.‎
‎[35] H. Aldarmaki, A. Ullah, S. Ram, N. Zaki, Unsupervised ‎automatic speech recognition: A review, Speech ‎Communication 139 (2022) 76–91.‎
‎[36] ‎ M. Creutz, "Unsupervised segmentation of words ‎using prior distributions of morph length and ‎frequency," in Proc. 41st Meeting of ACL, ‎Sapporo,Japan, 2003.‎
‎[37] ‎ F. Ahmed and A. Nürnberger, "N-grams Conflation ‎Approach for Arabic," in ACM SIGIR Conference, ‎Amsterdam, 2007. ‎
‎[38] A. Y. Muaad, G. H. Kumar, J. Hanumanthappa, J. B. ‎Benifa, M. N. Mourya, C. Chola,‎
M. Pramodha, R. Bhairava, An effective approach for ‎arabic document classification using machine ‎learning, Global Transitions Proceedings 3 (1) (2022) ‎‎267–271.‎
‎[39] K. Tnaji, K. Bouzoubaa, S. L. Aouragh, A light arabic ‎pos tagger using a hybrid approach, in: Digital ‎Technologies and Applications: Proceedings of ICDTA ‎‎21, Fez, Morocco, Springer, 2021, pp. 199–208.‎
‎[40] ‎ M. El-Hadj, A.-S. IA and A.-A. AM, "Arabic Part of ‎Speech Tagging Using the Sentence Structure.," in 2nd ‎international Conference on Arabic Language ‎Resources & Tools, Cairo, 2009. ‎
‎[41] H. Hassani, Part of speech tagging (post) of a low-‎resource language using another language (devel- ‎oping a pos-tagged lexicon for kurdish (sorani) using a ‎tagged persian (farsi) corpus), CoRR (2022) ‎abs/2201.12793.‎
‎[42] S. Alqrainy, M. Alawairdhi, Towards developing a ‎comprehensive tag set for the arabic language, ‎Journal of Intelligent Systems 30 (1) (2020) 287–296.‎
‎[43] H. Motameni, A. Ebrahimnejad, J. Vahidi, et al., ‎Morphology of composition functions in persian ‎sentences through a newly proposed classified fuzzy ‎method and center of gravity defuzzification method, ‎Journal of Intelligent & Fuzzy Systems 36 (6) (2019) ‎‎5463–5473.‎
‎[44] ‎ S. M. Assi and M. Haji Abdolhosseini, "Grammatical ‎tagging of a Farsi Corpus.," International Journal of ‎Corpus Linguistics., vol. 5, no. 1, pp. 69-81, 2000. ‎
‎[45] ‎ M. Bijankhan, J. Sheykhzadegan, M. Bahrani and ‎M. Ghayoomi, "Lessons from Building a Persian ‎Written Corpus: Peykare," Language Resources and ‎Evaluation, vol. 45, pp. 143-164, 2011. ‎
‎[46] ‎ M. Shamsfard, H. Sadat Jafari and M. Ilbe, "STeP-1: ‎A Set of Fundamental Tools for Persian Text ‎Processing.," in LREC 2010, Valletta, Malt, 2010. ‎
‎[47] H. T.-P., L. K.-Y. and W. S.-L., "Mining linguistic ‎browsing patterns in the world wide web," Soft ‎Computing, vol. 5, pp. 329-336, 2002.‎
‎[48] F. M. Zanzotto, L. Dell’Arciprete, A. Moschitti, ‎Efficient graph kernels for textual entailment recog- ‎nition, Fundamenta Informaticae 107 (2-3) (2011) ‎‎199–222.‎
‎[49] N. Passalis, J. Raitoharju, A. Tefas, M. Gabbouj, ‎Efficient adaptive inference for deep convolutional ‎neural networks using hierarchical early exits, Pattern ‎Recognition 105 (2020) 107346.‎
‎[50] Z. Elaggoune, R. Maamri, I. Boussebough, A fuzzy ‎agent approach for smart data extraction in big data ‎environments, Journal of King Saud University-‎Computer and Information Sciences 32 (4) (2020) ‎‎465–478.‎
‎[51] M. E. Cintra, M. C. Monard, H. A. Camargo, A fuzzy ‎decision tree algorithm based on c4. 5, Mathware & ‎Soft Computing 20 (1) (2013) 56–62.‎
‎[52] X. Sun, L. Yuan, M. Liu, S. Liang, D. Li, L. Liu, ‎Quantitative estimation for the impact of mining ‎activities on vegetation phenology and identifying its ‎controlling factors from sentinel-2 time series, ‎International Journal of Applied Earth Observation ‎and Geoinformation 111 (2022) 102814.‎
‎[53] X. Bai, Y. Yang, Fuzzy decision tree algorithm based ‎on feature value’s class contribution level, Iranian ‎Journal of Fuzzy Systems 19 (4) (2022) 73–88.‎
‎[54] S. Sayami, S. Shakya, Nepali pos tagging using deep ‎learning approaches, NU. International Journal of ‎Science 17 (2) (2020) 69–84.‎
‎[55] ‎ A. Krassimir, "Intuitionistic fuzzy logics as tools for ‎evaluation of Data Mining processes," 25th ‎anniversary of Knowledge-Based Systems, vol. 80, pp. ‎‎122-130, 2015. ‎
‎[56] ‎ M. Moniri, "Fuzzy and Intuitionistic Fuzzy Turing ‎Machines," Fundamenta Informaticae, vol. 123, no. 3, ‎pp. 305-315, 2013. ‎
‎[57] C. Rahul, T. Arathi, L. S. Panicker, R. Gopikakumari, ‎Morphology & word sense disambiguation em- ‎bedded multimodal neural machine translation ‎system between sanskrit and malayalam, Biomedical ‎Signal Processing and Control 85 (2023) 105051.‎
‎[58] ‎ A. Bria, W. Faber and N. Leone, "Normal Form ‎Nested Programs," Fundamenta Informaticae, vol. 96, ‎no. 3, pp. 271-295, 2009. ‎
‎[59] R. Jayashree, S. K. Murthy, K. Sunny, Keyword ‎extraction based summarization of categorized ‎kannada text documents, International Journal on ‎Soft Computing 2 (4) (2011) 81.‎
‎[60] C. Gupta, A. Jain, N. Joshi, Fuzzy logic in natural ‎language processing–a closer view, Procedia computer ‎science 132 (2018) 1375–1384.‎
‎[61] G. Chen, T. T. Pham, N. Boustany, Introduction to ‎fuzzy sets, fuzzy logic, and fuzzy control systems, ‎Applied Mechanics Reviews 54 (6) (2001) B102–‎B103.‎
‎[62] H. Englund, H. Stockhult, S. Du Rietz, A. Nilsson, G. ‎Wennblom, Learning-environment uncertainty and ‎students’ approaches to learning: A self-determination ‎theory perspective, Scandinavian Journal of ‎Educational Research (2022) 1–15.‎
‎[63] T. Chen, An innovative fuzzy and artificial neural ‎network approach for forecasting yield under an ‎uncertain learning environment, Journal of Ambient ‎Intelligence and Humanized Computing 9 (2018) ‎‎1013–1025.‎ ‎. ‎
‎[64] A. Chiche, B. Yitagesu, Part of speech tagging: a ‎systematic review of deep learning and machine ‎learning approaches, Journal of Big Data 9 (1) (2022) ‎‎1–25.‎
‎[65] C. Marsala, B. Bouchon-Meunier, Fuzzy data mining ‎and management of interpretable and subjective ‎information, Fuzzy Sets and Systems 281 (2015) ‎‎252–259.‎
‎[66] ‎ C. Marsala and B. Bouchon-Meunier, "Fuzzy data ‎mining and management of interpretable and ‎subjective information," Fuzzy Sets and Systems, vol. ‎‎281, no. Special Issue Celebrating the 50th ‎Anniversary of Fuzzy Sets, p. 252–259, 2015. ‎
‎[67] ‎ F. M. Zanzotto, L. Dell'Arciprete and A. Moschitti, ‎‎"Efficient Graph Kernels for Textual Entailment ‎Recognition," Fundamenta Informaticae, vol. ‎Moschitti, no. 2-3, pp. 199-222, 2011. ‎
‎[68] ‎ T.-L. Tseng, F. Jiang and Y. Kwon, "Hybrid Type II ‎fuzzy system & datamining approach for surface ‎finish," Journal of Computational Design and ‎Engineering, vol. 2, no. 3, pp. 137-147, 2015. ‎
‎[69] ‎ E. J. Khatib, R. Barco, A. Gómez-Andrades, P. ‎Muñoz and I. Serrano, "Data mining for fuzzy ‎diagnosis systems in LTE networks," Expert Systems ‎with Applications, vol. 42, no. 21, p. 7549–7559, 2015. ‎
‎[70] ‎ A. Estiri, M. Kahani, H. Ghaemi and M. Abasi, ‎‎"Improvement of An Abstractive Summarization ‎Evaluation Tool using Lexical-Semantic Relations and ‎Weighted Syntax Tags in Farsi Language," in 12th ‎Iranian Conference on Intelligent Systems Higher ‎Education Complex of Bam, Bam, 2014. ‎
‎[71] ‎ A. Jacob, A. Babu and P. C. R. Raj, "TnT tagger with ‎fuzzy rule based learning," in Signal Processing, ‎Informatics, Communication and Energy Systems ‎‎(SPICES), Kozhikode, 2015. ‎
‎[72] A. R. Martinez, Part-of-speech tagging, Wiley ‎Interdisciplinary Reviews: Computational Statistics 4 ‎‎(1) (2012) 107–113.‎
‎[73] ‎ A. R. Martinez, "Part-of-speech tagging," Wiley ‎Periodicals, Inc., vol. 4, pp. 107-113, 2012. ‎
‎[74] ‎ H. Yamane and M. Hagiwara, "Oxymoron ‎generation using an association word corpus and a ‎large-scale N-gram corpus," Soft Computing, vol. 19, ‎pp. 919-927, 2015. ‎
‎[75] ‎ J. Hoon Kim, J. Seo and G. Chang Kim, "Estimating ‎Membership Functions in a Fuzzy Network Model for ‎Part-Of-Speech Tagging," Journal of Intelligent & ‎Fuzzy Systems: Applications in Engineering and, vol. 4, ‎no. 4, pp. 309-320, 1996. ‎
‎[76] ‎ K. Atanassov, "Intuitionistic fuzzy logics as tools for ‎evaluation of Data Mining processes," 25th ‎anniversary of Knowledge-Based Systems, vol. 80, pp. ‎‎122-130, 2015. ‎
‎[77] ‎ A. Chitra and A. Rajkumar, "Paraphrase Extraction ‎using fuzzy hierarchical clustering," Applied Soft ‎Computing, vol. 34, p. 426–437, 2015. ‎
‎[78] T. J. Ross, Properties of membership functions, ‎fuzzification, and defuzzification, Fuzzy logic with ‎engineering applications (2010) 89–116.‎
‎[79] K. Gilda, S. Satarkar, Analytical overview of ‎defuzzification methods, International Journal of Ad- ‎vance Research, Ideas and Innovations in Technology ‎‎6 (2) (2020) 359–365.‎
‎[80] P. M. LaCasse, W. Otieno, F. P. Maturana, A ‎hierarchical, fuzzy inference approach to data ‎filtration and feature prioritization in the connected ‎manufacturing enterprise, Journal of Big Data 5 (2018) ‎‎1–31.‎
‎[81] L. Perumal, F. H. Nagi, Switching control system based ‎on largest of maximum (lom) defuzzification- theory ‎and application, Fuzzy Logic–Controls, Concepts, ‎Theories andApplications, InTech, Rijeka (2012) 301–‎‎324.‎

‎[82] ‎ L. Perumal and F. H. Nagi, "Switching Control ‎System Based on Largest of Maximum (LOM) ‎Defuzzification – Theory and Application," in Fuzzy ‎Logic – Controls, Concepts, Theories and Applications, ‎Slavka Krautzeka, InTech, 2012, pp. 301-325.‎
‎[83] ‎ S. Naaz, A. Alam and R. Biswas, "Effect of different ‎defuzzification methods in a fuzzy based load ‎balancing application," IJCSI International Journal of ‎Computer Science Issues, vol. 8, no. 5, pp. 261-267, ‎‎2011. ‎
‎[84] ‎ H. Tzung-Pei, C. Chun-Hao, W. Yu-Lung and L. ‎Yeong-Chyi, "A GA-based fuzzy mining approach to ‎achieve a trade-off between number of rules and ‎suitability of membership functions," Soft Computing, ‎vol. 10, p. 1091–1101, 2006. ‎
‎[85] ‎ C. D. Manning, "Part-of-Speech Tagging from 97% to ‎‎100%: Is It Time for Some Linguistics?," in ‎CICLing'11 Proceedings of the 12th international ‎conference on Computational linguistics and ‎intelligent text processing., Tokyo, 2011.‎

Share To

Article Url

Six Fuzzy Morphology Methods for Roles of ‎Combines Standard Sentences Persian language

Sanad

Links

Related Centers

Technical Support

Official pages