مروری اجمالی بر یادگیری ماشین: فرآیند، الگوریتم ها، نحوه انتخاب الگوریتم، کاربردها، مزایا و چالش های آن
محورهای موضوعی : مدیریت
امل سیاح
1
,
امیر عباس شجاعی
2
,
ali ali akbari
3
,
مهدی حاجی رضایی
4
,
حمید توحیدی
5
1 - دانشکده مهندسی صنایع، دانشگاه آزاد اسلامی، تهران جنوب، تهران، ایران
2 - دانشگاه تهران جنوب
3 - Islamic Azad University, South Tehran Branch, Department of Industrial engineering
4 - دانشکده ارتباطات، حمل و نقل و لجستیک دانشگاه آزاد اسلامی واحد تهران جنوب
5 - دانشگاه آزاد اسلامی، واحد تهران جنوب، دانشکده مهندسی صنایع
کلید واژه: یادگیری ماشین, هوش مصنوعی, فرایند یادگیری ماشین, الگوریتم های یادگیری ماشین,
چکیده مقاله :
الگوریتمهای یادگیری ماشینی مدلهای محاسباتی هستند که به رایانهها اجازه میدهند الگوها را بفهمند و بدون برنامهنویسی صریح بر اساس دادهها، پیشبینی یا تصمیم گیری کنند. این الگوریتمها شالوده هوش مصنوعی مدرن را تشکیل میدهند و در کاربردهای مختلفی از جمله تشخیص تصویر و گفتار، پردازش زبان طبیعی، سیستمهای توصیه، تشخیص تقلب، خودروهای خودران و غیره استفاده میشوند. این مطالعه با هدف مروری اجمالی بر یادگیری ماشین به ارائه یک نمای کلی از توسعه یادگیری ماشینی تا امروز می پردازد و سپس فرآیند یادگیری ماشین، الگوریتمهای مختلف آن، روش انتخاب الگوریتم ها را مورد بحث قرار می دهد. همچنین به کاربردها، چالشها و مزایای یادگیری ماشین اشاره می شود. به طور کلی، این مقاله قصد دارد به عنوان یک نقطه مرجع برای متخصصان دانشگاهی، صنعت و همچنین برای تصمیمگیرندگان در موقعیتهای مختلف دنیای واقعی و حوزههای کاربردی، مورد استفاده قرار گیرد. روش تحقیق حاضر مبتنی بر تجزیه و تحلیل کیفی است که در آن ادبیات مختلف براساس موضوع یادگیری ماشین مورد بررسی قرار گرفته است. نتایج نشان می دهد که یادگیری ماشین برای ایجاد تحول بزرگ در فناوری آماده است و یکی از سریعترین فناوریهایی است که تا امروز مورد استفاده بوده است و به طور مداوم در حال تکامل است، یادگیری ماشین یک فناوری با ریسک و بازده بالا می باشد.
Machine learning algorithms are computational models that allow computers to understand patterns and make predictions or decisions based on data without explicit programming. These algorithms form the foundation of modern artificial intelligence and are used in various applications such as image and speech recognition, natural language processing, recommender systems, fraud detection, and autonomous vehicles. This study aims to provide a brief overview of machine learning by discussing the development of machine learning to date and then examining the machine learning process, its various algorithms, and the method of selecting algorithms. It also addresses the applications, challenges, and benefits of machine learning. Overall, this paper intends to serve as a reference point for academic and industry professionals as well as for decision-makers in various real-world situations and application domains. The present research methodology is based on qualitative analysis in which various literature based on machine learning is reviewed. The results show that machine learning is poised to create a major technological revolution and is one of the fastest-growing technologies used to date, and it is constantly evolving. Machine learning is a high-risk, high-reward technology.
1. Ahmad, F.S.; Ali, L. A, (2020), “hybrid machine( learning framework to predict mortality in paralytic ileus patients using electronic health records (EHRs)”. J. Ambient Intell. Humaniz. Comput., 2020, 12(2), 3283-3293
2. Alzubi, J.A., Nayyar, A., & Kumar, A. (2019). Machine Learning from Theory to Algorithms: An Overview. Journal of Physics: Conference Series, 1142
3. A. L. Samuel. (1967). "Some Studies in Machine Learning Using the Game of Checkers. II—Recent Progress," in IBM Journal of Research and Development, vol. 11, no. 6, pp. 601-617, Nov. 1967, doi: 10.1147/rd.116.0601
4. A. M. Turing, I. (1950). “Computing machinery and intelligence”, Mind, Volume LIX, Issue 236, October 1950, Pages 433–460, https://doi.org/10.1093/mind/LIX.236.433
5. Banoula, Mayank. (2023). “Machine Learning Steps: A Complete Guide”, https://www.simplilearn.com/tutorials/machine-learning-tutorial/machine-learning-steps 6. Bennett, J., & Lanning, S. (2007). “The Netflix Prize”
7. Boukerche A, Wang J. (2020), “Machine learning-based trafc prediction models for intelligent transportation systems”. Comput Netw. 2020;181
8. Brown, T.B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D.M., Wu, J., Winter, C., Hesse, C., Chen, M., Sigler, E., Litwin, M., Gray, S., Chess, B., Clark, J., Berner, C., McCandlish, S., Radford, A., Sutskever, I., & Amodei, D. (2020).” Language Models are Few-Shot Learners”. ArXiv, abs/2005.14165
9. B. Senthil Arasu, B. Jonath Backia Seelan, N. (2020), “Thamaraiselvan,A machine learning-based approach to enhancing social media marketing”,Computers & Electrical Engineering, Volume 86,2020, 106723,ISSN 0045-7906, https://doi.org/10.1016/j.compeleceng.2020.106723
10. Davenport, T. H. (2018). “The AI advantage: How to Put the Artificial Intelligence Revolution to Work”. Cambridge, MA: MIT Press
11. Dean, Jeffrey & Corrado, G.s & Monga, Rajat & Chen, Kai & Devin, Matthieu & Le, Quoc & Mao, Mark & Ranzato, Aurelio & Senior, Andrew & Tucker, Paul & Yang, Ke & Ng, Andrew. (2012). “Large Scale Distributed Deep Networks. Advances in neural information processing systems”
12. Devarapalli, Sri. (2022). “Language Translation using Machine Learning”
13. Devlin, J., Chang, M., Lee, K., & Toutanova, K. (2019). “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”. North American Chapter of the Association for Computational Linguistics
14. DeJong, G. (1981). “Generalizations Based on Explanations”. International Joint Conference on Artificial Intelligence
15. Feng-Hsiung Hsu. (1999). "IBM's Deep Blue Chess grandmaster chips," in IEEE Micro, vol. 19, no. 2, pp. 70-81, March-April 1999, doi: 10.1109/40.755469
16. Forbes.com. (2021). “AutoML 2.0: Is The Data Scientist Obsolete?”. URL: https://www.forbes.com/sites/cognitiveworld/2020/04/07/automl-20-is-the-data-scientist-obsolete/?sh=63b69def53c9. Accessed Oct 8, 2021
17. Gayhardt ,Lauryn. Yoshioka , Hiroshi. (2024). “How to select algorithms for Azure Machine Learning”, https://learn.microsoft.com/en-us/azure/machine-learning/how-to-select-algorithms?view=azureml-api-1
18. Geeksforgeeks, (2024), “Machine Learning Algorithms”, https://www.geeksforgeeks.org/machine-learning-algorithms/
19. Geeksforgeeks, (2024), “what is machine learning”, https://www.geeksforgeeks.org/machine-learning-algorithms/
20. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning (1sted.). MIT Press
21. Goodfellow, Ian & Pouget-Abadie, Jean & Mirza, Mehdi & Xu, Bing & Warde-Farley, David & Ozair, Sherjil & Courville, Aaron & Bengio, Y.. (2014). “Generative Adversarial Networks. Advances in Neural Information Processing Systems”. 3. 10.1145/3422622
22. Grace, K., Salvatier, J., Dafoe, A., Zhang, B., & Evans, O. (2019). “When will AI exceed human performance? Evidence from AI experts?” Journal of Artificial Intelligence Research, 62, 729-754
23. Hebb, D. O. (1949). “The organization of behavior; a neuropsychological theory. Wiley
24. J. Deng, W. Dong, R. Socher, L. -J. Li, Kai Li and Li Fei-Fei. (2009). "ImageNet: A large-scale hierarchical image database," 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 2009, pp. 248-255, doi: 10.1109/CVPR.2009.5206848
25. Khanzode, K. C. A. & Sarode, R. D. (2020). Advantages and disadvantages of artificial intelligence and machine learning: A literature review. Int. J. Libr. Inf. Sci. (IJLIS) 9, 30–36
26. Kharwal, Aman, (2023), “Here’s How to Choose Machine Learning Algorithms”, https://thecleverprogrammer.com/2023/09/28/heres-how-to-choose-machine-learning-algorithms/
27. Krizhevsky, Alex & Sutskever, Ilya & Hinton, Geoffrey. (2012). “ImageNet Classification with Deep Convolutional Neural Networks. Neural Information Processing Systems”. 25. 10.1145/3065386
28. Kulin, M., Kazaz, T., De Poorter, E., & Moerman, I. (2021). A Survey on Machine Learning-Based Performance Improvement of Wireless Networks: PHY, MAC and Network Layer. Electronics, 10(3), 318. https://doi.org/10.3390/electronics10030318
29. Learned-Miller, E. G.(2014). Introduction to supervised learning. I: Department of Computer Science, University of Massachusetts
30. Lev Craig. (2024). “What is machine learning? Guide, definition and examples”, https://www.techtarget.com/searchenterpriseai/definition/machine-learning-ML
31. Le, Quoc & Ranzato, Marc'Aurelio & Monga, Rajat & Devin, Matthieu & Chen, Kai & Corrado, G.s & Dean, Jeff & Ng, Andrew. (2011).” Building high-level features using large scale unsupervised learning”. Proceedings of ICML. 1
32. Leong, Lester . (2019). “Machine Learning Pipelines: Feature Engineering Numbers”, https://towardsdatascience.com/machine-learning-pipelines-feature-engineering-numbers-29f53aaec82a
33. McCulloch, W.S., Pitts, W. (1943). “A logical calculus of the ideas immanent in nervous activity”. Bulletin of Mathematical Biophysics 5, 115–133. https://doi.org/10.1007/BF02478259
34. Maleki, Farhad, et al. (2020). "Overview of machine learning part 1: fundamentals and classic approaches." Neuroimaging Clinics 30.4 (2020): e17-e32
35. Malone, B.; Simovski, B.; Moliné, C.; Cheng, J.; Gheorghe, M.; Fontenelle, H.; Vardaxis, I.; Tennøe, S.; Malmberg, J.A.; Stratford, R.; Clancy, T.(2020), “ Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs”. Sci. Rep., 2020, 10(1), 22375
36. Marchand A, Marx P.(2020), “Automated product recommendations with preference-based explanations”. J Retail. 2020;96(3):328–43
37. Menaga, A. & Shanmugam, Vasantha. (2022). “Smart Sustainable Agriculture Using Machine Learning and AI: A Review”. 10.1007/978-981-16-7952-0_42
38. Mikolov, T., Chen, K., Corrado, G.S., & Dean, J. (2013). “Efficient Estimation of Word Representations in Vector Space”. International Conference on Learning Representations
39. Minsky, Marvin. (1952). “A Neural-Analogue Calculator Based Upon a Probability Model of Reinforcement.” Harvard University Psychological Laboratories, Cambridge, Massachusetts
40. Moravec, H.P. (1990). “The Stanford Cart and the CMU Rover”. In: Cox, I.J., Wilfong, G.T. (eds) Autonomous Robot Vehicles. Springer, New York, NY. https://doi.org/10.1007/978-1-4613-8997-2_30
41. Nesajian, Minoo, (2024), “Machine Learning Algorithms You Should Know – 10 Key Algorithms for 2023,https://blog.faradars.org/ [In Persian]
42. Pan, Y. (2016).” Heading toward artificial intelligence 2.0”. Engineering, 2(4), 409- 413
43. Radford, A., & Narasimhan, K. (2018). “Improving Language Understanding by Generative Pre-Training”
44. Rajbanshi,Sabita,2021,Everything you need to know about Machine Learning, published , a part of the Data Science Blogathon
45. Ren R, Wu DD, Liu T. (2018), “Forecasting stock market movement direction using sentiment analysis and support vector machine”. IEEE SystemsJournal. 2018;13(1):760-70
46. Rosenblatt, F. (1958). "The perceptron: A probabilistic model for information storage and organization in the brain". Psychological Review. 65 (6): 386–408. CiteSeerX 10.1.1.588.3775. doi:10.1037/h0042519. PMID 13602029
47. Sejnowski, T.J., & Rosenberg, C.R. (1988). “NETtalk: a parallel network that learns to read aloud”
48. Silver, D., Huang, A., Maddison, C. et al. (2016). “Mastering the game of Go with deep neural networks and tree search”. Nature 529, 484–489 https://doi.org/10.1038/nature16961
49. Taherdoost, Hamed. (2023). “Enhancing Social Media Platforms with Machine Learning Algorithms and Neural Networks. Algorithms”. 16. 271. 10.3390/a16060271
50. Tan, M., & Le, Q.V. (2019). “EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks”. ArXiv, abs/1905.11946
51. T. Cover and P. Hart. (1967). "Nearest neighbor pattern classification," in IEEE Transactions on Information Theory, vol. 13, no. 1, pp. 21-27, January 1967, doi: 10.1109/TIT.1967.1053964
52. Tuia, D., Kellenberger, B., Beery, S. et al. (2022), “Perspectives in machine learning for wildlife conservation”. Nat Commun 13, 792. https://doi.org/10.1038/s41467-022-27980-y
53. Y. Taigman, M. Yang, M. Ranzato and L. Wolf. (2014). "DeepFace: Closing the Gap to Human-Level Performance in Face Verification," 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 1701-1708, doi: 10.1109/CVPR.2014.220
54. Yurushkin, Michael , “ How do Machine Learning Pipelines Work?”, https://broutonlab.com/blog/how-machine-learning-pipelines-work/
55. Z. Zhang. (2012). "Microsoft Kinect Sensor and Its Effect," in IEEE MultiMedia, vol. 19, no. 2, pp. 4-10, Feb. 2012, doi: 10.1109/MMUL.2012.24