Comparison of the Efficiency of Statistical Learning Algorithms and Artificial Neural Networks to Predict Stock Prices
Subject Areas : International Journal of Mathematical Modelling & Computations
Alireza Sadat Najafi
1
,
Soheila Sardar
2
*
1 - Department of Information Technology Management, Faculty of Management and Social Sciences, Islamic Azad University Tehran North Branch, Tehran, Iran
2 - Department of Industrial Management, Faculty of management and Social Sciences, Islamic Azad University Tehran North Branch, Tehran, Iran
Keywords: Artificial Neural Network, Capital Market, Prediction, Data Analysis, Statistical Learning Algorithm,
Abstract :
The importance of the capital market in economic development is undeniable through the effective management of capital and the optimal allocation of resources. In this study, according to capital market behaviors and research, Statistical Learning (SL) algorithms compared to Artificial Neural Networks (ANN) to analyze time-series data and predict stock prices have been investigated. In studies to compare methods or provide hybrid models, most statistical learning algorithms are limited and examined without the comparison of other algorithms. In this study, to eliminate this shortcoming by implementing and comparing statistical learning algorithms in the two categories of Regression Learner and Classification Learner, the most efficient algorithm has been identified based on the selected shares and based on the presented parameters. The first category (Regression Learner) includes Linear Regression, Interaction Linear Regression, Robust Linear Regression, Stepwise Linear Regression, Fine Tree, Medium Tree, Coarse Tree, Linear Support Vector Machine (SVM), Quadratic SVM, Cubic SVM, Fine Gaussian SVM, Medium Gaussian SVM, Coarse Gaussian SVM, Ensemble Boosted Trees, Ensemble Bagged Trees, Squared Exponential Gaussian Process Regression, Matern 5/2 Gaussian Process Regression, Exponential Gaussian Process Regression, Rational Quadratic Gaussian Process Regression. The second category (Classification Learner) includes Gaussian, Naive Bayes, K-nearest neighbors. The results show that Regression Learner methods are more effective in predicting the price of selected stocks.
[1] M. Akbari, H. Homaei and M. Heidari, An intelligent fault diagnosis approach for gears and bearings based on
wavelet transform as a preprocessor and artificial neural networks, International Journal of Mathematical
Modelling & Computations, 4 (2014) 309β329.
[2] O. Altay, M. Ulas and K. E. Alyamac, Prediction of the fresh performance of steel fiber reinforced
self-compacting concrete using quadratic SVM and weighted KNN models, IEEE Access, 8 (2020) 92647β
92658.
[3] T. Beckers, An introduction to Gaussian process models, Electrical Engineering and Systems Science, (2021),
doi:10.48550/arXiv.2102.05497.
[4] P. Cunningham and S. J. Delany, k-Nearest neighbour classifiers, ACM Computing Surveys, 54 (6) (2007),
Article 128, doi:10.1145/3459665.
[5] Y. C. Du and A. Stephanus, Levenberg-Marquardt neural network algorithm for degree of arteriovenous fistula
stenosis classification using a dual optical photoplethysmography sensor, Sensors, 18 (2018) 1β26,
doi:10.3390/s18072322.
[6] A. M. Farayola, A. N. Hasan and A. Ali, Optimization of PV systems using data mining and regression learner
MPPT techniques, Indonesian Journal of Electrical Engineering, 10 (2018) 1080β1089.
[7] M. Farrell and A. Correa, Gaussian process regression models for predicting stock trends, Computer Science,
10 (2007) 3414.
[8] P. L. Flom and D. L. Cassell, Stopping stepwise: Why stepwise and similar selection methods are bad, and what
you should use, NESUG Conference, Baltimore, Maryland, (2007).
[9] J. Gareth, D. Witten, T. Hastie and R. Tibshirani, An Introduction to Statistical Learning: with Applications in
R, Springer Texts in Statistics, Springer, (2014).
A. Sadat Najafi and S. Sardar/πΌπ½π2πΆ, 12 -04 (2022) 275-297. 297
[10] E. Gholamian and S. M. Davoodi, Predicting the direction of stock market prices using random forest,
Financial Engineering and Portfolio Management, 9 (2018) 301β322.
[11] E. Guresen, G. Kayakutlu and T. U. Daim, Using artificial neural network models in stock market index
prediction, Expert Systems with Applications, 38 (8) (2011) 10389β10397.
[12] S. Hajiali, M. Doroudian and H. Borna, Understanding Behavior of Antineoplastic Molecules Based on MLR
Models, International Journal of Mathematical Modelling & Computations, 1 (2011) 263β269.
[13] H. Jafarzadeh, M. Mahdianpari, E. Gill, F. Mohammadimanesh and S. Homayouni, Bagging and boosting
ensemble classifiers for classification of multispectral, hyperspectral and PolSAR data: A comparative
evaluation, Remote Sensing, 13 (21) (2021) 4405, doi:10.3390/rs13214405.
[14] Y. Kim and H. Oh, Comparison between multiple regression analysis, polynomial regression analysis, and an
artificial neural network for tensile strength prediction of BFRP and GFRP, Materials, 14 (17) (2021) 4861.
[15] M. Kuss, Gaussian Process Models for Robust Regression, Classification, and Reinforcement Learning
Darmstadt, Technische UniversitΓ€t, Ph.D. Thesis, (2006).
[16] S. L. Lin, Application of machine learning to a medium Gaussian support vector machine in the diagnosis of
motor bearing faults, Electronics, 10 (18) (2021) 2266, doi:10.3390/electronics10182266.
[17] Y. Lin, H. Guo and J. Hu, An SVM-based approach for stock market trend prediction, International Joint
Conference on Neural Networks (IJCNN), Dallas, Texas, USA, (2013) 4β9.
[18] H. Nickisch and C. Rasmussen, Approximations for binary Gaussian process classification, Journal of Machine
Learning Research, 9 (2008) 2035β2078.
[19] R. Noori, K. Ashrafi and A. Ajdarpour, Comparison of ANN and PCA based multivariate linear regression
applied to predict the daily average concentration of CO: A case study of Tehran, Journal of the Earth and
Space Physics, 34 (2008) 135β152.
[20] R. Raee, A. Nikahd and M. Habib, M., The Index Prediction of Tehran Stock Exchange by Combining the
Principal Components Analysis, Support Vector Regression and Particle Swarm Optimization, Journal of
Financial Management Strategy, Alzahra University, 4 (2017) 1β23.
[21] R. Rivest, Learning decision lists, Machine Learning, 2 (1987) 229β246.
[22] C. Robinson, S. Tomek and R. Schumacker, Tests of moderation effects: Difference in simple slopes versus the
interaction term, Multiple Linear Regression Viewpoints, 39 (2013) 16β24.
[23] A. Sadat Najafi and S. Sardar, Application of econometric modeler for predicting stock prices in the capital
market, Financial Engineering and Portfolio Management, 11 (45) (2020) 223β246.
[24] D. Selvamuthu, V. Kumar and A. Mishra, Indian stock market prediction using artificial neural networks on
tick data, Financial Innovation, 16 (2019) 5β16.
[25] D. Shah, H. Isah and F. Zulkernine, Stock market analysis: A review and taxonomy of prediction techniques,
International Journal of Financial Studies, 7 (2) (2019) 7β26.
[26] K. Shailaja and M. Manjunath, Prediction and clustering techniques used in the development of stock
forecasting model, International Journal of Innovative Technology and Exploring Engineering, 9 (3) (2020)
2278β3075.
[27] G. Sundar and K. Satyanarayana, Stock prediction scrutiny using artificial neural network, International
Journal of Recent Technology and Engineering, 7 (623) (2019) 2277β3878.
[28] D. Szafron, R. Greiner, P. Lu, D. Wishart, C. Macdonell, J. Anvik, B. Poulin, Z. Lu and R. Eisner, Explaining
naΓ―ve Bayes classifications, Computer Science, (2003), doi:10.7939/R36D5PH6N.
[29] B. Taremi and B. Zeighami, Biostatistics, Payame Noor University, 17th Edition, (2020), (In Persian).
[30] J. Ticknor, A bayesian regularized artificial neural network for stock market forecasting, Expert Systems with
Applications, 40 (14) (2013) 5501β5506.
[31] G. Vaishnavi, V. Shriya and K. Ashwini, Stock market prediction using linear regression and support vector
machines, International Journal of Applied Engineering Research, 14 (2019) 1931β1934.
[32] C. Yu and W. Yao, Robust linear regression: A review and comparison, Communications in Statistics
-Simulation and Computation, 46 (8) (2017) 6261β6282.
[33] Z. Zhang, Introduction to machine learning: K-nearest neighbors, Annals of Translational Medicine, 4 (11)
(2016) 218