Assar, Samaneh, Masoumi, Behrooz. Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs. Journal of Computer & Robotics. 2013;6(2):15-22. doi: