Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

Barkhan, Masoud; Alizadeh, Fattah; Maihami, Vafa

رقم المقالة : JACET-1811-1243 زيارة : 335 الصفحة: 71 - 80

نوع المخطوط: ابحاث

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

الموضوعات : Human Computer Interaction and Interface

Masoud Barkhan ¹ , Fattah Alizadeh ² , Vafa Maihami ³

1 - Computer Dept, Technical &amp;amp; Engineering Faculty, Islamic Azad University Sanandaj Branch, Sanandaj, Iran
2 - Computer Dept, Technical &amp; Engineering Faculty, Islamic Azad University Mahabad Branch, Mahabad, Iran
3 - Computer Dept, Technical &amp; Engineering Faculty, Islamic Azad Unversity Sanandaj Branch, Sanandaj, Iran

تاريخ الإرسال : 10 الأحد , ربيع الأول, 1440 تاريخ التأكيد : 11 السبت , جمادى الثانية, 1440 تاريخ الإصدار : 26 الأربعاء , شعبان, 1440

الکلمات المفتاحية: Persian language, automatic letter detecting system, image processing, lip reading,

ملخص المقالة :

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges being the noise in some situations, which is the main cause of errors in the correct diagnosis of speech. One of the ways for solving this problem is image processing, that in this study, the purpose has been designing and implementing a system for automatic recognition of Persian letters through image-processing techniques. For this purpose, after providing a database for Persian verbal phonetics, we first used image processing techniques to eliminate the presence of noises and detect the cantor in lip, in which we used edge detection to identify the edges of the lip. After finding the upper and lower points of the lip for five frames of each film, we used the mean gap between the upper and lower points of the lip as the characteristic of each phoneme and then by providing a database of these features, with the help of the back propagation artificial neural network and The radial basis function have categorized these phonemes, which ultimately achieved the desired results in the categorization of the phonemes. Of course, the precision of classification using the back propagation artificial neural network has been more than radial basis function ANN.

المصادر:

شارک

عنوان URL للمقالة

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

سند

الروابط

المراكز ذات الصلة

دعامة

الصفحات الرسمية