Persian Printed Document Analysis and Page Segmentation
Subject areas: Journal of Computer & Robotics
Ali Broumandnia 1, Jamshid Shanbehzadeh 2
1 - Islamic Azad University of South Tehran
2 - Tarbiat Moalem
Keywords: Page segmentation, pyramidal image structure, connected components, horizontal and vertical merging
Abstract:
This paper presents a hybrid method for Persian page segmentation that combines low-resolution and high-resolution analysis. In the low-resolution stage, a pyramidal image structure is constructed for multiscale analysis, and the document image is segmented into a set of regions. In the high-resolution stage, connected-component analysis segments each region into homogeneous regions and identifies them as text, images, or tables/drawings. The proposed method was tested on Persian documents. The results of these tests show that the proposed method gives more accurate and faster results.
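The two-stage idea can be illustrated with a minimal sketch. It is not the authors' implementation: the 2x2 OR-reduction used to build the pyramid, the 8-connectivity, the function names (`reduce_level`, `build_pyramid`, `connected_components`, `segment`), and the toy `page` are all assumptions made for illustration, and the final classification of regions into text, images, and tables/drawings is left out. The input is assumed to be a binarized page, given as a list of rows with 1 for ink and 0 for background.

```python
from collections import deque


def reduce_level(img):
    """Halve the resolution: an output pixel is 1 if any pixel in its 2x2 block is 1."""
    h, w = len(img), len(img[0])
    out = []
    for r in range(0, h, 2):
        row = []
        for c in range(0, w, 2):
            block = [img[rr][cc]
                     for rr in range(r, min(r + 2, h))
                     for cc in range(c, min(c + 2, w))]
            row.append(1 if any(block) else 0)
        out.append(row)
    return out


def build_pyramid(img, levels):
    """Return [full resolution, half resolution, ...] with `levels` reduced levels."""
    pyramid = [img]
    for _ in range(levels):
        pyramid.append(reduce_level(pyramid[-1]))
    return pyramid


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of 8-connected foreground blobs."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if img[r][c] == 0 or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:  # breadth-first flood fill of one component
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def segment(img, levels=1):
    """Coarse regions from the top pyramid level, then fine components inside each."""
    coarse = build_pyramid(img, levels)[-1]
    scale = 2 ** levels
    h, w = len(img), len(img[0])
    result = []
    for top, left, bottom, right in connected_components(coarse):
        # Map the coarse box back to full-resolution coordinates.
        r0, c0 = top * scale, left * scale
        r1, c1 = min((bottom + 1) * scale, h), min((right + 1) * scale, w)
        crop = [row[c0:c1] for row in img[r0:r1]]
        # Fine boxes are relative to the crop's top-left corner.
        result.append(((r0, c0, r1 - 1, c1 - 1), connected_components(crop)))
    return result


# Hypothetical 6x8 binary page: two small marks at the top, one block at the bottom.
page = [
    [1, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 1, 1, 0],
]
regions = segment(page, levels=1)
```

In this sketch the coarse level merges marks that lie close together, so the two separate marks at the top of `page` fall into a single low-resolution region and are only told apart again by the high-resolution pass. This is one plausible reason a low-resolution first stage can reduce the work done at full resolution.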