Word Spotting in Handwritten Document Images based on Multiple Features
Mallikarjun Hangarge1, Veershetty C.2
1Mallikarjun Hangarge*, Department of Computer Science, Karanata Arts, Science and Commerece College, Bidar, India,
2Veershetty C., Department of Computer Science, Government First Grade College, Basavakalayan, Bidar, India.
Manuscript received on September 16, 2019. | Revised Manuscript received on 24 September, 2019. | Manuscript published on October 10, 2019. | PP: 3527-3537 | Volume-8 Issue-12, October 2019. | Retrieval Number: L26251081219/2019©BEIESP | DOI: 10.35940/ijitee.L2625.1081219
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: This paper presents word spotting in handwritten documents based on multiple features. Multiple features are derived using Gabor, Histogram oriented gradient (HOG), Local binary pattern, texture filters and Morphological filters. The real time documents are heterogeneous in nature, for instance application forms, postal cards, railway reservations forms etc. includes handwritten and printed text with different scripts. To spot a word in such documents and retrieving them from a huge digitized repository is a challenging task. To address such issues word spotting based on multiple features is carried out with learning and without learning methods. In both the methods (learning and learning free) texture filters are exhibiting outstanding performance in terms of precision recall and f-measures. To confirm the capability of the proposed method, extensive experiments are made on publically available dataset i.e.GW20 and noted encouraging results compared to other contemporary works.
Keywords: Document Image Processing, Image Retrieval, Cosine Distance, Optical Character Reorganization, Word Spotting
Scope of the Article: Signal and Image Processing