Optimal Character Segmentation for Touching Characters in Tamil Language Palm Leaf Manuscripts using Horver Method
M. Mohamed Sathik1, R. Spurgen Ratheash2
1M.Mohamed Sathik*, Principal and Research Supervisor PG and Research Department of Computer Science, Sadakathullah Appa College, Tirunelveli, Tamil Nadu, India.
2R. Spurgen Ratheash, Research Scholar, Sadakathullah Appa College, Tirunelveli, Tamil Nadu, India.
Manuscript received on March 15, 2020. | Revised Manuscript received on March 30, 2020. | Manuscript published on April 10, 2020. | PP: 1010-1015 | Volume-9 Issue-6, April 2020. | Retrieval Number: E3126039520/2020©BEIESP | DOI: 10.35940/ijitee.E3126.049620
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: An optimality of an automatic character recognition for Tamil palm leaf manuscripts can be achieved only by an efficient segmentation of touching characters. In this article, the touching characters are segmented as a single character to achieve an optimum solution by the recognizer in Optical Character Recognition (OCR). The proposed method provides a novelty in touching character segmentation of Tamil palm leaf manuscripts. An initial process of separation of background image and foreground characters is applied on the palm leaf images by filtering and removing unwanted pieces of characters by noise removal methods. The thickening process overcomes the difficulty of small breakages in the characters. The aspect ratio of the character image can be used to categorize the character such as single or multi touching. Single touching is divided by yet another ways such as horizontal or vertical touching. Finally, the proposed algorithm for Horizontal and Vertical character segmentation named as HorVer method is applied on the horizontally and vertically touching characters to segment as independent character. Experimental result produces 91% of an accuracy on segmenting the touching characters in Tamil palm leaf manuscript images collected from various resources and Tamil Heritage Foundation (THF). A novelty method can be achieved in Tamil touching character segmentation by the proposed algorithm.
Keywords: Character Segmentation, Pre Processing, Touching Characters, Tamil Character Segmentation.
Scope of the Article: Digital signal Processing Theory.