Text Mining with Topical and Informative Measures using Semantic Ontology
G Shobarani1, K Arulanandham2
1G Shobarani, Ph.D Research Scholar, Bharathiar University, Coimbatore, Tamilnadu, India.
2K Arulanandham, Head, Department of Computer Applications, Government Thirumagal Mills College, Gudiyattam, Tamilnadu, India.
Manuscript received on 24 August 2019. | Revised Manuscript received on 05 September 2019. | Manuscript published on 30 September 2019. | PP: 4234-4238 | Volume-8 Issue-11, September 2019. | Retrieval Number: K23360981119/2019©BEIESP | DOI: 10.35940/ijitee.K2336.0981119
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The text mining has been identified as dominant throughout the era in many scientific problems. Number of techniques have been identified and proposed earlier, each uses different features towards mining text information. However, they suffer to achieve higher performance in mining relevant text information or documents. To support the development of text mining tasks, an efficient topical and informative measures based algorithm is presented, which uses semantic ontology and the taxonomy as dictionary. The text features of the documents has been extracted to generate set of terms. For any document, the text features are used to remove the noisy features like stop words, stemming and tagging. With the noise removed pure terms, the method estimates the Topical Depth Similarity (TDS) and informative Depth Similarity (IDM) measures. The measures has been estimated towards each document to perform text mining. The input query has been estimated for the TDS measure to identify the category of the query. According to the category of query, the method estimates the TDS and IDS measures to mining the text. The proposed method improve the performance of text mining with reduces false classification ratio.
Keywords: Text Mining, Semantic Ontology, Taxonomy, TDS,IDS, Feature Extraction, Similarity Measures.
Scope of the Article: Text Mining