A Simple Semantic Web Crawler for Intelligent Information Retrieval from Academic Websites
V.Kiran Kumar1, Ramya2
1Dr V.Kiran Kumar*, Associate Professor, Department of Computer Science, Dravidian University, Kuppam, Andhra Pradesh.
2Mrs Ramya, Research Scholar, Department of Computer Science, Dravidian University, Kuppam.
Manuscript received on January 16, 2020. | Revised Manuscript received on January 22, 2020. | Manuscript published on February 10, 2020. | PP: 3255-3258 | Volume-9 Issue-4, February 2020. | Retrieval Number: D2085029420/2020©BEIESP | DOI: 10.35940/ijitee.D2085.029420
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In various applications data is shared and reused through a common framework like Semantic Web. In essence, in the ever-expanding sphere of the web, huge quantities of web content is created and made available for men and machines for their interpretation and application. In the present paper, an attempt has been made to develop a “scutter”, other wisely known as semantic crawler that essentially collects and stores information in a centrally-located database by crawling through the semantic content. The projected „scutter‟ is based on Jena 3.0 Framework, which is a freely downloadable language software available on https://jena.apache.org/download/. In this process, a RDF file is taken as the seed input file after which the „scutter‟ accesses other RDF documents by crawling through „rdfs:seeAlso‟ property, thereby designated as an automatic extraction of semantic information emanating from various websites. Also, certain privacy related issues, especially in FOAF metadata is discussed in this paper.
Keywords: Semantic Web, RDF, RDFS, Semantic.
Scope of the Article: Web Technologies