Loading

Association Rule Mining on Spambase Dataset using Tanagra
Pushpa Devi1, Vikrant Singh Bhardwaj2, K.L. Bansal3

1Pushpa Devi*, Department of Computer Science, Himachal Pradesh University, Summerhill, Shimla, Himachal Pradesh, India.
2Vikrant Singh Bhardwaj, Department of computer Science, Himachal Pradesh University, Summerhill, Shimla.
3Dr. Kishori Lal Bansal, Professor Department of Computer Science, Himachal Pradesh University, Summerhill, Shimla, Himachal Pradesh, India.
Manuscript received on March 15, 2020. | Revised Manuscript received on April 02, 2020. | Manuscript published on April 10, 2020. | PP: 890-894 | Volume-9 Issue-6, April 2020. | Retrieval Number: C8022019320/2020©BEIESP | DOI: 10.35940/ijitee.C8022.049620
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: DThere is huge amount of data being generated every minute on internet. This data is of no use until we cannot extract useful information from it. Data mining is the process of extracting useful information or knowledge from this huge amount of data that can be further used for various purposes. Discovering Association rules is one of the most important tasks among all other data mining tasks. Association rules contain the rules in the form of IF then THAN form. The leftmost part of the rule i.e. IF is called as the Antecedent which defines the condition and the rightmost part i.e. ELSE is called as the Consequent which defines the result. In this paper, we present the overview and comparison of Apriori, Apriori PT and Frequent Itemsets algorithm of association component in Tanagra Tool. We analyzed the performance based on the execution time and memory used for different number of instances, support and Rule Length in Spambase Dataset. The results show that when we increase the support value the Apriori PT takes the less execution time and Apriori takes less memory space. When numbers of instances are reduced Frequent Itemsets outperforms well both in case of memory and execution time. When rule length is increased the Apriori algorithm performs better than Apriori PT and Frequent Itemsets. 
Keywords: Apriori, Apriori PT, Frequent Item sets, support, Confidence, Tanagra.
Scope of the Article: Data mining and warehousing