High Accurate and a Variant of k-fold Cross Validation Technique for Predicting the Decision Tree Classifier Accuracy
D. Mabuni1, S. Aquter Babu2
1Dr. D. Mabuni*, Department of Computer Science, Dravidian University, Kuppam, India.
2Dr. S. Aquter Babu, Department of Computer Science, Dravidian University, Kuppam, India.
Manuscript received on December 23, 2020. | Revised Manuscript received on January 16, 2020. | Manuscript published on January 10, 2021. | PP: 105–110 | Volume-10 Issue-3, January 2021 | Retrieval Number: 100.1/ijitee.C84030110321| DOI: 10.35940/ijitee.C8403.0110321
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In machine learning data usage is the most important criterion than the logic of the program. With very big and moderate sized datasets it is possible to obtain robust and high classification accuracies but not with small and very small sized datasets. In particular only large training datasets are potential datasets for producing robust decision tree classification results. The classification results obtained by using only one training and one testing dataset pair are not reliable. Cross validation technique uses many random folds of the same dataset for training and validation. In order to obtain reliable and statistically correct classification results there is a need to apply the same algorithm on different pairs of training and validation datasets. To overcome the problem of the usage of only a single training dataset and a single testing dataset the existing k-fold cross validation technique uses cross validation plan for obtaining increased decision tree classification accuracy results. In this paper a new cross validation technique called prime fold is proposed and it is experimentally tested thoroughly and then verified correctly using many bench mark UCI machine learning datasets. It is observed that the prime fold based decision tree classification accuracy results obtained after experimentation are far better than the existing techniques of finding decision tree classification accuracies.
Keywords: Decision Tree Classification Accuracy, K-Fold Cross Validation Technique, Machine learning, Prediction.
Scope of the Article: Machine learning