AN ENHANCED CLASSIFICATION AND REGRESSION TREE ALGORITHM USING GINI EXPONENTIAL
Keywords:
Gini index, Information gain, Decision Tree, Classification, Regression TreeAbstract
Decision tree algorithms, particularly Classification and Regression Trees (CART), are widely used in machine learning for their simplicity, interpretability, and ability to handle both categorical and numerical data. However, traditional decision trees often encounter limitations when dealing with complex, high-dimensional, or imbalanced datasets, as conventional impurity measures such as the Gini Index and Information Gain may fail to capture subtle variations in the data effectively. This study enhances the traditional Classification and Regression Trees (CART) model by introducing the Gini Exponential Criterion, which incorporates an exponential weighting factor into the split point calculation process. This novel approach amplifies the influence of highly discriminative features, resulting in more refined splits and improved decision boundaries. The enhanced CART model was evaluated on two benchmark datasets: the wine quality dataset and the hypothyroid dataset, with preprocessing steps like feature scaling and SMOTE for class imbalance, and hyperparameter tuning via Bayesian Optimization. On the wine quality dataset, the enhanced model improved accuracy from 57% (traditional CART) to 86%, while on the hypothyroid dataset, it achieved an impressive accuracy of 98%. These results highlight the model's ability to handle complex and imbalanced data effectively. Feature importance analysis and decision tree visualization further demonstrated the model's interpretability. The study concludes that the Gini Exponential Criterion significantly improves CART's performance, offering better generalization and clearer decision boundaries. This advancement is particularly valuable for applications requiring precise and interpretable predictions, such as healthcare diagnostics and quality assessment. Future work could explore integrating this criterion into ensemble methods and...
Published
How to Cite
Issue
Section
FUDMA Journal of Sciences
How to Cite
Most read articles by the same author(s)
- Hadiza Hassan, Muhammad Aminu Ahmad, Rabi Mustapha, AN ENHANCED FEATURE ENGINEERING TECHNIQUE FOR CREDIT CARD FRAUD DETECTION , FUDMA JOURNAL OF SCIENCES: Vol. 8 No. 4 (2024): FUDMA Journal of Sciences - Vol. 8 No. 4
- Kumawuese Jennifer Kurugh, Muhammad Aminu Ahmad, Awwal Ahmad Babajo, THE EFFECT OF DATASETS ON BREAST CANCER DETECTION MODELS , FUDMA JOURNAL OF SCIENCES: Vol. 4 No. 4 (2020): FUDMA Journal of Sciences - Vol. 4 No. 4
- Suleiman Dauda, Muhammad Aminu Ahmad, Ahmad Abubakar Aliyu, Mohammed Ibrahim, Sa'adatu Abdulkadir, Abubakar Mu'azu Ahmed, A. S. Mukhtar, S. Bello, COMBATTING BANKING MALWARE THREATS: EVALUATING THE EFFICACY OF HYBRID AND SINGLE-CLASSIFICATION ALGORITHMS , FUDMA JOURNAL OF SCIENCES: Vol. 9 No. 3 (2025): FUDMA Journal of Sciences - Vol. 9 No. 3
- Maryam Lawal Ibrahim, Muhammad Aminu Ahmad, AN ENHANCED RGB PROJECTION ALGORITHM FOR CONTENT BASED IMAGE RETRIEVAL , FUDMA JOURNAL OF SCIENCES: Vol. 3 No. 1 (2019): FUDMA Journal of Sciences - Vol. 3 No. 1
- Aliyu Sulaiman Mukhtar, Muhammad Aminu Ahmad, Mohammad Ibrahim, Saadatu Abdulkadir, Abubakar Mu’azu, Sulaiman Dauda, Abdullahi Diso, ENHANCING AGE ESTIMATION FROM SCLERA IMAGES USING RESNET-50, VGG16, AND RANDOM FOREST , FUDMA JOURNAL OF SCIENCES: Vol. 9 No. 4 (2025): FUDMA Journal of Sciences - Vol. 9 No. 4