SMOTE Oversampling and Near Miss Undersampling Based Diabetes Diagnosis from Imbalanced Dataset with XAI Visualization
View/ Open
Date
2023-05Author
Nayan, Nasim Mahmud
Islam, Ashraful
Islam, Muhammad Usama
Ahmed, Eshtiak
Hossain, Mohammad Mobarak
Alam, Md Zahangir
Metadata
Show full item recordAbstract
This study investigated the predictive ability of ten different machine learning (ML) models for diabetes using a dataset that was not evenly distributed. Additionally, the study evaluated the effectiveness of two oversampling and undersampling methods, namely the Synthetic Minority Oversampling Technique (SMOTE) and the Near-Miss algorithm. Explainable Artificial Intelligence (XAI) techniques were employed to enhance the interpretability of the model’s predictions. The results indicate that the extreme gradient boosting (XGB) model combined with SMOTE oversampling technique exhibited the highest accuracy and an F1-score of 99% and 1.00 respectively. Furthermore, the utilization of XAI methods increased the dependability of the model’s decision-making process, rendering it more appropriate for clinical use. These results imply that integrating XAI with ML and oversampling techniques can enhance the early detection and management of diabetes, leading to better diagnosis and intervention.
Collections
- 2023 [67]