Optimizing credit risk assessment with ensemble sampling and hybrid machine learning models

dc.contributor.authorMucheru, N.
dc.date.accessioned2026-04-24T10:37:22Z
dc.date.issued2025
dc.descriptionFull - text thesis
dc.description.abstractAccurate credit risk modeling is essential for minimizing financial losses, but class imbalance, where defaulters make up a small fraction of the data, remains a challenge. This study tackles the issue using ensemble sampling and hybrid machine learning models. A Kaggle dataset with 32,582 entries was used in this study. SMOTE + Random Under sampling, ADASYN + Random Under sampling, Borderline-SMOTE + Random Under sampling, SVM-SMOTE + Random Under sampling, and SMOTE-TOMEK, were applied before training. Our findings reveal that Random Forest with Borderline-SMOTE + Random Under sampling achieved the highest recall, while SMOTE + Random Under sampling with Random Forest achieved highest AUC. While hybrid machine learning models improved precision, they sacrificed recall. This study reinforces the power of ensemble sampling and hybrid approaches in credit risk modeling, with future research focusing on dynamic thresholding and advanced ensemble strategies to refine predictions. Keywords: Credit risk modeling, Class Imbalance, Ensemble sampling, Hybrid machine learning, Random Forest
dc.identifier.citationMucheru, N. (2025). Optimizing credit risk assessment with ensemble sampling and hybrid machine learning models [Strathmore University]. https://hdl.handle.net/11071/16461
dc.identifier.urihttps://hdl.handle.net/11071/16461
dc.language.isoen_US
dc.publisherStrathmore University
dc.titleOptimizing credit risk assessment with ensemble sampling and hybrid machine learning models
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Optimizing credit risk assessment with ensemble sampling and hybrid machine learning models.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: