University of Rwanda Digital Repository

Enhancement of Credit Score Prediction for imbalanced Datasets Using Data Mining Approaches (Case study: Bank of Kigali)

Show simple item record

dc.contributor.author BETT KIPKIRUI, ERICK
dc.date.accessioned 2021-12-21T12:50:00Z
dc.date.available 2021-12-21T12:50:00Z
dc.date.issued 2021
dc.identifier.uri http://hdl.handle.net/123456789/1469
dc.description Master's Dissertation en_US
dc.description.abstract Credit score prediction is the most effective way of analyzing whether a potential client is eligible for loan or not especially in financial institutions where class imbalance problems are prevalent. However, limited number of credit score prediction models in banking institutions take into consideration imbalance data and again, the best resampling technique to be applied with imbalanced data is still a challenge. Therefore, in an attempt to address these problems, this research presents an empirical comparison of various combinations of data imbalance resampling techniques and machine learning algorithms used to address this challenge of imbalance data. This study utilized credit score secondary data from bank of Kigali with 58096 customer transaction with 31 variables. The time scope of data was limited to 2018-2019. Modelling the data and handling class imbalance was done using python jupyter notebook libraries. The credit score prediction from each combination were evaluated with F_Beta, F1_score, precision, recall score to avoid biasness towards majority class hence taking into consideration effectiveness in each technique and model used which have not been considered in similar studies. An experimental result was done in this research using resampling techniques and machine learning algorithms to enhance credit score prediction for bank of Kigali clients. The findings suggest that combining oversampling technique and random forest algorithm yield the best prediction of 96.42% and F_Beta of 97.56% among the rest hence the most effective way of evaluating the eligibility of customers in the banking institution. en_US
dc.language.iso en en_US
dc.subject Machine learning, credit Score, imbalance data, algorithm, defaulter, non-defaulter en_US
dc.title Enhancement of Credit Score Prediction for imbalanced Datasets Using Data Mining Approaches (Case study: Bank of Kigali) en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Browse

My Account