COMPREHENSIVE PERFORMANCE ANALYSIS OF MACHINE LEARNING BASED ALGORITHMS FOR CREDIT DEFAULT PREDICTION USING LENDING CLUB DATASET

Authors

  • ANOOP SUNARTY Department of Computer Sciences and Applications, Mandsaur University, Mandsaur, Madhya Pradesh, India

DOI:

https://doi.org/10.22159/ijet.2025v13.57906

Keywords:

Credit default prediction, Machine learning, XGBoost, Lending club dataset, Credit risk assessment, Predictive analytics

Abstract

Objective: To develop and evaluate a machine learning–based framework for accurate prediction of credit default using the Lending Club dataset.

Methods: The study applies extensive data preprocessing including missing value treatment, outlier removal using Z-score, one-hot encoding of categorical variables, Min–Max normalization, and class balancing using SMOTE. Several models were evaluated, with Extreme Gradient Boosting (XGBoost) used as the primary classifier. Data were split into 80% training and 20% testing. Performance was measured using accuracy, precision, recall, and AUC.

Results: XGBoost achieved the best performance with accuracy of 91.56%, precision of 94.78%, recall of 93.30%, and AUC of 97.07, outperforming decision tree, logistic regression, artificial neural networks, gradient boosting, and convolutional neural networks.

Conclusion: The proposed XGBoost-based framework demonstrates strong potential for real-world credit risk assessment by effectively handling large and imbalanced datasets and improving early identification of default risk.

References

1. Xu J. Research on the impact of credit risk in commercial banks under the background of fintech. Highlights Business Econ Manag 2025;55:198-214.

2. Chandrashekar P. Data-driven loan default prediction: enhancing business process workflows with machine learning. Int J Emerg Res

Eng Technol 2025;6:198-214.

3. Lin J. Research on loan default prediction based on logistic regression, randomforest, xgboost and adaboost. SHS Web Conf 2024;181:02008.

4. Majumder RQ. A review of anomaly identification in finance frauds using machine learning systems. Int J Adv Res Sci Commun Technol 2025;5:101-10.

5. Wawge SJ. A survey on the identification of credit card fraud using machine learning with precision, performance, and challenges. Int J Innov Sci Res Technol 2025;10:3345-52.

6. Verma V. Deep learning-based fraud detection in financial transactions: A case study using real-time data streams. ESP J Eng Technol Adv 2023;3:149-57.

7. Cui X, Huang Y, Lai S, Keong CI, Wu D. Machine learning on credit risk for lending club business. BCP Bus Manag 2023;23:894-902.

8. Thakkar KB, Kapadia HP. The Roadmap to Digital Transformation in Banking: Advancing Credit Card Fraud Detection with Hybrid Deep Learning Model. In: 2025 2nd International Conference on Trends in Engineering Systems and Technologies (ICTEST), IEEE; 2025. p. 1-6.

9. Ande BR. Federated learning and explainable AI for decentralized fraud detection in financial systems. J Inf Syst Eng Manag 2025;10:48-56.

10. Bilipelli AR. Forecasting the evolution of cyber attacks in fintech using transformer-based time series models. Int J Res Anal Rev 2023;10:383-9.

11. Zhang X, Zhang T, Hou L, Liu X, Guo Z, Tian Y, et al. Data-driven loan default prediction: Amachine learning approach for enhancing business process management. Systems 2025;13:581.

12. Mantha G. Transforming the insurance industry with salesforce: Enhancing customer engagement and operational efficiency. North Am J Eng Res 2024;5:???.

13. Macha Y, Pulichikkunnu SK. An explainable AI system for fraud identification in insurance claims via machine-learning methods. Int J Adv Res Sci Commun Technol 2023;3:1391-400.

14. Yu G. Intelligent Credit Assessment and Risk Prediction Model based on Machine Learning. In: 2025 3rd International Conference on Data Science and Information System (ICDSIS), IEEE; 2025. p. 1-7.

15. Chauhan S. Machine Learning Models for Loan Default Forecasting: Accuracy Comparison. In: 2024 Second International Conference Computational and Characterization Techniques in Engineering and Sciences (IC3TES), 2024. p. 1-5.

16. Saravana Kumar T, Ahadit AB. Machine Learning Applications in Banking for Credit Rating and Risk Evaluation. In: 2024 Third International Conference on Artificial Intelligence, Computational Electronics and Communication System (AICECS), 2024. p. 1-6.

17. Al Smadi B, Glisson WB, Tahat M, Alamleh H, AlQahtani AA. Credit Card Transactions Fraud Detection for Multiple Consumer Behaviors. In: 2024 International Conference on Computing, Networking and Communications (ICNC); 2024. p. 26-32.

18. Deborah RN, Rajiv SA, Vinora A, Devi CM, Arif SM. An Efficient Loan Approval Status Prediction Using Machine Learning. In: Proceedings of 3rd International Conference on Advanced Computing Technologies and Applications, ICACTA; 2023.

19. Chen H. Prediction and analysis of financial default loan behavior based on machine learning model. Comput Intell Neurosci 2022;2022:7907210.

20. Karim M, Samad MF, Muntasir F. Improving Performance Factors of an Imbalanced Credit Risk Dataset Using SMOTE. In: 2022 4th International Conference on Electrical, Computer and Telecommunication Engineering (ICECTE), IEEE; 2022. p. 1-4.

21. Gupta A, Gulati P, Chakrabarty SP. Classification-based credit risk analysis: The case of lending club. Lect Notes Netw Syst 2024;964:77-86.

22. Tiwari SK. Quality assurance strategies in developing high-performance financial technology solutions. Int J Data Sci Mach Learn 2025;5:323-35.

23. Qian H, Zhang S, Wang B, Peng L, Gao S, Song Y. A comparative study on machine learning models combining with outlier detection and balanced sampling methods for credit scoring; arXiv, 2021.

24. Malali N. Predictive AI for identifying lapse risk in life insurance policies: Using machine learning to foresee and mitigate policyholder attrition. Int J Adv Res Sci Commun Technol 2025;5:1-11.

25. Shah TV. Leadership in digital transformation: Enhancing customer value through AI-driven innovation in financial services marketing. Int J Sci Res Arch 2025;15:618-27.

26. Chen Y, Dong Y, Liu W. Prediction of credit default based on the XGBoost model. Appl Comput Eng 2024;96:85-92.

27. Kurakula SR. The role of AI in transforming enterprise systems architecture for financial services modernization. J Comput Sci Technol Stud 2025;7:181-6.

28. Deva MR, Jain N. Utilizing Azure Automated Machine Learning and XGBoost for Predicting Cloud Resource Utilization in Enterprise Environments. In: 2025 International Conference on Networks and Cryptology (NETCRYPT), IEEE; 2025. p. 535-40.

29. Kabade S. Smart pension payroll management enhances accuracy and efficiency through AI and cloud integration. TIJER Int Res J 2024;11:915-21.

30. Kali H. Optimizing credit card fraud transactions identification and classification in banking industry using machine learning algorithms. Int J Recent Technol Sci Manag 2024;9:85-96.

31. Omogbhemhe MI, Momodu IB. Model for predicting bank loan default using XGBoost. Int J Comput Appl 2021;183:1-4.

32. Yang R. Machine learning-based loan default prediction in peer-to-peer lending. Highlights Sci Eng Technol 2024;94:310-8.

33. Robinson N, Sindhwani N. Loan Default Prediction Using Machine Learning. 2024 11th ICRITO 2025. 12th International Conference on Reliability, Infocom Technologies and Optimization. (Future Trends and their Directions). Vol. 25. ICRITO; 2024. p. 15-8.

34. Chang V, Sivakulasingam S, Wang H, Wong ST, Ganatra MA, Luo J. Credit risk prediction using machine learning and deep learning: A study on credit card customers. Risks 2024;12:174.

35. Suhadolnik N, Ueyama J, Da Silva S. Machine learning for enhanced credit risk assessment: An empirical approach. J Risk Financ Manag 2023;16:496.

36. Ko PC, Lin PC, Do HT, Huang YF. P2P lending default prediction based on AI and statistical models. Entropy 2022;24:1-23.

Published

01-12-2025

How to Cite

ANOOP SUNARTY. (2025). COMPREHENSIVE PERFORMANCE ANALYSIS OF MACHINE LEARNING BASED ALGORITHMS FOR CREDIT DEFAULT PREDICTION USING LENDING CLUB DATASET. Innovare Journal of Engineering and Technology, 13, 28–33. https://doi.org/10.22159/ijet.2025v13.57906

Issue

Section

Original Article(s)

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.