Quantitative Finance and Economics | |
Machine learning and artificial neural networks to construct P2P lending credit-scoring model: A case using Lending Club data | |
Shih-Kuei Lin1  An-Hsing Chang1  Li-Kai Yang2  Rua-Huan Tsaih3  | |
[1] 1. Department of Money and Banking, National Chengchi University, Taiwan;2. Department of Investment and Trading, Cathay Life Insurance Co., Ltd., Taiwan;3. Department of Management Information Systems, National Chengchi University, Taiwan; | |
关键词: p2p lending; credit score; machine learning; artificial neural networks; feature engineering; lending club; | |
DOI : 10.3934/QFE.2022013 | |
来源: DOAJ |
【 摘 要 】
In this study, we constructed the credit-scoring model of P2P loans by using several machine learning and artificial neural network (ANN) methods, including logistic regression (LR), a support vector machine, a decision tree, random forest, XGBoost, LightGBM and 2-layer neural networks. This study explores several hyperparameter settings for each method by performing a grid search and cross-validation to get the most suitable credit-scoring model in terms of training time and test performance. In this study, we get and clean the open P2P loan data from Lending Club with feature engineering concepts. In order to find significant default factors, we used an XGBoost method to pre-train all data and get the feature importance. The 16 selected features can provide economic implications for research about default prediction in P2P loans. Besides, the empirical result shows that gradient-boosting decision tree methods, including XGBoost and LightGBM, outperform ANN and LR methods, which are commonly used for traditional credit scoring. Among all of the methods, XGBoost performed the best.
【 授权许可】
Unknown