Telecom bank card fraud prediction model based on machine learning
Abstract
It is particularly important to identify and prevent telecom scams that trick victims into transferring funds through phone calls, Internet and text messages. Based on the collected data, a prediction model of telecom bank card fraud is established in this paper. In the analysis, we first checked the data missing through Pandas and missingno library, and conducted Pearson correlation analysis, and found that the ratio of transaction amount has a strong positive correlation with fraud. In terms of data preprocessing, outliers are defined and data are cleaned by box diagram, missing values are processed by KNN filling, and data is normalized by Yeo-Johnson transformation. Then, the importance of features is calculated by random forest and GBDT, and the features with greater influence are selected. In the model training, XGBoost, LightGBM and CatBoost integrated learning algorithms were selected, and the optimal model configuration was obtained through parameter optimization, and finally integrated into BaggingClassifier. The model performance evaluation shows that the prediction accuracy of the model established in this paper is up to 99.99%.
Show Figures
Share and Cite
Article Metrics
References
- Wang Wei. A Credit Card Fraud Prediction Model Based on Improved Focal Loss Function XGBoost [J] Information Record Materials, 2022, 23 (12): 192-196.
- Yi Deyan Analysis and Research on Telecom Fraud Prevention Based on Support Vector Machine [D] University of International Business and Economics, 2024.
- Xiao Wenqin Research on Telecom Fraud Identification Based on BP Neural Network [D] Central China Normal University, 2023 .
- Sun Yujia Research on Fraud Phone Identification Based on User Communication Behavior Data [D] Capital University of Economics and Trade, 2023.
- Sun Yue, Ding Jianli A Stacking Integrated Prediction Model for Flight Delays in Adverse Weather Conditions [J/OL] Big data: 1-18 [2024-06-08].
- Chen Xiaoling, Zhang Cong, Huang Xiaoyu Research on Grain Yield Prediction Based on Bayesian LightGBM Model [J] China Journal of Agricultural Machinery Chemistry, 2024, 45 (06): 163-169.
- Pang Songling, Fan Kaidi, Chen Chao, etc A multi time scale prediction model for electric vehicle charging load based on LightGBM algorithm and travel chain theory [J/OL] Automotive Technology: 1-8 [2024-06-08].
- Jin Wanying Research on 5G Telecom User Prediction Based on Data Mining [D] Dalian University of Technology, 2022.
- Liu Bofei 5G potential user identification based on ensemble learning [D] Dalian University of Technology, 2022.
- Yu Jiang The research and application of data mining technology in the telecommunications field [D] Xiangtan University, 2022.