Abstract: Internet finance fraud is an increasingly serious social and economic problem. Online loan services are a typical mode of Internet finance. Constructing machine learning models has become a promising solution for Internet financial anti-fraud. In the process of constructing a fraud detection model, feature engineering is the most critical step. The quality of features directly affects the performance of models. It is also one of the most time-consuming and customized steps. The existing Internet finance anti-fraud model is mainly carried out by experts using manually constructed features based on business knowledge. However, this artificial feature construction method is time-consuming and labor-intensive in the ever-changing Internet finance scenarios, and it has the disadvantages of missing complex features. Artificial feature construction methods can no longer meet the increasing demand for anti-fraud. Automated feature engineering is a promising solution for efficiently constructing anti-fraud models. To evaluate fraud risk for online loan services, this paper proposes an automated feature engineering method based on deep feature synthesis and feature selection, which can be easily cooperated with the current mainstream machine learning models. This method reduces the scale of synthesized features by dividing the original data table into small parts. Combined with the designed feature selection strategy, the method picks out important features from a large number of redundant and meaningless synthesized features. Experiments indicate that our method can improve the efficiency of feature construction and select important features for anti-fraud models.
Loading