Abstract: Recently, as messenger phishing has been occurring more frequently, the need for its detection has increased; however, datasets for messenger phishing detection are publicly unavailable. In this paper, we address the data scarcity problem of the newly collected messenger phishing dataset by leveraging various types of pre-existing auxiliary phishing data. Experimental results demonstrate that the error rate decreased by up to 1.81 % and the F1 score improved by 7.35% when smishing and voice phishing data are used. These findings confirm that integrating heterogeneous phishing data can mitigate the data scarcity problem and enhance messenger phishing detection performance.
External IDs:dblp:conf/elinfocom/OhNKAJ25
Loading