Improving Messenger Phishing Detection Using Heterogeneous Phishing Data

Seokjin Oh, Keonwoong Noh, Sunyub Kim, Dohyung An, Woohwan Jung

Published: 2025, Last Modified: 06 Jan 2026ICEIC 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Recently, as messenger phishing has been occurring more frequently, the need for its detection has increased; however, datasets for messenger phishing detection are publicly unavailable. In this paper, we address the data scarcity problem of the newly collected messenger phishing dataset by leveraging various types of pre-existing auxiliary phishing data. Experimental results demonstrate that the error rate decreased by up to 1.81 % and the F1 score improved by 7.35% when smishing and voice phishing data are used. These findings confirm that integrating heterogeneous phishing data can mitigate the data scarcity problem and enhance messenger phishing detection performance.

External IDs:dblp:conf/elinfocom/OhNKAJ25