AIセキュリティポータル K Program
ChatSpamDetector: Leveraging Large Language Models for Effective Phishing Email Detection
Share
Abstract
The proliferation of phishing sites and emails poses significant challenges to existing cybersecurity efforts. Despite advances in malicious email filters and email security protocols, problems with oversight and false positives persist. Users often struggle to understand why emails are flagged as potentially fraudulent, risking the possibility of missing important communications or mistakenly trusting deceptive phishing emails. This study introduces ChatSpamDetector, a system that uses large language models (LLMs) to detect phishing emails. By converting email data into a prompt suitable for LLM analysis, the system provides a highly accurate determination of whether an email is phishing or not. Importantly, it offers detailed reasoning for its phishing determinations, assisting users in making informed decisions about how to handle suspicious emails. We conducted an evaluation using a comprehensive phishing email dataset and compared our system to several LLMs and baseline systems. We confirmed that our system using GPT-4 has superior detection capabilities with an accuracy of 99.70%. Advanced contextual interpretation by LLMs enables the identification of various phishing tactics and impersonations, making them a potentially powerful tool in the fight against email-based phishing threats.
to do this properly, you need more resources: The hidden costs of introducing simulated phishing campaigns
L. Brunken, A. Buckmann, J. Hielscher, M.A. Sasse
Published: 2023
A content-based phishing email detection method
H. Che, Q. Liu, L. Zou, H. Yang, D. Zhou, F. Yu
Published: 2017
High precision detection of business email compromise
A. Cidon, L. Gavish, I. Bleier, N. Korshun, M. Schweighauser, A. Tsitkin
Published: 2019
Phinding phish: An evaluation of anti-phishing toolbars
L.F. Cranor, S. Egelman, J.I. Hong, Y. Zhang
Published: 2007
Reading between the lines: Content-agnostic detection of spear-phishing emails
H. Gascon, S. Ullrich, B. Stritter, K. Rieck
Published: 2018
User context: an explanatory variable in phishing susceptibility
K.K. Greene, M. Steves, M. Theofanos, J. Kostick
Published: 2018
Learning from the ones that got away: Detecting new forms of phishing attacks
C.N. Gutierrez, T. Kim, R.D. Corte, J. Avery, D. Goldwasser, M. Cinque, S. Bagchi
Published: 2018
Accurate spear phishing campaign attribution and early detection
Y. Han, Y. Shen
Published: 2016
Devising and detecting phishing emails using large language models
F. Heiding, B. Schneier, A. Vishwanath, J. Bernstein, P.S. Park
Published: 2024
Detecting and characterizing lateral phishing at scale
Grant Ho, Asaf Cidon, Lior Gavish, Marco Schweighauser, Vern Paxson, Stefan Savage, Geoffrey M Voelker, David Wagner
Published: 2019
End-to-end measurements of email spoofing attacks
H. Hu, G. Wang
Published: 2018
Pickmail: a serious game for email phishing awareness training
G. Jayakrishnan, V. Banahatti, S. Lodha
Published: 2022
Email Summarization to Assist Users in Phishing Identification
Amir Kashapov, Tingmin Wu, Alsharif Abuadbba, Carsten Rudolph
Published: 2022.3.25
Large language models are zero-shot reasoners
Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
Published: 2022
DomainKeys Identified Mail (DKIM) Signatures
M. Kucherawy, D. Crocker, T. Hansen
Published: 2011
Domain-based Message Authentication, Reporting, and Conformance (DMARC)
M. Kucherawy, E. Zwicky
Published: 2015
D-fence: A flexible, efficient, and comprehensive phishing email detection system
J. Lee, F. Tang, P. Ye, F. Abbasi, P. Hay, D.M. Divakaran
Published: 2021
LSTM based phishing detection for big email data
Q. Li, M. Cheng, J. Wang, B. Sun
Published: 2022
Understanding the viability of gmail’s origin indicator for identifying the sender
E. Liu, L. Sun, A. Bellon, G. Ho, G.M. Voelker, S. Savage, I.N.S. Munyaka
Published: 2023
An intelligent cyber security phishing detection system using deep learning techniques
A. Mughaid, S. AlZu’bi, A. Hnaif, S. Taamneh, A. Alnajjar, E.A. Elsoud
Published: 2022
CADUE: content-agnostic detection of unwanted emails for enterprise security
M. Nabeel, E. Altinisik, H. Sun, I. Khalil, W.H. Wang, T. Yu
Published: 2021
“i didn’t click”: What users say when reporting phishing
N. Pilavakis, A. Jenkins, N. Kökciyan, K. Vaniea
Published: 2023
Leveraging synthetic data and PU learning for phishing email detection
F.Z. Qachfar, R.M. Verma, A. Mukherjee
Published: 2022
An investigation of phishing awareness and education over time: When and how to best remind users
Benjamin Reinheimer, Lukas Aldag, Peter Mayer, Mattia Mossano, Reyhan Duezguen, Bettina Lofthouse, Tatiana Von Landesberger, Melanie Volkamer
Published: 2020
Context-based clustering to mitigate phishing attacks
T. Saka, K. Vaniea, N. Kökciyan
Published: 2022
Sender Policy Framework (SPF) for Authorizing Use of Domains in E-Mail, Version 1
W. Schlitt, M.W. Wong
Published: 2006
Weak links in authentication chains: A large-scale analysis of email sender spoofing attacks
K. Shen, C. Wang, M. Guo, X. Zheng, C. Lu, B. Liu, Y. Zhao, S. Hao, H. Duan, Q. Pan, M. Yang
Published: 2021
Who is targeted by email-based phishing and malware?: Measuring factors that differentiate risk
C. Simoiu, A. Zand, K. Thomas, E. Bursztein
Published: 2020
Phishing email detection based on binary search feature selection
G. Sonowal
Published: 2020
B@bel: Leveraging email delivery for spam mitigation
G. Stringhini, M. Egele, A. Zarras, T. Holz, C. Kruegel, G. Vigna
Published: 2012
Chain-of-thought prompting elicits reasoning in large language models
J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, D. Zhou
Published: 2023
A first look at brand indicators for message identification (BIMI)
M. Yajima, D. Chiba, Y. Yoneya, T. Mori
Published: 2023
Presenting suspicious details in user-facing e-mail headers does not improve phishing detection
S.Y. Zheng, I. Becker
Published: 2022
Checking, nudging or scoring? evaluating e-mail user security tools
S.Y. Zheng, I. Becker
Published: 2023
Sok: Human-centered phishing susceptibility
S. Zhuo, R. Biddle, Y.S. Koh, D.M. Lottridge, G. Russello
Published: 2023
Share