These labels were automatically added by AI and may be inaccurate. For details, see About Literature Database.
Abstract
Phishing email attacks are among the most common and most harmful
cybersecurity attacks. With the emergence of generative AI, phishing attacks
can be based on emails generated automatically, making it more difficult to
detect them. That is, instead of a single email format sent to a large number
of recipients, generative AI can be used to send each potential victim a
different email, making it more difficult for cybersecurity systems to identify
the scam email before it reaches the recipient. Here we describe a corpus of
AI-generated phishing emails. We also use different machine learning tools to
test the ability of automatic text analysis to identify AI-generated phishing
emails. The results are encouraging, and show that machine learning tools can
identify an AI-generated phishing email with high accuracy compared to regular
emails or human-generated scam email. By applying descriptive analytic, the
specific differences between AI-generated emails and manually crafted scam
emails are profiled, and show that AI-generated emails are different in their
style from human-generated phishing email scams. Therefore, automatic
identification tools can be used as a warning for the user. The paper also
describes the corpus of AI-generated phishing emails that is made open to the
public, and can be used for consequent studies. While the ability of machine
learning to detect AI-generated phishing email is encouraging, AI-generated
phishing emails are different from regular phishing emails, and therefore it is
important to train machine learning systems also with AI-generated emails in
order to repel future phishing attacks that are powered by generative AI.
External Datasets
AI-generated phishing email corpus
Enron email dataset
Nigerian scam email dataset
Ling-Spam dataset
References
Proceedings of the Anti-phishing Working Groups 2nd Annual eCrime Researchers Summit
A comparison of machine learning techniques for phishing detection
Abu-Nimeh, S., Nappa, D., Wang, X., Nair, S.
Published: 2007
Computers & Security
Applying machine learning and natural language processing to detect phishing email
A. Alhogail, A. Alsabih
Published: 2021
IEEE communications surveys & tutorials
A survey of phishing email filtering techniques
A. Almomani, B. B. Gupta, S. Atawneh, A. Meulenberg, E. Almomani
Published: 2013
Sensors
Advancing phishing email detection: A comparative study of deep learning models
Altwaijry, N., Al-Turaiki, I., Alotaibi, R., Alakeel, F.
Published: 2024
Electronics
Phishing email detection model using deep learning
Atawneh, S., Aljehani, H.
Published: 2023
Journal of Computer Science
Machine learning and deep learning for phishing email classification using one-hot encoding
Bagui, S., Nandi, D., Bagui, S., White, R. J.
Published: 2021
Automatic categorization of email into folders: Benchmark experiments on enron and sri corpora
Bekkerman, R.
Published: 2004
Journal of computer security
New filtering approaches for phishing email
Bergholz, A., De Beer, J., Glahn, S., Moens, M.-F., Paaß, G., Strobel, S.