AI Security Portal K Program
An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach
Abstract
Phishing email is a serious cyber threat in which attackers send deceptive messages to steal confidential information or cause financial harm. Attackers, often posing as trustworthy entities, exploit technological advances and increasingly sophisticated techniques, which makes phishing harder to detect and prevent. Despite extensive academic research, phishing detection remains an ongoing and formidable challenge in the cybersecurity landscape. Large Language Models (LLMs) and Masked Language Models (MLMs) have considerable potential to address such long-standing challenges. In this paper, we present an optimized, fine-tuned transformer-based DistilBERT model for detecting phishing emails. We work with a phishing email dataset and apply preprocessing techniques to clean the data and address class imbalance. In our experiments, the model achieves high accuracy. Finally, we explain the fine-tuned model's predictions using Explainable AI (XAI) techniques such as Local Interpretable Model-Agnostic Explanations (LIME) and Transformer Interpret, in the context of text classification for phishing emails.
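The abstract does not say which technique is used to resolve the class imbalance. As a minimal sketch, assuming plain random oversampling of the minority class (one common option, not necessarily the authors' choice), the preprocessing step could look like the following. The function name `oversample_minority` and the toy emails are hypothetical and serve only as illustration.

```python
import random
from collections import Counter


def oversample_minority(texts, labels, seed=0):
    """Duplicate random samples of smaller classes until all classes
    have as many samples as the largest class (assumed technique)."""
    rng = random.Random(seed)

    # Group the texts by their class label.
    by_label = {}
    for text, label in zip(texts, labels):
        by_label.setdefault(label, []).append(text)

    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_label.values())

    balanced = []
    for label, items in by_label.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        balanced.extend((text, label) for text in items + extra)

    # Shuffle so duplicated samples are not clustered together.
    rng.shuffle(balanced)
    out_texts = [text for text, _ in balanced]
    out_labels = [label for _, label in balanced]
    return out_texts, out_labels


# Hypothetical toy data: 1 = phishing, 0 = legitimate.
emails = [
    "Verify your account now",
    "Meeting at 10",
    "Lunch tomorrow?",
    "Quarterly report attached",
]
labels = [1, 0, 0, 0]

balanced_emails, balanced_labels = oversample_minority(emails, labels)
print(Counter(balanced_labels))
```

In practice, oversampling of this kind is usually applied to the training split only, so that duplicated emails do not leak into the evaluation data.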
Phishing email detection using natural language processing techniques: a literature survey
S. Salloum, T. Gaber, S. Vadera, K. Shaalan
Published: 2021
A survey of phishing email filtering techniques
A. Almomani, B. B. Gupta, S. Atawneh, A. Meulenberg, E. Almomani
Published: 2013
A comprehensive survey of AI-enabled phishing attacks detection techniques
A. Basit, M. Zafar, X. Liu, A. R. Javed, Z. Jalil, K. Kifayat
Published: 2021
Machine learning for intelligent data analysis and automation in cybersecurity: current and future prospects
I. H. Sarker
Published: 2023
Transformer in transformer
K. Han, A. Xiao, E. Wu, J. Guo, C. Xu, Y. Wang
Published: 2021
Sentiment analysis on the impact of coronavirus in social life using the BERT model
M. Singh, A. K. Jakhar, S. Pandey
Published: 2021
BERT: Pre-training of deep bidirectional transformers for language understanding
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Published: 2019
Survey of BERT-base models for scientific text classification: COVID-19 case study
M. Khadhraoui, H. Bellaaj, M. B. Ammar, H. Hamam, M. Jmaiel
Published: 2022
Explainable AI: A brief survey on history, research areas, approaches and challenges
F. Xu, H. Uszkoreit, Y. Du, W. Fan, D. Zhao, J. Zhu
Published: 2019
Interpretable Bangla sarcasm detection using BERT and explainable AI
R. Anan, T. S. Apon, Z. T. Hossain, E. A. Modhu, S. Mondal, M. G. R. Alam
Published: 2023
AI-driven cybersecurity and threat intelligence: cyber automation, intelligent decision-making and explainability
I. H. Sarker
Published: 2024
Explainable AI for cybersecurity automation, intelligence and trustworthiness in digital twin: Methods, taxonomy, challenges and prospects
I. H. Sarker, H. Janicke, A. Mohsin, A. Gill, L. Maglaras
Published: 2024
ExplainableDetector: Exploring transformer-based language modeling approach for SMS spam detection with explainability analysis
M. A. Uddin, M. N. Islam, L. Maglaras, H. Janicke, I. H. Sarker
Published: 2025
The Role of Machine Learning in Cybersecurity
Giovanni Apruzzese, Pavel Laskov, Edgardo Montes de Oca, Wissam Mallouli, Luis Burdalo Rapa, Athanasios Vasileios Grammatopoulos, Fabio Di Franco
Published: 2022.6.20
A machine learning approach towards phishing email detection
N. Harikrishnan, R. Vinayakumar, K. Soman
Published: 2018
Using feature selection and classification scheme for automating phishing email detection
I. R. A. Hamid, J. Abawajy, T. Kim
Published: 2013
Phishing web site detection using diverse machine learning algorithms
A. Zamir, H. U. Khan, T. Iqbal, N. Yousaf, F. Aslam, A. Anjum, M. Hamdani
Published: 2020
Email classification for forensic analysis by information gain technique
D. E. Salhi, A. Tari, M. T. Kechadi
Published: 2021
Applying machine learning and natural language processing to detect phishing email
A. Alhogail, A. Alsabih
Published: 2021
Intelligent deep learning based cybersecurity phishing email detection and classification
R. Brindha, S. Nandagopal, H. Azath, V. Sathana, G. P. Joshi, S. W. Kim
Published: 2023
Phish responder: A hybrid machine learning approach to detect phishing and spam emails
M. Dewis, T. Viana
Published: 2022
Phishing email detection using improved RCNN model with multilevel vectors and attention mechanism
Y. Fang, C. Zhang, C. Huang, L. Liu, Y. Yang
Published: 2019
Phishing detection method based on Borderline-SMOTE deep belief network
J. Zhang, X. Li
Published: 2017
Classifying phishing URLs using recurrent neural networks
A. C. Bahnsen, E. C. Bohorquez, S. Villegas, J. Vargas, F. A. González
Published: 2017
Detection of online phishing email using dynamic evolving neural network based on reinforcement learning
S. Smadi, N. Aslam, L. Zhang
Published: 2018
Evaluation of Federated Learning in Phishing Email Detection
Chandra Thapa, Jun Wen Tang, Alsharif Abuadbba, Yansong Gao, Seyit Camtepe, Surya Nepal, Mahathir Almashor, Yifeng Zheng
Published: 2020.7.27
Phishing email detection model using deep learning
S. Atawneh, H. Aljehani
Published: 2023
BERT-based models for phishing detection
M. Songailaite, E. Kankevičiūtė, B. Zhyhun, J. Mandravickaitė
Published: 2023
A large-scale pretrained deep model for phishing URL detection
Y. Wang, W. Zhu, H. Xu, Z. Qin, K. Ren, W. Ma
Published: 2023
URLTran: Improving phishing URL detection using transformers
P. Maneriker, J. W. Stokes, E. G. Lazo, D. Carutasu, F. Tajaddodianfar, A. Gururajan
Published: 2021
Analysis on the selection of the appropriate batch size in CNN neural network
R. Lin
Published: 2022
Decoupled weight decay regularization
Ilya Loshchilov, Frank Hutter
Published: 2018
Explainable AI methods - a brief overview
A. Holzinger, A. Saranti, C. Molnar, P. Biecek, W. Samek
Published: 2022
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin
Published: 2016.2.16
Cyber-attack detection through ensemble-based machine learning classifier
M. A. Uddin, K. T. Shahriar, M. M. Haque, I. H. Sarker
Published: 2022
LLM potentiality and awareness: a position paper from the perspective of trustworthy and responsible AI modeling
I. H. Sarker
Published: 2024