An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | AIセキュリティポータル

EN

JA

EN

TOP 文献データベース An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach

arxiv

An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2402.13871

PDF

https://arxiv.org/pdf/2402.13871

文献情報

作者: Mohammad Amaz Uddin,Md Mahiuddin,Iqbal H. Sarker
公開日: 2024-2-22
更新日: 2025-8-14
所属機関: Department of Computer Science and Engineering, BGC Trust University Bangladesh
所属の国: Bangladesh
会議名: Comput. Networks

AIにより推定されたラベル

フィッシング検出モデルの解釈性モデル性能評価

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Phishing email is a serious cyber threat that tries to deceive users by sending false emails with the intention of stealing confidential information or causing financial harm. Attackers, often posing as trustworthy entities, exploit technological advancements and sophistication to make detection and prevention of phishing more challenging. Despite extensive academic research, phishing detection remains an ongoing and formidable challenge in the cybersecurity landscape. Large Language Models (LLMs) and Masked Language Models (MLMs) possess immense potential to offer innovative solutions to address long-standing challenges. In this research paper, we present an optimized, fine-tuned transformer-based DistilBERT model designed for the detection of phishing emails. In the detection process, we work with a phishing email dataset and utilize the preprocessing techniques to clean and solve the imbalance class issues. Through our experiments, we found that our model effectively achieves high accuracy, demonstrating its capability to perform well. Finally, we demonstrate our fine-tuned model using Explainable-AI (XAI) techniques such as Local Interpretable Model-Agnostic Explanations (LIME) and Transformer Interpret to explain how our model makes predictions in the context of text classification for phishing emails.

外部データセット

Kaggle phishing email dataset

参考文献

Procedia Computer Science

Phishing email detection using natural language processing techniques: a literature survey

S. Salloum, T. Gaber, S. Vadera, K. Shaalan

Published: 2021

IEEE communications surveys & tutorials

A survey of phishing email filtering techniques

A. Almomani, B. B. Gupta, S. Atawneh, A. Meulenberg, E. Almomani

Published: 2013

Telecommunication Systems

A comprehensive survey of ai-enabled phishing attacks detection techniques

A. Basit, M. Zafar, X. Liu, A. R. Javed, Z. Jalil, K. Kifayat

Published: 2021

Annals of Data Science

Machine learning for intelligent data analysis and automation in cybersecurity: current and future prospects

I. H. Sarker

Published: 2023

Advances in Neural Information Processing Systems

Transformer in transformer

K. Han, A. Xiao, E. Wu, J. Guo, C. Xu, Y. Wang

Published: 2021

An improved transformer-based model for detecting phishing, spam, and ham: A large language model approach

S. Jamal, H. Wimmer, I. Sarker

Published: 2023

A survey of large language models

W. X. Zhao, K. Zhou, J. Li, T. Tang, X. Wang, Y. Hou, Y. Min, B. Zhang, J. Zhang, Z. Dong

Published: 2023

HardwareX / Hardware and Communication? (Heliyon: Computing)

A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Yifan Yao, Jinhao Duan, Kaidi Xu, Yuanfang Cai, Zhibo Sun, Yue Zhang

Published: 2024

Bert: a review of applications in natural language processing and understanding

Published: 2021

Social Network Analysis and Mining

Sentiment analysis on the impact of coronavirus in social life using the bert model

M. Singh, A. K. Jakhar, S. Pandey

Published: 2021

Proceedings of NAACL-HLT

Bert: Pre-training of deep bidirectional transformers for language understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Published: 2019

Applied Sciences

Survey of bert-base models for scientific text classification: Covid-19 case study

M. Khadhraoui, H. Bellaaj, M. B. Ammar, H. Hamam, M. Jmaiel

Published: 2022

International Conference on Learning Representations

Albert: A lite bert for self-supervised learning of language representations

Z. Lan, M. Chen, S. Goodman, K. Gimpel, P. Sharma, R. Soricut

Published: 2020

Roberta: A robustly optimized bert pretraining approach

Published: 2019

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

Victor Sanh, Lysandre Debut, Julien Chaumond, Thomas Wolf

Published: 2019

Springer

Explainable ai: A brief survey on history, research areas, approaches and challenges

F. Xu, H. Uszkoreit, Y. Du, W. Fan, D. Zhao, J. Zhu

Published: 2019

Metrics for explainable ai: Challenges and prospects

R. R. Hoffman, S. T. Mueller, G. Klein, J. Litman

Published: 2018

IEEE

Interpretable bangla sarcasm detection using bert and explainable ai

R. Anan, T. S. Apon, Z. T. Hossain, E. A. Modhu, S. Mondal, M. G. R. Alam

Published: 2023

Springer

AI-driven cybersecurity and threat intelligence: cyber automation, intelligent decision-making and explainability

I. H. Sarker

Published: 2024

ICT Express

Explainable ai for cybersecurity automation, intelligence and trustworthiness in digital twin: Methods, taxonomy, challenges and prospects

I. H. Sarker, H. Janicke, A. Mohsin, A. Gill, L. Maglaras

Published: 2024

Digital Communications and Networks

Explainabledetector: Exploring transformer-based language modeling approach for sms spam detection with explainability analysis

M. A. Uddin, M. N. Islam, L. Maglaras, H. Janicke, I. H. Sarker

Published: 2025

Digital Threats: Research and Practice (DTRAP)

The Role of Machine Learning in Cybersecurity

Giovanni Apruzzese, Pavel Laskov, Edgardo Montes de Oca, Wissam Mallouli, Luis Burdalo Rapa, Athanasios Vasileios Grammatopoulos, Fabio Di Franco

Published: 2022.6.20

Machine Learning (ML) represents a pivotal technology for current and future information systems, and many domains already leverage the capabilities of ML. However, deployment of ML in cybersecurity is still at an early stage, revealing a significant discrepancy between research and practice. Such discrepancy has its root cause in the current state-of-the-art, which does not allow to identify the role of ML in cybersecurity. The full potential of ML will never be unleashed unless its pros and cons are understood by a broad audience. This paper is the first attempt to provide a holistic understanding of the role of ML in the entire cybersecurity domain -- to any potential reader with an interest in this topic. We highlight the advantages of ML with respect to human-driven detection methods, as well as the additional tasks that can be addressed by ML in cybersecurity. Moreover, we elucidate various intrinsic problems affecting real ML deployments in cybersecurity. Finally, we present how various stakeholders can contribute to future developments of ML in cybersecurity, which is essential for further progress in this field. Our contributions are complemented with two real case studies describing industrial applications of ML as defense against cyber-threats.

敵対的サンプル機械学習の役割商用ML製品の問題

An intelligent classification model for phishing email detection

A. Yasin, A. Abuhasan

Published: 2016

Proceedings of the Anti-Phishing Pilot at ACM International Workshop on Security and Privacy Analytics

A machine learning approach towards phishing email detection

N. Harikrishnan, R. Vinayakumar, K. Soman

Published: 2018

Studies in Informatics and Control

Using feature selection and classification scheme for automating phishing email detection

I. R. A. Hamid, J. Abawajy, T. Kim

Published: 2013

The Electronic Library

Phishing web site detection using diverse machine learning algorithms

A. Zamir, H. U. Khan, T. Iqbal, N. Yousaf, F. Aslam, A. Anjum, M. Hamdani

Published: 2020

Phishing dynamic evolving neural fuzzy framework for online detection zero-day phishing email

A. Almomani, B. B. Gupta, T.-C. Wan, A. Altaher, S. Manickam

Published: 2013

International Journal of Software Science and Computational Intelligence (IJSSCI)

Email classification for forensic analysis by information gain technique

D. E. Salhi, A. Tari, M. T. Kechadi

Published: 2021

Computers & Security

Applying machine learning and natural language processing to detect phishing email

A. Alhogail, A. Alsabih

Published: 2021

Computers, Materials & Continua

Intelligent deep learning based cybersecurity phishing email detection and classification

R. Brindha, S. Nandagopal, H. Azath, V. Sathana, G. P. Joshi, S. W. Kim

Published: 2023

Applied System Innovation

Phish responder: A hybrid machine learning approach to detect phishing and spam emails

M. Dewis, T. Viana

Published: 2022

IEEE Access

Phishing email detection using improved rcnn model with multilevel vectors and attention mechanism

Y. Fang, C. Zhang, C. Huang, L. Liu, Y. Yang

Published: 2019

Springer

Phishing detection method based on borderline-smote deep belief network

J. Zhang, X. Li

Published: 2017

IEEE

Classifying phishing urls using recurrent neural networks

A. C. Bahnsen, E. C. Bohorquez, S. Villegas, J. Vargas, F. A. González

Published: 2017

Decision Support Systems

Detection of online phishing email using dynamic evolving neural network based on reinforcement learning

S. Smadi, N. Aslam, L. Zhang

Published: 2018

Italian National Conference on Sensors (Sensors)

Evaluation of Federated Learning in Phishing Email Detection

Chandra Thapa, Jun Wen Tang, Alsharif Abuadbba, Yansong Gao, Seyit Camtepe, Surya Nepal, Mahathir Almashor, Yifeng Zheng

Published: 2020.7.27

The use of Artificial Intelligence (AI) to detect phishing emails is primarily dependent on large-scale centralized datasets, which opens it up to a myriad of privacy, trust, and legal issues. Moreover, organizations are loathed to share emails, given the risk of leakage of commercially sensitive information. So, it is uncommon to obtain sufficient emails to train a global AI model efficiently. Accordingly, privacy-preserving distributed and collaborative machine learning, particularly Federated Learning (FL), is a desideratum. Already prevalent in the healthcare sector, questions remain regarding the effectiveness and efficacy of FL-based phishing detection within the context of multi-organization collaborations. To the best of our knowledge, the work herein is the first to investigate the use of FL in email anti-phishing. This paper builds upon a deep neural network model, particularly RNN and BERT for phishing email detection. It analyzes the FL-entangled learning performance under various settings, including balanced and asymmetrical data distribution. Our results corroborate comparable performance statistics of FL in phishing email detection to centralized learning for balanced datasets, and low organization counts. Moreover, we observe a variation in performance when increasing organizational counts. For a fixed total email dataset, the global RNN based model suffers by a 1.8% accuracy drop when increasing organizational counts from 2 to 10. In contrast, BERT accuracy rises by 0.6% when going from 2 to 5 organizations. However, if we allow increasing the overall email dataset with the introduction of new organizations in the FL framework, the organizational level performance is improved by achieving a faster convergence speed. Besides, FL suffers in its overall global model performance due to highly unstable outputs if the email dataset distribution is highly asymmetric.

深層学習プライバシー評価性能評価

Electronics

Phishing email detection model using deep learning

S. Atawneh, H. Aljehani

Published: 2023

Tinybert: Distilling bert for natural language understanding

X. Jiao, Y. Yin, L. Shang, X. Jiang, X. Chen, L. Li, F. Wang, Q. Liu

Published: 2019

Computing Research Repository (CoRR)

CATBERT: Context-Aware Tiny BERT for Detecting Social Engineering Emails

Younghoo Lee, Joshua Saxe, Richard Harang

Published: 2020.10.8

Targeted phishing emails are on the rise and facilitate the theft of billions of dollars from organizations a year. While malicious signals from attached files or malicious URLs in emails can be detected by conventional malware signatures or machine learning technologies, it is challenging to identify hand-crafted social engineering emails which don't contain any malicious code and don't share word choices with known attacks. To tackle this problem, we fine-tune a pre-trained BERT model by replacing the half of Transformer blocks with simple adapters to efficiently learn sophisticated representations of the syntax and semantics of the natural language. Our Context-Aware network also learns the context representations between email's content and context features from email headers. Our CatBERT(Context-Aware Tiny Bert) achieves a 87% detection rate as compared to DistilBERT, LSTM, and logistic regression baselines which achieve 83%, 79%, and 54% detection rates at false positive rates of 1%, respectively. Our model is also faster than competing transformer approaches and is resilient to adversarial attacks which deliberately replace keywords with typos or synonyms.

モデルアーキテクチャ学習の改善機械学習

CEUR-WS

Bert-based models for phishing detection

M. Songailaite, E. Kankevičiūtė, B. Zhyhun, J. Mandravickaitė

Published: 2023

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

A large-scale pretrained deep model for phishing url detection

Y. Wang, W. Zhu, H. Xu, Z. Qin, K. Ren, W. Ma

Published: 2023

MILCOM 2021 - 2021 IEEE Military Communications Conference

Urltran: Improving phishing url detection using transformers

P. Maneriker, J. W. Stokes, E. G. Lazo, D. Carutasu, F. Tajaddodianfar, A. Gururajan

Published: 2021

IEEE

Analysis on the selection of the appropriate batch size in cnn neural network

R. Lin

Published: 2022

International Conference on Learning Representations

Decoupled weight decay regularization

Ilya Loshchilov, Frank Hutter

Published: 2018

Understanding adamw through proximal methods and scale-freeness

Z. Zhuang, M. Liu, A. Cutkosky, F. Orabona

Published: 2022

Springer

Explainable ai methods-a brief overview

A. Holzinger, A. Saranti, C. Molnar, P. Biecek, W. Samek

Published: 2022

arxiv

被引用数 1

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

Published: 2016.2.16

Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy model or prediction into a trustworthy one. In this work, we propose LIME, a novel explanation technique that explains the predictions of any classifier in an interpretable and faithful manner, by learning an interpretable model locally around the prediction. We also propose a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem. We demonstrate the flexibility of these methods by explaining different models for text (e.g. random forests) and image classification (e.g. neural networks). We show the utility of explanations via novel experiments, both simulated and with human subjects, on various scenarios that require trust: deciding if one should trust a prediction, choosing between models, improving an untrustworthy classifier, and identifying why a classifier should not be trusted.

説明可能な機械学習 XAI（説明可能なAI）特徴重要度分析

Hybrid quantum-inspired resnet and densenet for pattern recognition with completeness analysis

A. Chen, H.-L. Yin, Z.-B. Chen, S. Wu

Published: 2024

Springer

Cyber-attack detection through ensemble-based machine learning classifier

M. A. Uddin, K. T. Shahriar, M. M. Haque, I. H. Sarker

Published: 2022

Discover Artificial Intelligence

Llm potentiality and awareness: a position paper from the perspective of trustworthy and responsible ai modeling

I. H. Sarker

Published: 2024