Phishing URL Detection Through Top-level Domain Analysis: A Descriptive Approach

arxiv

AI Security Portal bot

Information in the literature database is collected automatically.

Source

https://arxiv.org/abs/2005.06599

PDF

https://arxiv.org/pdf/2005.06599

Publication details

Authors: Orestis Christou, Nikolaos Pitropakis, Pavlos Papadopoulos, Sean McKeown, William J. Buchanan
Published: 2020-05-14
Affiliation: School of Computing, Edinburgh Napier University
Country of affiliation: United Kingdom
Conference: International Conference on Information Systems Security and Privacy (ICISSP)

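AI-estimated labels

URL analysis methods, machine learning algorithms, Random Forest

Note: These labels were added automatically by AI and may therefore be inaccurate.
For details, see "About the literature database".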

Abstract

Phishing is considered to be one of the most prevalent cyber-attacks because of its immense flexibility and alarmingly high success rate. Even with adequate training and high situational awareness, it can still be hard for users to continually be aware of the URL of the website they are visiting. Traditional detection methods rely on blocklists and content analysis, both of which require time-consuming human verification. Thus, there have been attempts focusing on the predictive filtering of such URLs. This study aims to develop a machine-learning model to detect fraudulent URLs which can be used within the Splunk platform. Inspired by similar approaches in the literature, we trained the SVM and Random Forests algorithms using malicious and benign datasets found in the literature and one dataset that we created. We evaluated the algorithms' performance with precision and recall, reaching up to 85% precision and 87% recall in the case of Random Forests, while SVM achieved up to 90% precision and 88% recall using only descriptive features.
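
As a rough illustration of what "descriptive features" and the precision/recall evaluation could look like in practice, the following minimal Python sketch derives a few lexical properties from a URL string and computes precision and recall from labelled predictions. The feature set and the names `descriptive_features` and `precision_recall` are assumptions made for this example; this entry does not list the features the authors used, and the sketch is not their implementation.

```python
from urllib.parse import urlparse


def descriptive_features(url: str) -> dict:
    """Derive simple lexical features from a URL string.

    Hypothetical feature set chosen for illustration only; the paper's
    actual features may differ.
    """
    # urlparse only fills in the host when a scheme is present.
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    labels = host.split(".") if host else []
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_hyphens": host.count("-"),
        "num_digits": sum(ch.isdigit() for ch in url),
        "has_at_symbol": "@" in url,
        # Last host label, used here as a naive stand-in for the TLD.
        "tld": labels[-1] if labels else "",
    }


def precision_recall(y_true, y_pred, positive=1):
    """Compute precision and recall for the class labelled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up URL and labels, purely to show the call shapes.
    print(descriptive_features("http://secure-login.example.tk/verify?id=123"))
    print(precision_recall([1, 1, 0, 0, 1], [1, 0, 1, 0, 1]))
```

Feature dictionaries of this kind would typically be vectorised and passed to a classifier such as an SVM or a Random Forest; that training step is omitted here because the entry gives no details about it.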

External datasets

PhishTank active blocklist

Alexa’s top one million domains

Legitimate and malicious lists from Sahingoz et al. (2019)

Phishing/legitimate URL set from PhishStorm (Marchal et al., 2014)