MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers

TOP 文献データベース MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2310.11706

PDF

https://arxiv.org/pdf/2310.11706

文献情報

作者: Robert J. Joyce;Edward Raff;Charles Nicholas;James Holt
公開日: 2023-10-18
所属機関: Laboratory for Physical Sciences
所属の国: United States of America
会議名: Conference on Applied Machine Learning in Information Security (CAMLIS)

AIにより推定されたラベル

マルウェア分類トークンの処理と収集データセット生成

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Existing research on malware classification focuses almost exclusively on two tasks: distinguishing between malicious and benign files and classifying malware by family. However, malware can be categorized according to many other types of attributes, and the ability to identify these attributes in newly-emerging malware using machine learning could provide significant value to analysts. In particular, we have identified four tasks which are under-represented in prior work: classification by behaviors that malware exhibit, platforms that malware run on, vulnerabilities that malware exploit, and packers that malware are packed with. To obtain labels for training and evaluating ML classifiers on these tasks, we created an antivirus (AV) tagging tool called ClarAVy. ClarAVy's sophisticated AV label parser distinguishes itself from prior AV-based taggers, with the ability to accurately parse 882 different AV label formats used by 90 different AV products. We are releasing benchmark datasets for each of these four classification tasks, tagged using ClarAVy and comprising nearly 5.5 million malicious files in total. Our malware behavior dataset includes 75 distinct tags - nearly 7x more than the only prior benchmark dataset with behavioral tags. To our knowledge, we are the first to release datasets with malware platform and packer tags.

外部データセット

MalDICT-Behavior

MalDICT-Platform

MalDICT-Vulnerability

MalDICT-Packer