AIセキュリティポータル K Program
RACONTEUR: A Knowledgeable, Insightful, and Portable LLM-Powered Shell Command Explainer
Share
Abstract
Malicious shell commands are linchpins to many cyber-attacks, but may not be easy to understand by security analysts due to complicated and often disguised code structures. Advances in large language models (LLMs) have unlocked the possibility of generating understandable explanations for shell commands. However, existing general-purpose LLMs suffer from a lack of expert knowledge and a tendency to hallucinate in the task of shell command explanation. In this paper, we present Raconteur, a knowledgeable, expressive and portable shell command explainer powered by LLM. Raconteur is infused with professional knowledge to provide comprehensive explanations on shell commands, including not only what the command does (i.e., behavior) but also why the command does it (i.e., purpose). To shed light on the high-level intent of the command, we also translate the natural-language-based explanation into standard technique & tactic defined by MITRE ATT&CK, the worldwide knowledge base of cybersecurity. To enable Raconteur to explain unseen private commands, we further develop a documentation retriever to obtain relevant information from complementary documentations to assist the explanation process. We have created a large-scale dataset for training and conducted extensive experiments to evaluate the capability of Raconteur in shell command explanation. The experiments verify that Raconteur is able to provide high-quality explanations and in-depth insight of the intent of the command.
A Convolutional Attention Network for Extreme Summarization of Source Code
M. Allamanis, H. Peng, C. Sutton
Published: 2016
METEOR: An automatic metric for MT evaluation with improved correlation with human judgments
Satanjeev Banerjee, Alon Lavie
Published: 2005
Towards a Systematic Multi-modal Representation Learning for Network Data
Z. Ben-Houidi, R. Azorin, M. Gallo, A. Finamore, D. Rossi
Published: 2022
Towards NLP-based Processing of Honeypot Logs
M. Boffa, G. Milan, L. Vassio, I. Drago, M. Mellia, Z. Ben-Houidi
Published: 2022
LogPrécis: Unleashing language models for automated malicious log analysis
Matteo Boffa, Idilio Drago, Marco Mellia, Luca Vassio, Danilo Giordano, Rodolfo Valentim, Zied Ben Houidi
Published: 2024
Language models are few-shot learners
T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei
Published: 2020
PyMT5: Multi-mode Translation of Natural Language and Python Code with Transformers
C. B. Clement, D. Drain, J. Timcheck, A. Svyatkovskiy, N. Sundare-san
Published: 2020
Robosourcing Educational Resources - Leveraging Large Language Models for Learner-sourcing
P. Denny, S. Sarsa, A. Hellas, J. Leinonen
Published: 2022
A New Hope for Network Model Generalization
A. Dietmuller, S. Ray, R. Jacob, L. Vanbever
Published: 2022
Share