AIセキュリティポータル K Program
Enhancing Reasoning Capacity of SLM using Cognitive Enhancement
Share
Abstract
Large Language Models (LLMs) have been applied to automate cyber security activities and processes including cyber investigation and digital forensics. However, the use of such models for cyber investigation and digital forensics should address accountability and security considerations. Accountability ensures models have the means to provide explainable reasonings and outcomes. This information can be extracted through explicit prompt requests. For security considerations, it is crucial to address privacy and confidentiality of the involved data during data processing as well. One approach to deal with this consideration is to have the data processed locally using a local instance of the model. Due to limitations of locally available resources, namely memory and GPU capacities, a Smaller Large Language Model (SLM) will typically be used. These SLMs have significantly fewer parameters compared to the LLMs. However, such size reductions have notable performance reduction, especially when tasked to provide reasoning explanations. In this paper, we aim to mitigate performance reduction through the integration of cognitive strategies that humans use for problem-solving. We term this as cognitive enhancement through prompts. Our experiments showed significant improvement gains of the SLMs' performances when such enhancements were applied. We believe that our exploration study paves the way for further investigation into the use of cognitive enhancement to optimize SLM for cyber security applications.
Improving log-based field failure data analysis of multi-node computing systems
A. Pecchia, D. Cotroneo, Z. Kalbarczyk, R.K. Iyer
Published: 2011
Detecting large-scale system problems by mining console logs
W. Xu, L. Huang, A. Fox, D. Patterson, M.I. Jordon
Published: 2009
Taming the logs – Vocabularies for semantic security analysis
A. Ekelhart, E. Kiesling, K. Kurniawan
Published: 2018
Explainable Artificial Intelligence Applications in Cyber Security: State-of-the-Art in Research
Zhibo Zhang, Hussam Al Hamadi, Ernesto Damiani, Chan Yeob Yeun, Fatma Taher
Published: 2022.9.1
ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown
Mark Scanlon, Frank Breitinger, Christopher Hargreaves, Jan-Niclas Hilgert, John Sheppard
Published: 2023.7.11
Log-based Anomaly Detection without Log Parsing
V. H. Le, H. Zhang
Published: 2021
Explainable Artificial Intelligence in CyberSecurity: A Survey
N. Capuano, G. Fenza, V. Loia, C. Stanzione
Published: 2022
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela
Published: 2020.5.23
Resource-rational Task Decomposition to Minimize Planning Costs
C. G. Correa, M. K. Ho, F. Callaway, T. L. Griffiths
Published: 2020
Cognitive Task Analysis
J. M. Schraagen, S. F. Chipman, V. L. Shalin
Published: 2000
What supercomputers say: A study of five system logs
A. Oliner, J. Stearley
Published: 2007
What supercomputers say: A study of five system logs
A. Oliner, J. Stearley
Published: 2007
Judging LLM-as-a-judge with MT-bench and chatbot arena
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric Xing, et al.
Published: 2024
Bert: Pre-training of deep bidirectional transformers for language understanding
Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Published: 2019
Self-consistency improves chain of thought reasoning in language models
Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V Le, Ed H. Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou
Published: 2023
Chain-of-thought prompting elicits reasoning in large language models
J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, D. Zhou
Published: 2023
Share