Enhancing Reasoning Capacity of SLM using Cognitive Enhancement

An Assessment of ChatGPT on Log Data

P. Mudgal, R. Wouhaybi

LogPrompt: Prompt Engineering Towards Zero-Shot and Interpretable Log Analysis

Y. Liu, S. Tao, W. Meng, J. Wang, W. Ma, Y. Zhao, Y. Chen, H. Yang, Y. Jiang, X. Chen

Computing Research Repository (CoRR)

RAGLog: Log Anomaly Detection using Retrieval Augmented Generation

Jonathan Pan, Swee Liang Wong, Yidi Yuan

Published: 2023.11.9

The ability to detect log anomalies from system logs is a vital activity needed to ensure cyber resiliency of systems. It is applied for fault identification or facilitate cyber investigation and digital forensics. However, as logs belonging to different systems and components differ significantly, the challenge to perform such analysis is humanly challenging from the volume, variety and velocity of logs. This is further complicated by the lack or unavailability of anomalous log entries to develop trained machine learning or artificial intelligence models for such purposes. In this research work, we explore the use of a Retrieval Augmented Large Language Model that leverages a vector database to detect anomalies from logs. We used a Question and Answer configuration pipeline. To the best of our knowledge, our experiment which we called RAGLog is a novel one and the experimental results show much promise.

ログ分析の課題クラス不均衡クラスタリング手法

Explainable Artificial Intelligence Applications in Cyber Security: State-of-the-Art in Research

Zhibo Zhang, Hussam Al Hamadi, Ernesto Damiani, Chan Yeob Yeun, Fatma Taher

Published: 2022.9.1

This survey presents a comprehensive review of current literature on Explainable Artificial Intelligence (XAI) methods for cyber security applications. Due to the rapid development of Internet-connected systems and Artificial Intelligence in recent years, Artificial Intelligence including Machine Learning (ML) and Deep Learning (DL) has been widely utilized in the fields of cyber security including intrusion detection, malware detection, and spam filtering. However, although Artificial Intelligence-based approaches for the detection and defense of cyber attacks and threats are more advanced and efficient compared to the conventional signature-based and rule-based cyber security strategies, most ML-based techniques and DL-based techniques are deployed in the black-box manner, meaning that security experts and customers are unable to explain how such procedures reach particular conclusions. The deficiencies of transparency and interpretability of existing Artificial Intelligence techniques would decrease human users' confidence in the models utilized for the defense against cyber attacks, especially in current situations where cyber attacks become increasingly diverse and complicated. Therefore, it is essential to apply XAI in the establishment of cyber security models to create more explainable models while maintaining high accuracy and allowing human users to comprehend, trust, and manage the next generation of cyber defense mechanisms. Although there are papers reviewing Artificial Intelligence applications in cyber security areas and the vast literature on applying XAI in many fields including healthcare, financial services, and criminal justice, the surprising fact is that there are currently no survey research articles that concentrate on XAI applications in cyber security.

モデルの解釈性 XAIの応用データセット生成

Forensic Sci. Int. Digit. Investig.

ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown

Mark Scanlon, Frank Breitinger, Christopher Hargreaves, Jan-Niclas Hilgert, John Sheppard

Published: 2023.7.11

The disruptive application of ChatGPT (GPT-3.5, GPT-4) to a variety of domains has become a topic of much discussion in the scientific community and society at large. Large Language Models (LLMs), e.g., BERT, Bard, Generative Pre-trained Transformers (GPTs), LLaMA, etc., have the ability to take instructions, or prompts, from users and generate answers and solutions based on very large volumes of text-based training data. This paper assesses the impact and potential impact of ChatGPT on the field of digital forensics, specifically looking at its latest pre-trained LLM, GPT-4. A series of experiments are conducted to assess its capability across several digital forensic use cases including artefact understanding, evidence searching, code generation, anomaly detection, incident response, and education. Across these topics, its strengths and risks are outlined and a number of general conclusions are drawn. Overall this paper concludes that while there are some potential low-risk applications of ChatGPT within digital forensics, many are either unsuitable at present, since the evidence would need to be uploaded to the service, or they require sufficient knowledge of the topic being asked of the tool to identify incorrect assumptions, inaccuracies, and mistakes. However, to an appropriately knowledgeable user, it could act as a useful supporting tool in some circumstances.

デジタルフォレンジックプロンプトエンジニアリングデータ生成

2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE)

Log-based Anomaly Detection without Log Parsing

V. H. Le, H. Zhang

Published: 2021

A survey of large language models

W. X. Zhao, K. Zhou, J. Li, T. Tang, X. Wang, Y. Hou, Y. Min, B. Zhang, J. Zhang, Z. Dong

Teaching Small Language Models to Reason

IEEE Access

Explainable Artificial Intelligence in CyberSecurity: A Survey

N. Capuano, G. Fenza, V. Loia, C. Stanzione

Published: 2022

L. C. Magister, J. Mallinson, J. Adamek, E. Malmi, A. Severyn

ReAct: Synergizing Reasoning and Acting in Language Models

S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. Narasimhan, Y. Cao

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

Published: 2020.5.23

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever. We compare two RAG formulations, one which conditions on the same retrieved passages across the whole generated sequence, the other can use different passages per token. We fine-tune and evaluate our models on a wide range of knowledge-intensive NLP tasks and set the state-of-the-art on three open domain QA tasks, outperforming parametric seq2seq models and task-specific retrieve-and-extract architectures. For language generation tasks, we find that RAG models generate more specific, diverse and factual language than a state-of-the-art parametric-only seq2seq baseline.

情報抽出手法 RAG 知識抽出手法

Latent retrieval for weakly supervised open domain question answering

K. Lee, M. W. Chang, K. Toutanov

Published: 2019

European Economic Review

The effect of cognitive load on economic decision making: A survey and new experiment

C. Deck, S. Jahedi

Published: 2015

International Journal of Human-Computer Studies

A cognitive decomposition to empirically study human performance in control room environments

B. M. Knisely, J. S. Joyner, A. M. Rutkowski, M. Wong, S. Barksdale, H. Hotham, K. Kharod, M. Vaughn-Cooke

Published: 2020

42nd Annual Meeting of the Cognitive Science Society

Resource-rational Task Decomposition to Minimize Planning Costs

C. G. Correa, M. K. Ho, F. Callaway, T. L. Griffiths

Published: 2020

Psychology Press

Cognitive Task Analysis

J. M. Schraagen, S. F. Chipman, V. L. Shalin

Published: 2000

Understanding the planning of LLM agents: A survey

X. Huang, W. Liu, X. Chen, X. Wang, H. Wang, D. Lian, Y. Wang, R. Tang, E. Chen

Experience Report: Deep Learning-based System Log Analysis for Anomaly Detection

Z. Chen, J. Liu, W. Gu, Y. Su, M. R. Lyu

37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN’07)

What supercomputers say: A study of five system logs

A. Oliner, J. Stearley

Published: 2007

DSN

What supercomputers say: A study of five system logs

A. Oliner, J. Stearley

Published: 2007

Llama 2: Open foundation and fine-tuned chat models

H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosale, D. Bikel

Doremi: Optimizing data mixtures speeds up language model pretraining

Advances in Neural Information Processing Systems

Judging LLM-as-a-judge with MT-bench and chatbot arena

Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric Xing, et al.

Published: 2024

S. M. Xie, H. Pham, X. Dong, N. Du, H. Liu, Y. Lu, P. Liang, Q. V. Le, T. Ma, A. W. Yu

Editing large lanague models: Problems, method and opportunities

CoRR

Y. Yao, P. Wang, B. Tian, S. Cheng, Z. Li, S. Deng, H. Chen, N. Zhang

Reflexion: an autonomous agent with dynamic memory and self-reflection

Proceedings of NAACL-HLT

Bert: Pre-training of deep bidirectional transformers for language understanding

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Published: 2019

N. Shinn, B. Labash, A. Gopinath