Unleashing the Power of LLM to Infer State Machine from the Protocol Implementation

2020 IEEE 13th International Conference on Software Testing, Validation and Verification (ICST)

AFLNet: a greybox fuzzer for network protocols

V.-T. Pham, M. Bohme, A. Roychoudhury

Published: 2020

29th USENIX Security Symposium (USENIX Security 20)

Analysis of DTLS implementations using protocol state fuzzing

P. Fiterau-Brostean, B. Jonsson, R. Merget, J. De Ruiter, K. Sagonas, J. Somorovsky

Published: 2020

2017 IEEE Symposium on Security and Privacy (SP)

Verified models and reference implementations for the TLS 1.3 standard candidate

K. Bhargavan, B. Blanchet, N. Kobeissi

Published: 2017

31st USENIX Security Symposium (USENIX Security 22)

Stateful greybox fuzzing

J. Ba, M. Bohme, Z. Mirzamomen, A. Roychoudhury

Published: 2022

33rd USENIX Security Symposium (USENIX Security 24)

Hermes: Unlocking security analysis of cellular network protocols by synthesizing finite state machines from natural language specifications

A. A. Ishtiaq, S. S. S. Das, S. M. M. Rashid, A. Ranjbar, K. Tu, T. Wu, Z. Song, W. Wang, M. Akon, R. Zhang, S. R. Hussain

Published: 2024

2022 IEEE Symposium on Security and Privacy (SP)

Automated attack synthesis by extracting finite state machines from protocol specification documents

M. L. Pacheco, M. von Hippel, B. Weintraub, D. Goldwasser, C. Nita-Rotaru

Published: 2022

32nd USENIX Security Symposium (USENIX Security 23)

Extracting protocol format as state machine via controlled static loop analysis

Q. Shi, X. Xu, X. Zhang

Published: 2023

12th USENIX Workshop on Offensive Technologies (WOOT 18)

NEMESYS: Network message syntax reverse engineering by analysis of the intrinsic structure of individual messages

S. Kleber, H. Kopp, F. Kargl

Published: 2018

NDSS

NetPlier: Probabilistic network protocol reverse engineering from message traces

Y. Ye, Z. Zhang, F. Wang, X. Zhang, D. Xu

Published: 2021

IEEE INFOCOM 2020-IEEE Conference on Computer Communications

Message type identification of binary network protocols using continuous segment similarity

S. Kleber, R. W. van der Heijden, F. Kargl

Published: 2020

Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering

Mining input grammars from dynamic control flow

R. Gopinath, B. Mathis, A. Zeller

Published: 2020

Proc. IEEE/ACM ICSE

Fuzz4All: Universal fuzzing with large language models

C. S. Xia, M. Paltenghi, J. Le Tian, M. Pradel, L. Zhang

Published: 2024

Proceedings of the 46th IEEE/ACM International Conference on Software Engineering

Large language models are edge-case generators: Crafting unusual programs for fuzzing deep learning libraries

Y. Deng, C. S. Xia, C. Yang, S. D. Zhang, S. Yang, L. Zhang

Published: 2024

Proceedings of the 31st Annual Network and Distributed System Security Symposium (NDSS)

Large language model guided protocol fuzzing

R. Meng, M. Mirchev, M. Bohme, A. Roychoudhury

Published: 2024

NeurIPS

Chain-of-Thought prompting elicits reasoning in large language models

Jason Wei, Xuezhi Wang, Dale Schuurmans, et al.

Published: 2022

Augmented large language models with parametric knowledge guiding

Z. Luo, C. Xu, P. Zhao, X. Geng, C. Tao, J. Ma, Q. Lin, D. Jiang

Published: 2023

22nd USENIX Security Symposium (USENIX Security 13)

WHYPER: Towards automating risk assessment of mobile applications

R. Pandita, X. Xiao, W. Yang, W. Enck, T. Xie

Published: 2013

2015 IEEE/ACM 37th IEEE International Conference on Software Engineering

DASE: Document-assisted symbolic execution for improving automated software testing

E. Wong, L. Zhang, S. Wang, T. Liu, L. Tan

Published: 2015

2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE)

Evaluating large language models in class-level code generation

X. Du, M. Liu, K. Wang, H. Wang, J. Liu, Y. Chen, J. Feng, C. Sha, X. Peng, Y. Lou

Published: 2024

2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

Automated program repair in the era of large pre-trained language models

C. S. Xia, Y. Wei, L. Zhang

Published: 2023

arXiv

Annual ACM Conference on Computer and Communications Security (CCS)

Large Language Models for Code: Security Hardening and Adversarial Testing

Jingxuan He, Martin Vechev

Published: 2023

Large language models (large LMs) are increasingly trained on massive codebases and used to generate code. However, LMs lack awareness of security and are found to frequently produce unsafe code. This work studies the security of LMs along two important axes: (i) security hardening, which aims to enhance LMs' reliability in generating secure code, and (ii) adversarial testing, which seeks to evaluate LMs' security from an adversarial standpoint. We address both of these by formulating a new security task called controlled code generation. The task is parametric and takes as input a binary property to guide the LM to generate secure or unsafe code, while preserving the LM's capability of generating functionally correct code. We propose a novel learning-based approach called SVEN to solve this task. SVEN leverages property-specific continuous vectors to guide program generation towards the given property, without modifying the LM's weights. Our training procedure optimizes these continuous vectors by enforcing specialized loss terms on different regions of code, using a high-quality dataset carefully curated by us. Our extensive evaluation shows that SVEN is highly effective in achieving strong security control. For instance, a state-of-the-art CodeGen LM with 2.7B parameters generates secure code 59.1% of the time. When we employ SVEN to perform security hardening (or adversarial testing) on this LM, the ratio is significantly boosted to 92.3% (or degraded to 36.8%). Importantly, SVEN closely matches the original LMs in functional correctness.

Internet Key Exchange Protocol Version 2 (IKEv2)

Published: 2024

strongSwan is an open-source IPsec-based VPN solution

Published: 2024

A library providing the core IKEv2 functionality

Published: 2024

An Internet Key Exchange (IKE) implementation for Linux, FreeBSD, NetBSD and OpenBSD

Published: 2024

An IPsec implementation for Linux

Published: 2024

NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

How long can context length of open-source LLMs truly promise?

D. Li, R. Shao, A. Xie, Y. Sheng, L. Zheng, J. Gonzalez, I. Stoica, X. Ma, H. Zhang

Published: 2023

s2n-tls is a C99 implementation of the TLS/SSL protocols

Published: 2024

OpenBGPD is a free implementation of the Border Gateway Protocol

Published: 2024

Feng - standard compliant streaming server

Published: 2024

OpenL2TP is a complete implementation of RFC 2661

Published: 2024

arXiv

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

Published: 2020

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever. We compare two RAG formulations, one which conditions on the same retrieved passages across the whole generated sequence, and another which can use different passages per token. We fine-tune and evaluate our models on a wide range of knowledge-intensive NLP tasks and set the state-of-the-art on three open domain QA tasks, outperforming parametric seq2seq models and task-specific retrieve-and-extract architectures. For language generation tasks, we find that RAG models generate more specific, diverse and factual language than a state-of-the-art parametric-only seq2seq baseline.

OpenAI is an AI research and deployment company

Published: 2024

The Faiss library

M. Douze, A. Guzhva, C. Deng, J. Johnson, G. Szilvasy, P.-E. Mazare, M. Lomeli, L. Hosseini, H. Jegou

Published: 2024

Proceedings of the 9th ACM Symposium on Information, Computer and Communications Security

Towards automated protocol reverse engineering using semantic information

G. Bossert, F. Guihery, G. Hiet

Published: 2014

GPT-4 is OpenAI’s most advanced system, producing safer and more useful responses

Published: 2024