Automated Mapping of CVE Vulnerability Records to MITRE CWE Weaknesses

Tom Kwiatkowski, Jennimaria Palomaki, Olivia Redfield, Michael Collins, Ankur Parikh, Chris Alberti, Danielle Epstein, Illia Polosukhin, Jacob Devlin, Kenton Lee, et al.

Published: 2019

Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Neural architectures for named entity recognition

Guillaume Lample, Miguel Ballesteros

Published: 2016

1st Cyber Threat Intelligence Symposium

Automated Retrieval of ATT&CK Tactics and Techniques for Cyber Threat Reports

Valentine Legoy, Marco Caselli, Christin Seifert, Andreas Peter

Published: 2020

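RoBERTa: A robustly optimized BERT pretraining approach

Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, Veselin Stoyanov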

Published: 2019

Master Thesis in Cybersecurity, Department of Mathematics “Tullio Levi-Civita”. University of Padova

STIXnet: Entity and Relation Extraction from Unstructured CTI Reports

Francesco Marchiori, Mauro Conti, Nino Vincenzo Verde

Published: 2021

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures

Makoto Miwa, Mohit Bansal

Published: 2016

Findings of the Association for Computational Linguistics: EMNLP 2020

Rethinking Self-Attention: Towards Interpretability in Neural Parsing

Khalil Mrini, Franck Dernoncourt, Quan Hung Tran, Trung Bui, Walter Chang, Ndapa Nakashole

Published: 2020

MIT Press

Machine Learning: A Probabilistic Perspective

Kevin P. Murphy

Published: 2013

National Institute of Standards and Technology (NIST)

National Vulnerability Database

CVE National Vulnerability Database

Published: 2022

MS MARCO: A human generated machine reading comprehension dataset

Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen

Published: 2016

Master Theses, 2020-CURRENT. James Madison University

RedAI: A Machine Learning Approach to Cyber Threat Intelligence

Luke Noel

Published: 2021

Findings of the Association for Computational Linguistics: EMNLP 2020

Document Ranking with a Pretrained Sequence-to-Sequence Model

Rodrigo Nogueira, Zhiying Jiang, Ronak Pradeep, Jimmy Lin

Published: 2020

Conference on Neural Information Processing Systems (NeurIPS)

Training language models to follow instructions with human feedback

Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe

Published: 2022

Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these models are not aligned with their users. In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback. Starting with a set of labeler-written prompts and prompts submitted through the OpenAI API, we collect a dataset of labeler demonstrations of the desired model behavior, which we use to fine-tune GPT-3 using supervised learning. We then collect a dataset of rankings of model outputs, which we use to further fine-tune this supervised model using reinforcement learning from human feedback. We call the resulting models InstructGPT. In human evaluations on our prompt distribution, outputs from the 1.3B parameter InstructGPT model are preferred to outputs from the 175B GPT-3, despite having 100x fewer parameters. Moreover, InstructGPT models show improvements in truthfulness and reductions in toxic output generation while having minimal performance regressions on public NLP datasets. Even though InstructGPT still makes simple mistakes, our results show that fine-tuning with human feedback is a promising direction for aligning language models with human intent.

The OWASP Top 10

The Open Web Application Security Project

Published: 2021

Journal of Machine Learning Research

Exploring the limits of transfer learning with a unified text-to-text transformer

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J Liu

Published: 2020

Official SBERT Website

Pretrained Model - SBERT.net

Nils Reimers

Published: 2021

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing

Sentence-BERT: Sentence embeddings using Siamese BERT-networks

Nils Reimers, Iryna Gurevych

Published: 2019

The Probabilistic Relevance Framework: BM25 and Beyond

Stephen Robertson, Hugo Zaragoza

Published: 2009

Energy and policy considerations for deep learning in NLP

Emma Strubell, Ananya Ganesh, Andrew McCallum

Published: 2019

Advances in Neural Information Processing Systems 27

Sequence to sequence learning with neural networks

Ilya Sutskever, Oriol Vinyals, Quoc V. Le

Published: 2014

NIST

The Common Vulnerability Scoring System

CVSS

Published: 2022

Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP

GLUE: A multi-task benchmark and analysis platform for natural language understanding

Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel Bowman

Published: 2018

Conference on Empirical Methods in Natural Language Processing

Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path

Xu Yan, Lili Mou, Ge Li, Yunchuan Chen, Hao Peng, Zhi Jin

Published: 2015

Cybersecurity

TIM: threat context-enhanced TTP intelligence mining on unstructured threat data

Y. You, J. Jiang, Z. Jiang, P. Yang, B. Liu, H. Feng, X. Wang, N. Li

Published: 2022

32nd Pacific Asia Conference on Language, Information and Computation

Automatic Identification of Indicators of Compromise using Neural-Based Sequence Labelling

Shengping Zhou, Zi Long, Lianzhi Tan, Hao Guo

Published: 2018

CCS ’16

FeatureSmith: Automatically Engineering Features for Malware Detection by Mining the Security Literature

Ziyun Zhu, Tudor Dumitras

Published: 2016

2018 IEEE European Symposium on Security and Privacy (EuroS&P)

ChainSmith: Automatically learning the semantics of malicious campaigns by mining threat intelligence reports

Ziyun Zhu, Tudor Dumitras

Published: 2018

RankT5: Fine-Tuning T5 for Text Ranking with Ranking Losses

Honglei Zhuang, Zhen Qin, Rolf Jagerman, Kai Hui, Ji Ma, Jing Lu, Jianmo Ni, Xuanhui Wang, Michael Bendersky

Published: 2022