Dataset Generation

UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing

Authors: Yifeng He, Jiabo Huang, Yuyang Rong, Yiwen Guo, Ethan Wang, Hao Chen | Published: 2024-02-04
Code Generation
Dataset Generation
Test Prioritization

ADVENT: Attack/Anomaly Detection in VANETs

Authors: Hamideh Baharlouei, Adetokunbo Makanju, Nur Zincir-Heywood | Published: 2024-01-16
Dataset Generation
Malicious Node Detection
Federated Learning

Digger: Detecting Copyright Content Mis-usage in Large Language Model Training

Authors: Haodong Li, Gelei Deng, Yi Liu, Kailong Wang, Yuekang Li, Tianwei Zhang, Yang Liu, Guoai Xu, Guosheng Xu, Haoyu Wang | Published: 2024-01-01
LLM Performance Evaluation
Dataset Generation
Prompt Injection

Anticipated Network Surveillance — An extrapolated study to predict cyber-attacks using Machine Learning and Data Analytics

Authors: Aviral Srivastava, Dhyan Thakkar, Sharda Valiveti, Pooja Shah, Gaurang Raval | Published: 2023-12-27
Dataset Generation
Model Performance Evaluation
Literature List

An Approach to Abstract Multi-stage Cyberattack Data Generation for ML-Based IDS in Smart Grids

Authors: Ömer Sen, Philipp Malskorn, Simon Glomb, Immanuel Hacker, Martin Henze, Andreas Ulbig | Published: 2023-12-21
Cybersecurity
Dataset Generation
Network Node Configuration

Traces of Memorisation in Large Language Models for Code

Authors: Ali Al-Kaswan, Maliheh Izadi, Arie van Deursen | Published: 2023-12-18 | Updated: 2024-01-15
Dataset Generation
Data Leakage
Training Data Extraction Method

Enhancing Malware Detection by Integrating Machine Learning with Cuckoo Sandbox

Authors: Amaal F. Alshmarni, Mohammed A. Alliheedi | Published: 2023-11-07
Security Analysis
Dataset Generation
Deep Learning Method

From Chatbots to PhishBots? — Preventing Phishing scams created using ChatGPT, Google Bard and Claude

Authors: Sayak Saha Roy, Poojitha Thota, Krishna Vamsi Naragam, Shirin Nilizadeh | Published: 2023-10-29 | Updated: 2024-03-10
Dataset Generation
Detection Rate of Phishing Attacks
Prompt Injection

Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

Authors: Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun | Published: 2023-10-18 | Updated: 2024-03-05
Dataset Generation
Program Analysis
Prompt Injection

MalDICT: Benchmark Datasets on Malware Behaviors, Platforms, Exploitation, and Packers

Authors: Robert J. Joyce, Edward Raff, Charles Nicholas, James Holt | Published: 2023-10-18
Dataset Generation
Token Processing and Collection
Malware Classification