Tarallo: Evading Behavioral Malware Detectors in the Problem Space Authors: Gabriele Digregorio, Salvatore Maccarrone, Mario D'Onghia, Luigi Gallo, Michele Carminati, Mario Polino, Stefano Zanero | Published: 2025-06-03 API SecurityDynamic Analysis MethodBehavior Analysis Method 2025.06.03 2025.06.05 Literature Database
CyberGym: Evaluating AI Agents’ Cybersecurity Capabilities with Real-World Vulnerabilities at Scale Authors: Zhun Wang, Tianneng Shi, Jingxuan He, Matthew Cai, Jialin Zhang, Dawn Song | Published: 2025-06-03 Prompt InjectionDynamic Analysis MethodWatermark Evaluation 2025.06.03 2025.06.05 Literature Database
Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems Authors: Pengfei He, Zhenwei Dai, Xianfeng Tang, Yue Xing, Hui Liu, Jingying Zeng, Qiankun Peng, Shrivats Agrawal, Samarth Varshney, Suhang Wang, Jiliang Tang, Qi He | Published: 2025-06-03 Indirect Prompt InjectionModel DoSEthical Considerations 2025.06.03 2025.06.05 Literature Database
BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage Authors: Kalyan Nakka, Nitesh Saxena | Published: 2025-06-03 Disabling Safety Mechanisms of LLMDetection Rate of Phishing AttacksPrompt Injection 2025.06.03 2025.06.05 Literature Database
A Review of Various Datasets for Machine Learning Algorithm-Based Intrusion Detection System: Advances and Challenges Authors: Sudhanshu Sekhar Tripathy, Bichitrananda Behera | Published: 2025-06-03 Trigger DetectionIntrusion Detection SystemAnalysis of Detection Methods 2025.06.03 2025.06.05 Literature Database
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models Authors: Xueqi Cheng, Minxing Zheng, Shixiang Zhu, Yushun Dong | Published: 2025-06-03 Model Extraction AttackDetection of Model Extraction AttacksDefense Method 2025.06.03 2025.06.05 Literature Database
IF-GUIDE: Influence Function-Guided Detoxification of LLMs Authors: Zachary Coalson, Juhan Bae, Nicholas Carlini, Sanghyun Hong | Published: 2025-06-02 | Updated: 2025-06-09 Text DetoxificationEthical Statement影響関数 2025.06.02 2025.06.11 Literature Database
On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective Authors: Ning Zhang, Henry Kenlay, Li Zhang, Mihai Cucuringu, Xiaowen Dong | Published: 2025-06-01 | Updated: 2025-06-03 Dynamic Graph ProcessingAdversarial LearningOptimization Problem 2025.06.01 2025.06.05 Literature Database
A Systematic Review of Metaheuristics-Based and Machine Learning-Driven Intrusion Detection Systems in IoT Authors: Mohammad Shamim Ahsan, Salekul Islam, Swakkhar Shatabda | Published: 2025-05-31 | Updated: 2025-06-03 Prompt InjectionIntrusion Detection SystemSelection and Evaluation of Optimization Algorithms 2025.05.31 2025.06.05 Literature Database
A Red Teaming Roadmap Towards System-Level Safety Authors: Zifan Wang, Christina Q. Knight, Jeremy Kritz, Willow E. Primack, Julian Michael | Published: 2025-05-30 | Updated: 2025-06-09 Model DoSLarge Language Model製品安全性 2025.05.30 2025.06.11 Literature Database