LightDefense: A Lightweight Uncertainty-Driven Defense against Jailbreaks via Shifted Token Distribution Authors: Zhuoran Yang, Jie Peng, Zhen Tan, Tianlong Chen, Yanyong Zhang | Published: 2025-04-02 Prompt InjectionModel Performance EvaluationUncertainty Measurement 2025.04.02 2025.05.27 Literature Database
Identifying Obfuscated Code through Graph-Based Semantic Analysis of Binary Code Authors: Roxane Cohen, Robin David, Florian Yger, Fabrice Rossi | Published: 2025-04-02 Explainability of Graph Machine LearningModel Performance EvaluationMachine Learning Algorithm 2025.04.02 2025.05.27 Literature Database
PiCo: Jailbreaking Multimodal Large Language Models via $\textbf{Pi}$ctorial $\textbf{Co}$de Contextualization Authors: Aofan Liu, Lulu Tang, Ting Pan, Yuguo Yin, Bin Wang, Ao Yang | Published: 2025-04-02 | Updated: 2025-04-07 Model Performance EvaluationLarge Language ModelWatermark 2025.04.02 2025.05.27 Literature Database
ATOM: A Framework of Detecting Query-Based Model Extraction Attacks for Graph Neural Networks Authors: Zhan Cheng, Bolin Shen, Tianming Sha, Yuan Gao, Shibo Li, Yushun Dong | Published: 2025-03-20 Graph Neural NetworkModel Performance EvaluationAnalysis of Detection Methods 2025.03.20 2025.05.27 Literature Database
ToxicSQL: Migrating SQL Injection Threats into Text-to-SQL Models via Backdoor Attack Authors: Meiyu Lin, Haichuan Zhang, Jiale Lao, Renyuan Li, Yuanchun Zhou, Carl Yang, Yang Cao, Mingjie Tang | Published: 2025-03-07 | Updated: 2025-04-03 Backdoor DetectionBackdoor AttackModel Performance Evaluation 2025.03.07 2025.05.27 Literature Database
SpinML: Customized Synthetic Data Generation for Private Training of Specialized ML Models Authors: Jiang Zhang, Rohan Xavier Sequeira, Konstantinos Psounis | Published: 2025-03-05 | Updated: 2025-04-07 Privacy ProtectionModel Performance EvaluationDifferential Privacy 2025.03.05 2025.05.27 Literature Database
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems Authors: Song Xia, Yi Yu, Wenhan Yang, Meiwen Ding, Zhuo Chen, Ling-Yu Duan, Alex C. Kot, Xudong Jiang | Published: 2025-03-01 | Updated: 2025-04-03 Privacy ProtectionCertified RobustnessModel Performance Evaluation 2025.03.01 2025.05.27 Literature Database
Efficient Model Compression for Bayesian Neural Networks Authors: Diptarka Saha, Zihe Liu, Feng Liang | Published: 2024-11-01 Sparse ModelModel Performance EvaluationOptimization Problem 2024.11.01 2025.05.27 Literature Database
Automated Trustworthiness Oracle Generation for Machine Learning Text Classifiers Authors: Lam Nguyen Tung, Steven Cho, Xiaoning Du, Neelofar Neelofar, Valerio Terragni, Stefano Ruberto, Aldeida Aleti | Published: 2024-10-30 | Updated: 2025-04-23 XAI (Explainable AI)Model Performance EvaluationReliability Analysis 2024.10.30 2025.05.27 Literature Database
Diffuse or Confuse: A Diffusion Deepfake Speech Dataset Authors: Anton Firc, Kamil Malinka, Petr Hanáček | Published: 2024-10-09 Dataset GenerationModel Performance EvaluationSpeech Synthesis Technology 2024.10.09 2025.05.27 Literature Database