Model Robustness

TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation

Authors: Tianyu Cui, Xinjie Lin, Sijia Li, Miao Chen, Qilei Yin, Qi Li, Ke Xu | Published: 2025-04-05 | Updated: 2025-04-15
LLM Performance Evaluation
Task-Specific Tuning
Model Robustness

Robust LLM safeguarding via refusal feature adversarial training

Authors: Lei Yu, Virginie Do, Karen Hambardzumyan, Nicola Cancedda | Published: 2024-09-30 | Updated: 2025-03-20
Prompt Injection
Model Robustness
Adversarial Learning

Stealing Part of a Production Language Model

Authors: Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Itay Yona, Eric Wallace, David Rolnick, Florian Tramèr | Published: 2024-03-11 | Updated: 2024-07-09
Prompt leaking
Model Robustness
Model Extraction Attack

Data Reconstruction Attacks and Defenses: A Systematic Evaluation

Authors: Sheng Liu, Zihan Wang, Yuxiao Chen, Qi Lei | Published: 2024-02-13 | Updated: 2025-03-22
Privacy Analysis
Model Robustness
Adversarial attack

Attack of the Tails: Yes, You Really Can Backdoor Federated Learning

Authors: Hongyi Wang, Kartik Sreenivasan, Shashank Rajput, Harit Vishwakarma, Saurabh Agarwal, Jy-yong Sohn, Kangwook Lee, Dimitris Papailiopoulos | Published: 2020-07-09
Poisoning
Model Robustness
Attack Method

A Fast Saddle-Point Dynamical System Approach to Robust Deep Learning

Authors: Yasaman Esfandiari, Aditya Balu, Keivan Ebrahimi, Umesh Vaidya, Nicola Elia, Soumik Sarkar | Published: 2019-10-18 | Updated: 2021-03-01
Model Robustness
Adversarial Learning
Adversarial Example

Mapper Based Classifier

Authors: Jacek Cyranka, Alexander Georges, David Meyer | Published: 2019-10-17 | Updated: 2019-10-21
Model Robustness
Deep Learning
Generative Model

Instance adaptive adversarial training: Improved accuracy tradeoffs in neural nets

Authors: Yogesh Balaji, Tom Goldstein, Judy Hoffman | Published: 2019-10-17
Model Robustness
Adversarial Learning
Adversarial Example

A New Defense Against Adversarial Images: Turning a Weakness into a Strength

Authors: Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger | Published: 2019-10-16 | Updated: 2019-12-04
Model Robustness
Adversarial Learning
Adversarial Attack Detection

MUTE: Data-Similarity Driven Multi-hot Target Encoding for Neural Network Design

Authors: Mayoore S. Jaiswal, Bumsoo Kang, Jinho Lee, Minsik Cho | Published: 2019-10-15
Model Robustness
Deep Learning