Literature Database

LoRA and Privacy: When Random Projections Help (and When They Don’t)

Authors: Yaxi Hu, Johanna Düngler, Bernhard Schölkopf, Amartya Sanyal | Published: 2026-01-29

Privacy protection framework

Membership Inference

Differential Privacy

2026.01.29 2026.01.31

FIT: Defying Catastrophic Forgetting in Continual LLM Unlearning

Authors: Xiaoyu Xu, Minxin Du, Kun Fang, Zi Liang, Yaxin Xiao, Zhicong Huang, Cheng Hong, Qingqing Ye, Haibo Hu | Published: 2026-01-29

Robustness

Machine Unlearning

Evaluation Metrics

2026.01.29 2026.01.31

ICL-EVADER: Zero-Query Black-Box Evasion Attacks on In-Context Learning and Their Defenses

Authors: Ningyuan He, Ronghong Huang, Qianqian Tang, Hongyu Wang, Xianghang Mi, Shanqing Guo | Published: 2026-01-29

Data Poisoning Attack

Prompt Leaking

Model Extraction Attack

2026.01.29 2026.01.31

Towards Zero Rotation and Beyond: Architecting Neural Networks for Fast Secure Inference with Homomorphic Encryption

Authors: Yifei Cai, Yizhou Feng, Qiao Zhang, Chunsheng Xin, Hongyi Wu | Published: 2026-01-29

Algorithm Design

Trigger Detection

Encryption Technology

2026.01.29 2026.01.31

User-Centric Phishing Detection: A RAG and LLM-Based Approach

Authors: Abrar Hamed Al Barwani, Abdelaziz Amara Korba, Raja Waseem Anwar | Published: 2026-01-29

LLM Performance Evaluation

Poisoning attack on RAG

User-Centric Phishing Detection

2026.01.29 2026.01.31

Adaptive and Robust Cost-Aware Proof of Quality for Decentralized LLM Inference Networks

Authors: Arther Tian, Alex Ding, Frank Chen, Simon Wu, Aaron Chan | Published: 2026-01-29

Identification of AI Output

Incentive Mechanism

Adversarial Learning

2026.01.29 2026.01.31

IoT Device Identification with Machine Learning: Common Pitfalls and Best Practices

Authors: Kahraman Kostas, Rabia Yasa Kostas | Published: 2026-01-28

IoT Device Identification

Data Protection Method

Machine Learning Technology

2026.01.28 2026.01.30

Eliciting Least-to-Most Reasoning for Phishing URL Detection

Authors: Holly Trikilis, Pasindu Marasinghe, Fariza Rashid, Suranga Seneviratne | Published: 2026-01-28

LLM Performance Evaluation

Prompt Injection

Prompt Leaking

2026.01.28 2026.01.30

GAVEL: Towards rule-based safety through activation monitoring

Authors: Shir Rozenfeld, Rahul Pankajakshan, Itay Zloczower, Eyal Lenga, Gilad Gressel, Yisroel Mirsky | Published: 2026-01-27

LLM Performance Evaluation

Indirect Prompt Injection

Data Generation Method

2026.01.27 2026.01.29

RvB: Automating AI System Hardening via Iterative Red-Blue Games

Authors: Lige Huang, Zicheng Liu, Jie Zhang, Lewen Yan, Dongrui Liu, Jing Shao | Published: 2026-01-27

Relationship of AI Systems

Adversarial Learning

Automated Vulnerability Remediation

2026.01.27 2026.01.29
