文献データベース

BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards

Authors: Diego Dorn, Alexandre Variengien, Charbel-Raphaël Segerie, Vincent Corruble | Published: 2024-06-03

LLMセキュリティ

コンテンツモデレーション

プロンプトインジェクション

2024.06.03 2025.04.03

文献データベース

FedAdOb: Privacy-Preserving Federated Deep Learning with Adaptive Obfuscation

Authors: Hanlin Gu, Jiahuan Luo, Yan Kang, Yuan Yao, Gongxi Zhu, Bowen Li, Lixin Fan, Qiang Yang | Published: 2024-06-03

ウォーターマーキング

プライバシー保護手法

モデル性能評価

2024.06.03 2025.04.03

文献データベース

No Vandalism: Privacy-Preserving and Byzantine-Robust Federated Learning

Authors: Zhibo Xing, Zijian Zhang, Zi'ang Zhang, Jiamou Liu, Liehuang Zhu, Giovanni Russello | Published: 2024-06-03

ウォーターマーキング

バックドア攻撃

ポイズニング

2024.06.03 2025.04.03

文献データベース

Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients

Authors: Weijun Li, Qiongkai Xu, Mark Dras | Published: 2024-06-03 | Updated: 2024-10-04

ウォーターマーキング

データプライバシー評価

プライバシー保護手法

2024.06.03 2025.04.03

文献データベース

BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models

Authors: Jiaqi Xue, Mengxin Zheng, Yebowen Hu, Fei Liu, Xun Chen, Qian Lou | Published: 2024-06-03 | Updated: 2024-06-06

LLM性能評価

クエリの多様性

クエリ生成手法

2024.06.03 2025.04.03

文献データベース

A Synergistic Approach In Network Intrusion Detection By Neurosymbolic AI

Authors: Alice Bizzarri, Chung-En Yu, Brian Jalaian, Fabrizio Riguzzi, Nathaniel D. Bastian | Published: 2024-06-03

NSAI統合

モデルの解釈性

未知の攻撃検出

2024.06.03 2025.04.03

文献データベース

Constrained Adaptive Attack: Effective Adversarial Attack Against Deep Neural Networks for Tabular Data

Authors: Thibault Simonetto, Salah Ghamizi, Maxime Cordy | Published: 2024-06-02

CAPGDアルゴリズム

攻撃手法

敵対的訓練

2024.06.02 2025.04.03

文献データベース

Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models

Authors: Garrett Crumrine, Izzat Alsmadi, Jesus Guerrero, Yuvaraj Munian | Published: 2024-06-02

LLMセキュリティ

サイバーセキュリティ

倫理的ガイドライン遵守

2024.06.02 2025.04.03

文献データベース

VeriSplit: Secure and Practical Offloading of Machine Learning Inferences across IoT Devices

Authors: Han Zhang, Zifan Wang, Mihir Dhamankar, Matt Fredrikson, Yuvraj Agarwal | Published: 2024-06-02 | Updated: 2025-03-31

ウォーターマーキング

データプライバシー評価

計算効率

2024.06.02 2025.04.03

文献データベース

Exploring Vulnerabilities and Protections in Large Language Models: A Survey

Authors: Frank Weizhen Liu, Chenhui Hu | Published: 2024-06-01

LLMセキュリティ

プロンプトインジェクション

防御手法

2024.06.01 2025.04.03

文献データベース