フレームワーク

VerifBFL: Leveraging zk-SNARKs for A Verifiable Blockchained Federated Learning

Authors: Ahmed Ayoub Bellachia, Mouhamed Amine Bouchiha, Yacine Ghamri-Doudane, Mourad Rabah | Published: 2025-01-08
プライバシー保護
フレームワーク
ポイズニング

A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models

Authors: Yinpeng Cai, Lexin Li, Linjun Zhang | Published: 2025-01-05
フレームワーク
仮説検定
透かし評価

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Authors: Yanjiang Liu, Shuhen Zhou, Yaojie Lu, Huijia Zhu, Weiqiang Wang, Hongyu Lin, Ben He, Xianpei Han, Le Sun | Published: 2025-01-03
フレームワーク
プロンプトインジェクション
攻撃手法

BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction

Authors: Alaeddine Diaf, Abdelaziz Amara Korba, Nour Elislem Karabadji, Yacine Ghamri-Doudane | Published: 2025-01-03
サイバーセキュリティ
ネットワーク脅威検出
フレームワーク

Safeguarding Large Language Models in Real-time with Tunable Safety-Performance Trade-offs

Authors: Joao Fonseca, Andrew Bell, Julia Stoyanovich | Published: 2025-01-02
フレームワーク
プロンプトインジェクション
安全性アライメント

On Large Language Models in Mission-Critical IT Governance: Are We Ready Yet?

Authors: Matteo Esposito, Francesco Palagiano, Valentina Lenarduzzi, Davide Taibi | Published: 2024-12-16 | Updated: 2025-01-10
プライバシー保護
フレームワーク
リスク管理

CantorNet: A Sandbox for Testing Geometrical and Topological Complexity Measures

Authors: Michal Lewandowski, Hamid Eghbalzadeh, Bernhard A. Moser | Published: 2024-11-29 | Updated: 2025-01-28
フレームワーク

Privacy-Preserving Federated Learning with Differentially Private Hyperdimensional Computing

Authors: Fardin Jalil Piran, Zhiling Chen, Mohsen Imani, Farhad Imani | Published: 2024-11-02 | Updated: 2025-03-22
プライバシー保護
フレームワーク

CausAdv: A Causal-based Framework for Detecting Adversarial Examples

Authors: Hichem Debbi | Published: 2024-10-29
フレームワーク
敵対的サンプル

Unified Breakdown Analysis for Byzantine Robust Gossip

Authors: Renaud Gaucher, Aymeric Dieuleveut, Hadrien Hendrikx | Published: 2024-10-14 | Updated: 2025-02-03
フレームワーク
攻撃手法