AIセキュリティポータルbot

A Systematic and Formal Study of the Impact of Local Differential Privacy on Fairness: Preliminary Results

Authors: Karima Makhlouf, Tamara Stefanovic, Heber H. Arcolezi, Catuscia Palamidessi | Published: 2024-05-23
バイアス
プライバシー保護
プライバシー保護手法

A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions

Authors: Mohammed Hassanin, Nour Moustafa | Published: 2024-05-23
LLMセキュリティ
サイバーセキュリティ
プロンプトインジェクション

Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model

Authors: Tudor Cebere, Aurélien Bellet, Nicolas Papernot | Published: 2024-05-23 | Updated: 2024-10-14
データプライバシー評価
プライバシー保護手法
メンバーシップ推論

Evaluation of the Programming Skills of Large Language Models

Authors: Luc Bryan Heitz, Joun Chamas, Christopher Scherb | Published: 2024-05-23
LLM性能評価
コード生成
データ収集

Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data

Authors: Haoran Li, Xinyuan Zhao, Dadi Guo, Hanlin Gu, Ziqian Zeng, Yuxing Han, Yangqiu Song, Lixin Fan, Qiang Yang | Published: 2024-05-23
Few-Shot Learning
データセット生成
プライバシー保護手法

S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models

Authors: Xiaohan Yuan, Jinfeng Li, Dongxia Wang, Yuefeng Chen, Xiaofeng Mao, Longtao Huang, Jialuo Chen, Hui Xue, Xiaoxia Liu, Wenhai Wang, Kui Ren, Jingyi Wang | Published: 2024-05-23 | Updated: 2025-04-07
リスク分析手法
大規模言語モデル
安全性アライメント

Memory Scraping Attack on Xilinx FPGAs: Private Data Extraction from Terminated Processes

Authors: Bharadwaj Madabhushi, Sandip Kundu, Daniel Holcomb | Published: 2024-05-22
FPGA
ウォーターマーキング
メモリ管理手法

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Authors: Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz, Philip H. S. Torr, Adel Bibi | Published: 2024-05-22
評価手法
透かし評価
難易度キャリブレーション

Naturally Private Recommendations with Determinantal Point Processes

Authors: Jack Fitzsimons, Agustín Freitas Pasqualini, Robert Pisarczyk, Dmitrii Usynin | Published: 2024-05-22
ウォーターマーキング
プライバシー保護手法
透かし評価

WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness

Authors: Baizhou Huang, Xiaojun Wan | Published: 2024-05-22
ウォーターマーキング
透かしの耐久性
透かし評価