AIセキュリティポータルbot

SABLE: Secure And Byzantine robust LEarning

Authors: Antoine Choffrut, Rachid Guerraoui, Rafael Pinot, Renaud Sirdey, John Stephan, Martin Zuber | Published: 2023-09-11 | Updated: 2023-12-14
ウォーターマーキング
ビザンチン耐性
プライバシー保護手法

FuzzLLM: A Novel and Universal Fuzzing Framework for Proactively Discovering Jailbreak Vulnerabilities in Large Language Models

Authors: Dongyu Yao, Jianshu Zhang, Ian G. Harris, Marcel Carlsson | Published: 2023-09-11 | Updated: 2024-04-14
LLMセキュリティ
ウォーターマーキング
プロンプトインジェクション

Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis

Authors: Li Du, Yequan Wang, Xingrun Xing, Yiqun Ya, Xiang Li, Xin Jiang, Xuezhi Fang | Published: 2023-09-11
ハルシネーションの検知
人工知能の役割
生成AI向け電子透かし

Outlier Robust Adversarial Training

Authors: Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu | Published: 2023-09-10
収束特性
損失項
敵対的攻撃

DAD++: Improved Data-free Test Time Adversarial Defense

Authors: Gaurav Kumar Nayak, Inder Khatri, Shubham Randive, Ruchit Rawal, Anirban Chakraborty | Published: 2023-09-10
敵対的サンプル
敵対的攻撃
防御手法

Classification of Spam URLs Using Machine Learning Approaches

Authors: Omar Husni Odeh, Anas Arram, Murad Njoum | Published: 2023-09-10 | Updated: 2023-12-03
スパム検出
文献リスト
機械学習手法

Bicoptor 2.0: Addressing Challenges in Probabilistic Truncation for Enhanced Privacy-Preserving Machine Learning

Authors: Lijing Zhou, Qingrui Song, Su Zhang, Ziyu Wang, Xianggui Wang, Yong Li | Published: 2023-09-10 | Updated: 2024-03-06
MPCアルゴリズム
多者計算
通信コスト削減

Compact: Approximating Complex Activation Functions for Secure Computation

Authors: Mazharul Islam, Sunpreet S. Arora, Rahul Chatterjee, Peter Rindal, Maliheh Shirvanian | Published: 2023-09-09 | Updated: 2024-03-17
MPCアルゴリズム
多者計算
機械学習技術

Adversarially Robust Deep Learning with Optimal-Transport-Regularized Divergences

Authors: Jeremiah Birrell, Mohammadreza Ebrahimi | Published: 2023-09-07
悪意のあるデモ構築
敵対的攻撃
防御手法

Enhancing Trustworthiness in ML-Based Network Intrusion Detection with Uncertainty Quantification

Authors: Jacopo Talpini, Fabio Sartori, Marco Savi | Published: 2023-09-05 | Updated: 2024-04-09
Out-of-Distribution検出
アクティブラーニング
不確実性評価