AIセキュリティポータルbot | ページ 73 | AIセキュリティポータル

Regularized Robustly Reliable Learners and Instance Targeted Attacks

Authors: Avrim Blum, Donya Saless | Published: 2024-10-14 | Updated: 2025-04-29

サンプル複雑性

ロバスト性評価

ロバスト最適化

2024.10.14

文献データベース

Model-based Large Language Model Customization as Service

Authors: Zhaomin Wu, Jizhou Guo, Junyi Hou, Bingsheng He, Lixin Fan, Qiang Yang | Published: 2024-10-14 | Updated: 2025-05-22

テキスト生成手法

プライバシー管理

差分プライバシー

2024.10.14

文献データベース

Unified Breakdown Analysis for Byzantine Robust Gossip

Authors: Renaud Gaucher, Aymeric Dieuleveut, Hadrien Hendrikx | Published: 2024-10-14 | Updated: 2025-02-03

フレームワーク

攻撃手法

2024.10.14 2025.04.03

文献データベース

On Calibration of LLM-based Guard Models for Reliable Content Moderation

Authors: Hongfu Liu, Hengguan Huang, Hao Wang, Xiangming Gu, Ye Wang | Published: 2024-10-14

LLM性能評価

コンテンツモデレーション

プロンプトインジェクション

2024.10.14 2025.04.03

文献データベース

Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks

Authors: Binghui Li, Zhixuan Pan, Kaifeng Lyu, Jian Li | Published: 2024-10-14

収束分析

敵対的サンプル

2024.10.14 2025.04.03

文献データベース

Yuan: Research on the Concept of Digital World Analogue Scientific Infrastructure and Science Popularization Communication Based on Suzhou Gardens Pattern

Authors: Zhang Lvyang, Lu Wen, Zhao Yang, Li Jiaqi, Zhai Lidong | Published: 2024-10-14

サイバーセキュリティ

2024.10.14 2025.04.03

文献データベース

Can LLMs be Scammed? A Baseline Measurement Study

Authors: Udari Madhushani Sehwag, Kelly Patel, Francesca Mosca, Vineeth Ravi, Jessica Staddon | Published: 2024-10-14

LLM性能評価

プロンプトインジェクション

評価手法

2024.10.14 2025.04.03

文献データベース

Evaluating of Machine Unlearning: Robustness Verification Without Prior Modifications

Authors: Heng Xu, Tianqing Zhu, Wanlei Zhou | Published: 2024-10-14

損失項

最適化問題

2024.10.14 2025.04.03

文献データベース

Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution

Authors: Ankita Sinha, Wendi Cui, Kamalika Das, Jiaxin Zhang | Published: 2024-10-12

プロンプトインジェクション

マルチオブジェクティブプロンプト最適化

2024.10.12 2025.04.03

文献データベース

Minimax rates of convergence for nonparametric regression under adversarial attacks

Authors: Jingfu Peng, Yuhong Yang | Published: 2024-10-12

敵対的サンプル

敵対的訓練

2024.10.12 2025.04.03

文献データベース