AIセキュリティポータルbot | ページ 89 | AIセキュリティポータル

Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack

Authors: Jing Xue, Zhishen Sun, Haishan Ye, Luo Luo, Xiangyu Chang, Ivor Tsang, Guang Dai | Published: 2025-06-03

プライバシー分析

敵対的サンプル

透かし評価

2025.06.03

文献データベース

Tarallo: Evading Behavioral Malware Detectors in the Problem Space

Authors: Gabriele Digregorio, Salvatore Maccarrone, Mario D'Onghia, Luigi Gallo, Michele Carminati, Mario Polino, Stefano Zanero | Published: 2025-06-03

APIセキュリティ

動的分析手法

行動解析手法

2025.06.03

文献データベース

CyberGym: Evaluating AI Agents’ Cybersecurity Capabilities with Real-World Vulnerabilities at Scale

Authors: Zhun Wang, Tianneng Shi, Jingxuan He, Matthew Cai, Jialin Zhang, Dawn Song | Published: 2025-06-03

プロンプトインジェクション

動的分析手法

透かし評価

2025.06.03

文献データベース

Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems

Authors: Pengfei He, Zhenwei Dai, Xianfeng Tang, Yue Xing, Hui Liu, Jingying Zeng, Qiankun Peng, Shrivats Agrawal, Samarth Varshney, Suhang Wang, Jiliang Tang, Qi He | Published: 2025-06-03

インダイレクトプロンプトインジェクション

モデルDoS

倫理的考慮

2025.06.03

文献データベース

BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage

Authors: Kalyan Nakka, Nitesh Saxena | Published: 2025-06-03

LLMの安全機構の解除

フィッシング攻撃の検出率

プロンプトインジェクション

2025.06.03

文献データベース

A Review of Various Datasets for Machine Learning Algorithm-Based Intrusion Detection System: Advances and Challenges

Authors: Sudhanshu Sekhar Tripathy, Bichitrananda Behera | Published: 2025-06-03

トリガーの検知

侵入検知システム

検出手法の分析

2025.06.03

文献データベース

MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models

Authors: Xueqi Cheng, Minxing Zheng, Shixiang Zhu, Yushun Dong | Published: 2025-06-03

モデル抽出攻撃

モデル抽出攻撃の検知

防御手法

2025.06.03

文献データベース

IF-GUIDE: Influence Function-Guided Detoxification of LLMs

Authors: Zachary Coalson, Juhan Bae, Nicholas Carlini, Sanghyun Hong | Published: 2025-06-02 | Updated: 2025-06-09

テキストデトキシフィケーション

倫理声明

影響関数

2025.06.02

文献データベース

SALAD: Systematic Assessment of Machine Unlearning on LLM-Aided Hardware Design

Authors: Zeng Wang, Minghao Shao, Rupesh Karn, Likhitha Mankali, Jitendra Bhandari, Ramesh Karri, Ozgur Sinanoglu, Muhammad Shafique, Johann Knechtel | Published: 2025-06-02 | Updated: 2025-08-05

データ駆動型脆弱性評価

プロンプトリーキング

透かし

2025.06.02

文献データベース

On the Stability of Graph Convolutional Neural Networks: A Probabilistic Perspective

Authors: Ning Zhang, Henry Kenlay, Li Zhang, Mihai Cucuringu, Xiaowen Dong | Published: 2025-06-01 | Updated: 2025-06-03

動的グラフ処理

敵対的学習

最適化問題

2025.06.01

文献データベース