文献データベース

Exploring User Privacy Awareness on GitHub: An Empirical Study

Authors: Costanza Alfieri, Juri Di Rocco, Paola Inverardi, Phuong T. Nguyen | Published: 2024-09-06 | Updated: 2024-09-10
プライバシー保護
プロンプトインジェクション
ユーザー活動解析

Understanding Data Importance in Machine Learning Attacks: Does Valuable Data Pose Greater Harm?

Authors: Rui Wen, Michael Backes, Yang Zhang | Published: 2024-09-05
バックドア攻撃
プライバシー保護手法
メンバーシップ推論

Robust Q-Learning under Corrupted Rewards

Authors: Sreejeet Maity, Aritra Mitra | Published: 2024-09-05
アルゴリズム
収束保証
強化学習

Towards Autonomous Cybersecurity: An Intelligent AutoML Framework for Autonomous Intrusion Detection

Authors: Li Yang, Abdallah Shami | Published: 2024-09-05
フレームワーク
モデル性能評価
自動化された侵入検知システム

Well, that escalated quickly: The Single-Turn Crescendo Attack (STCA)

Authors: Alan Aqrawi, Arian Abbasi | Published: 2024-09-04 | Updated: 2024-09-10
LLMセキュリティ
コンテンツモデレーション
攻撃手法

Obsidian: Cooperative State-Space Exploration for Performant Inference on Secure ML Accelerators

Authors: Sarbartha Banerjee, Shijia Wei, Prakash Ramrakhyani, Mohit Tiwari | Published: 2024-09-04
エネルギー管理
メモリ管理手法
最適化問題

Boosting Certified Robustness for Time Series Classification with Efficient Self-Ensemble

Authors: Chang Dong, Zhengyang Li, Liangwei Zheng, Weitong Chen, Wei Emma Zhang | Published: 2024-09-04 | Updated: 2024-09-19
敵対的サンプル
評価手法
透かし評価

“Yes, My LoRD.” Guiding Language Model Extraction with Locality Reinforced Distillation

Authors: Zi Liang, Qingqing Ye, Yanyun Wang, Sen Zhang, Yaxin Xiao, Ronghua Li, Jianliang Xu, Haibo Hu | Published: 2024-09-04 | Updated: 2025-05-19
LLMセキュリティ
モデル抽出攻撃
透かし技術

AdvSecureNet: A Python Toolkit for Adversarial Machine Learning

Authors: Melih Catal, Manuel Günther | Published: 2024-09-04
ツールキット比較
敵対的訓練
評価手法

Adversarial Attacks on Machine Learning-Aided Visualizations

Authors: Takanori Fujiwara, Kostiantyn Kucher, Junpeng Wang, Rafael M. Martins, Andreas Kerren, Anders Ynnerman | Published: 2024-09-04 | Updated: 2024-09-24
バックドア攻撃
敵対的サンプル
視覚化の脆弱性