AI Security Portal bot | Page 2 | AI Security Portal

PAL*M: Property Attestation for Large Generative Models

Authors: Prach Chantasantitam, Adam Ilyas Caulfield, Vasisht Duddu, Lachlan J. Gunn, N. Asokan | Published: 2026-01-22

RAG

Security Assurance

Framework

2026.01.22

Literature Database

On damage of interpolation to adversarial robustness in regression

Authors: Jingfu Peng, Yuhong Yang | Published: 2026-01-22

Backdoor Model Detection

Robustness Evaluation

Adversarial Learning

2026.01.22

Literature Database

CAFE-GB: Scalable and Stable Feature Selection for Malware Detection via Chunk-wise Aggregated Gradient Boosting

Authors: Ajvad Haneef K, Karan Kuwar Singh, Madhu Kumar S D | Published: 2026-01-22

Machine Learning Algorithms

Feature Selection Methods

Interpretability

2026.01.22

Literature Database

Connect the Dots: Knowledge Graph-Guided Crawler Attack on Retrieval-Augmented Generation Systems

Authors: Mengyu Yao, Ziqi Zhang, Ning Luo, Shaofei Li, Yifeng Cai, Xiangqun Chen, Yao Guo, Ding Li | Published: 2026-01-22

Poisoning Attacks on RAG

Robustness Evaluation

Knowledge Graph Design

2026.01.22

Literature Database

Predictive Coding and Information Bottleneck for Hallucination Detection in Large Language Models

Authors: Manish Bhatt | Published: 2026-01-22

Hallucination Detection

Framework

Interpretability

2026.01.22

Literature Database

Data-Free Privacy-Preserving for LLMs via Model Inversion and Selective Unlearning

Authors: Xinjie Zhou, Zhihui Yang, Lechao Cheng, Sai Wu, Gang Chen | Published: 2026-01-22

LLM Utilization

Privacy Protection

Differential Privacy

2026.01.22

Literature Database

Lightweight LLMs for Network Attack Detection in IoT Networks

Authors: Piyumi Bhagya Sudasinghe, Kushan Sudheera Kalupahana Liyanage, Harsha S. Gardiyawasam Pussewalage | Published: 2026-01-21

IoT Security Risks

LLM Utilization

Poisoning Attacks on RAG

2026.01.21

Literature Database

NeuroFilter: Privacy Guardrails for Conversational LLM Agents

Authors: Saswat Das, Ferdinando Fioretto | Published: 2026-01-21

Privacy Protection

Prompt Injection

Multi-turn Attack Analysis

2026.01.21

Literature Database

An LLM Agent-based Framework for Whaling Countermeasures

Authors: Daisuke Miyamoto, Takuji Iimura, Narushige Michishita | Published: 2026-01-21

Indirect Prompt Injection

Email Security

Risk Scenario Generation

2026.01.21

Literature Database

Constructing Multi-label Hierarchical Classification Models for MITRE ATT&CK Text Tagging

Authors: Andrew Crossman, Jonah Dodd, Viralam Ramamurthy Chaithanya Kumar, Riyaz Mohammed, Andrew R. Plummer, Chandra Sekharudu, Deepak Warrier, Mohammad Yekrangian | Published: 2026-01-21

Threat Actor Support

Watermarking Techniques

Hierarchical Classification Methods

2026.01.21

Literature Database