強化学習環境

Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments

Authors: Maria Rigaki, Ondřej Lukáš, Carlos A. Catania, Sebastian Garcia | Published: 2023-08-23 | Updated: 2023-08-28

LLMセキュリティ

実験的検証

強化学習環境

2023.08.23 2025.04.03

文献データベース

New intelligent defense systems to reduce the risks of Selfish Mining and Double-Spending attacks using Learning Automata

Authors: Seyed Ardalan Ghoreishi, Mohammad Reza Meybodi | Published: 2023-07-02 | Updated: 2024-03-08

アルゴリズム設計

セキュリティ保証

強化学習環境

2023.07.02 2025.04.03

文献データベース

Query Rewriting for Retrieval-Augmented Large Language Models

Authors: Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao, Nan Duan | Published: 2023-05-23 | Updated: 2023-10-23

RAG

強化学習環境

情報検索

2023.05.23 2025.04.03

文献データベース

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

Authors: Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han | Published: 2022-06-06 | Updated: 2022-10-22

ロバスト性

不確実性評価

強化学習環境

2022.06.06 2025.04.03

文献データベース

Deep Reinforcement Learning for Cybersecurity Threat Detection and Protection: A Review

Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore | Published: 2022-06-06

エンドポイント検出

ネットワーク脅威検出

強化学習環境

2022.06.06 2025.04.03

文献データベース

MDLdroid: a ChainSGD-reduce Approach to Mobile Deep Learning for Personal Mobile Sensing

Authors: Yu Zhang, Tao Gu, Xi Zhang | Published: 2020-02-07 | Updated: 2020-02-15

スケジューリング手法

トレードオフ分析

強化学習環境

2020.02.07 2025.04.03

文献データベース

Policy Poisoning in Batch Reinforcement Learning and Control

Authors: Yuzhe Ma, Xuezhou Zhang, Wen Sun, Xiaojin Zhu | Published: 2019-10-13 | Updated: 2019-10-31

強化学習環境

攻撃の評価

攻撃者や悪意のあるデバイス

2019.10.13 2025.04.03

文献データベース

Defensive Escort Teams via Multi-Agent Deep Reinforcement Learning

Authors: Arpit Garg, Yazied A. Hasan, Adam Yañez, Lydia Tapia | Published: 2019-10-09

リスク評価

実験的検証

強化学習環境

2019.10.09 2025.04.03

文献データベース