Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

TOP 文献データベース Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

arxiv

AIセキュリティポータルbot

文献データベースの情報は、自動的に収集されています。

Source

https://arxiv.org/abs/2509.04403

PDF

https://arxiv.org/pdf/2509.04403

文献情報

作者: Jingen Qu,Lijun Li,Bo Zhang,Yichen Yan,Jing Shao
公開日: 2025-9-5
所属機関: Tongji University
所属の国: China
会議名

AIにより推定されたラベル

プロンプトインジェクションリスク分析手法安全性評価手法

※ こちらのラベルはAIによって自動的に追加されました。そのため、正確でないことがあります。
詳細は文献データベースについてをご覧ください。

Abstract

Multimodal large language models (MLLMs) are rapidly evolving, presenting increasingly complex safety challenges. However, current dataset construction methods, which are risk-oriented, fail to cover the growing complexity of real-world multimodal safety scenarios (RMS). And due to the lack of a unified evaluation metric, their overall effectiveness remains unproven. This paper introduces a novel image-oriented self-adaptive dataset construction method for RMS, which starts with images and end constructing paired text and guidance responses. Using the image-oriented method, we automatically generate an RMS dataset comprising 35k image-text pairs with guidance responses. Additionally, we introduce a standardized safety dataset evaluation metric: fine-tuning a safety judge model and evaluating its capabilities on other safety datasets.Extensive experiments on various tasks demonstrate the effectiveness of the proposed image-oriented pipeline. The results confirm the scalability and effectiveness of the image-oriented approach, offering a new perspective for the construction of real-world multimodal safety datasets.