Evaluating the Efficacy of Prompt-Engineered Large Multimodal Models Versus Fine-Tuned Vision Transformers in Image-Based Security Applications

Learning and individual differences

Chatgpt for good? on opportunities and challenges of large language models for education

Enkelejda Kasneci, Kathrin Seßler, Stefan Küchemann, Maria Bannert, Daryna Dementieva, Frank Fischer, Urs Gasser, Georg Groh, Stephan Günnemann, Eyke Hüller-meier, et al.

Published: 2023

Mme: A comprehensive evaluation benchmark for multimodal large language models

Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun

Published: 2023

2023 IEEE International Conference on Big Data (BigData)

Multimodal large language models: A survey

Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, S Yu Philip

Published: 2023

Gemini: A Family of Highly Capable Multimodal Models

Google DeepMind

Published: 2024

Improved baselines with visual instruction tuning

Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee

Published: 2023

Minigpt-4: Enhancing vision-language understanding with advanced large language models

Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny

Published: 2023

When vision meets reality: Exploring the clinical applicability of gpt-4 with vision

Jiawen Deng, Kiyan Heybati, Matthew Shammas-Toma

Published: 2024

Mm-vet: Evaluating large multimodal models for integrated capabilities

Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang

Published: 2023

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N.

Published: 2021

arxiv

被引用数 1

AAAI Conference on Artificial Intelligence (AAAI)

Hidden Trigger Backdoor Attacks

Aniruddha Saha, Akshayvarun Subramanya, Hamed Pirsiavash

Published: 2019.10.1

With the success of deep learning algorithms in various domains, studying adversarial attacks to secure deep models in real world applications has become an important research topic. Backdoor attacks are a form of adversarial attacks on deep networks where the attacker provides poisoned data to the victim to train the model with, and then activates the attack by showing a specific small trigger pattern at the test time. Most state-of-the-art backdoor attacks either provide mislabeled poisoning data that is possible to identify by visual inspection, reveal the trigger in the poisoned data, or use noise to hide the trigger. We propose a novel form of backdoor attack where poisoned data look natural with correct labels and also more importantly, the attacker hides the trigger in the poisoned data and keeps the trigger secret until the test time. We perform an extensive study on various image classification settings and show that our attack can fool the model by pasting the trigger at random locations on unseen images although the model performs well on clean data. We also show that our proposed attack cannot be easily defended using a state-of-the-art defense algorithm for backdoor attacks.

バックドア攻撃敵対的攻撃トレーニングデータ生成

IEEE Signal Processing Magazine

The mnist database of handwritten digit images for machine learning research

Li Deng

Published: 2012

2019 27th Signal Processing and Communications Applications Conference (SIU)

Utilization and comparision of convolutional neural networks in malware recognition

Ahmet Selman Bozkir, Ahmet Ogulcan Cankaya, Murat Aydos

Published: 2019

IEEE transactions on pattern analysis and machine intelligence

A survey on vision transformer

Kai Han, Yunhe Wang, Hanting Chen, Xinghao Chen, Jianyuan Guo, Zhenhua Liu, Yehui Tang, An Xiao, Chunjing Xu, Yixing Xu

Published: 2022

A battle of network structures: An empirical study of cnn, transformer, and mlp

Yucheng Zhao, Guangting Wang, Chuanxin Tang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha

Published: 2021

Procedia Computer Science

Vision transformer outperforms deep convolutional neural network-based model in classifying x-ray images

Om Uparkar, Jyoti Bharti, RK Pateriya, Rajeev Kumar Gupta, Ashutosh Sharma

Published: 2023