Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs
            
Authors: Alexander Panfilov, Evgenii Kortukov, Kristina Nikolić, Matthias Bethge, Sebastian Lapuschkin, Wojciech Samek, Ameya Prabhu, Maksym Andriushchenko, Jonas Geiping | Published: 2025-09-22
Hallucination
Weapon Design Methods
Fraud Techniques