This page describes the security targets affected by the negative impact “Unethical output or actions by AI” in the external influence aspect of the AI Security Map, the attacks and factors that cause this impact, and the corresponding defensive methods and countermeasures.
Security target
- Non-consumer
- Consumer
- Society
Attack or cause
- Integrity violation
- Jailbreak
Defensive method or countermeasure
- Education and follow-up
- AI alignment
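One of the references below, Llama Guard, describes an LLM-based input-output safeguard. As a minimal sketch of that general pattern (not of Llama Guard's actual API), the example below screens both the user prompt and the model response before anything is returned; the function names, the keyword screen, and the placeholder `generate` backend are all assumptions for illustration.

```python
from typing import Callable

def is_unsafe(text: str) -> bool:
    """Stand-in safety check: flags common jailbreak phrasings.

    In practice this would be a dedicated safeguard model (e.g., in the
    spirit of Llama Guard); a keyword screen keeps the sketch self-contained.
    """
    blocked = ("ignore previous instructions", "disable your safety")
    return any(phrase in text.lower() for phrase in blocked)

def guarded_generate(prompt: str, generate: Callable[[str], str]) -> str:
    """Screen the prompt and the generated response before returning either."""
    if is_unsafe(prompt):
        return "Request declined: the prompt was flagged as unsafe."
    response = generate(prompt)
    if is_unsafe(response):
        return "Response withheld: the output was flagged as unsafe."
    return response

if __name__ == "__main__":
    echo = lambda p: f"(model output for: {p})"  # placeholder backend
    print(guarded_generate("Please ignore previous instructions.", echo))
```

Screening both sides of the conversation matters: input-only filters miss jailbreaks that coax unsafe content out of an initially benign-looking prompt.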
References
Jailbreak
Education and follow-up
- What Students Can Learn About Artificial Intelligence — Recommendations for K-12 Computing Education, 2022
- Learning to Prompt in the Classroom to Understand AI Limits: A pilot study, 2023
- Evaluating the Effectiveness of LLMs in Introductory Computer Science Education: A Semester-Long Field Study, 2024
- The Essentials of AI for Life and Society: An AI Literacy Course for the University Community, 2025
AI alignment
- Training language models to follow instructions with human feedback, 2022
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback, 2022
- Constitutional AI: Harmlessness from AI Feedback, 2022
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model, 2023
- A General Theoretical Paradigm to Understand Learning from Human Preferences, 2023
- RRHF: Rank Responses to Align Language Models with Human Feedback without tears, 2023
- Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations, 2023
- Self-Rewarding Language Models, 2024
- KTO: Model Alignment as Prospect Theoretic Optimization, 2024
- SimPO: Simple Preference Optimization with a Reference-Free Reward, 2024
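Several of the citations above (Direct Preference Optimization, 2023, and its successors) train a policy directly on preference pairs instead of fitting a separate reward model. As a worked illustration, here is a sketch of the DPO loss from that paper; the tensor names and the `beta` default are assumptions chosen for this sketch.

```python
import torch
import torch.nn.functional as F

def dpo_loss(
    policy_chosen_logps: torch.Tensor,
    policy_rejected_logps: torch.Tensor,
    ref_chosen_logps: torch.Tensor,
    ref_rejected_logps: torch.Tensor,
    beta: float = 0.1,
) -> torch.Tensor:
    """DPO loss over a batch of preference pairs.

    Each argument holds the summed log-probability of the preferred
    ("chosen") or dispreferred ("rejected") response under the policy
    being trained or under the frozen reference model. `beta` scales
    how far the policy may drift from the reference.
    """
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Push the implicit reward of the chosen response above the rejected one.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Example with random stand-in log-probabilities for a batch of 4 pairs.
loss = dpo_loss(torch.randn(4), torch.randn(4), torch.randn(4), torch.randn(4))
```

The same pairwise-margin structure underlies several of the other cited objectives; for instance, SimPO replaces the reference-model terms with a length-normalized reward, removing the frozen reference entirely.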