Real-world prompt injection and jailbreak attempts
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms
Attack Goal
get model to respond as misinformation harms