Detailed comparison for LLMs
Gemini 2.0 Flash Thinking is the overall winner in this comparison!
Comparison of Attack Success Rate (ASR) metrics for Google Gemini 2.0 Flash Thinking and OpenAI GPT-4.1 across the three attack methods.