Detailed comparison for LLMs
It's a close match between DeepSeek-V3.2 Non-thinking and Gemini 2.0 Flash Thinking!
Comparison of Attack Success Rate (ASR) metrics for DeepSeek DeepSeek-V3.2 Non-thinking and Google Gemini 2.0 Flash Thinking across the three attack methods.
No specific highlights generated for this comparison.