Detailed comparison for LLMs
It's a close match between Gemini 2.0 Flash Thinking and GPT OSS 120B!
Comparison of Attack Success Rate (ASR) metrics for Google Gemini 2.0 Flash Thinking and OpenAI GPT OSS 120B across the three attack methods.
No specific highlights generated for this comparison.