Mistral Mistral Large 3 675B Instruct 2512 Eagle vs OpenAI GPT-4.1
Detailed comparison for LLMs
Head-to-Head Overview
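GPT-4.1 is the overall winner in this comparison: it has the lower Attack Success Rate (ASR) on every metric reported here, and a lower ASR means fewer attacks succeeded against the model.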
LLM Metric
Comparison of Attack Success Rate (ASR) metrics for Mistral Mistral Large 3 675B Instruct 2512 Eagle and OpenAI GPT-4.1 across the three attack methods.
Metric                          Mistral Large 3 675B Instruct 2512 Eagle    GPT-4.1
Overall (ASR)                   100.000                                     34.900
TAP Attack Method (ASR)         100.000                                     57.580
Crescendo Attack Method (ASR)   100.000                                     44.850
Zero-Shot (ASR)                 100.000                                     2.290
Key Highlights
OpenAI GPT-4.1 has a lower Overall (ASR).
OpenAI GPT-4.1 has a lower TAP Attack Method (ASR).
OpenAI GPT-4.1 has a lower Crescendo Attack Method (ASR).
OpenAI GPT-4.1 has a lower Zero-Shot (ASR).
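The highlights above follow mechanically from the reported numbers: for each metric, the model with the lower ASR is listed. A minimal Python sketch of that derivation is shown here. The dictionary layout and the function names (lower_asr_model, highlights, unweighted_method_mean) are illustrative assumptions, not part of the source page; only the ASR values are taken from the comparison. The source does not say how the Overall figure is aggregated, so the unweighted mean computed at the end is only a plausibility check, not the page's method.

```python
# ASR values copied from the comparison above.
# Lower ASR means fewer attacks succeeded against the model.
MISTRAL = "Mistral Large 3 675B Instruct 2512 Eagle"
GPT = "GPT-4.1"

ASR = {
    "Overall": {MISTRAL: 100.000, GPT: 34.900},
    "TAP Attack Method": {MISTRAL: 100.000, GPT: 57.580},
    "Crescendo Attack Method": {MISTRAL: 100.000, GPT: 44.850},
    "Zero-Shot": {MISTRAL: 100.000, GPT: 2.290},
}


def lower_asr_model(scores):
    """Return the model with the lowest ASR, or None if the lowest value is tied."""
    ranked = sorted(scores.items(), key=lambda item: item[1])
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def highlights(table):
    """Map each metric name to the model with the lower ASR on that metric."""
    return {metric: lower_asr_model(scores) for metric, scores in table.items()}


def unweighted_method_mean(table, model):
    """Average a model's ASR over the per-method rows (everything except Overall).

    Assumption: a plain unweighted mean. The source does not state how its
    Overall figure is computed, so this is only a rough cross-check.
    """
    values = [scores[model] for metric, scores in table.items() if metric != "Overall"]
    return sum(values) / len(values)


if __name__ == "__main__":
    for metric, winner in highlights(ASR).items():
        print(f"{metric} (ASR): lower for {winner}")
    mean = unweighted_method_mean(ASR, GPT)
    print(f"Unweighted per-method mean for {GPT}: {mean:.3f}")
```

For GPT-4.1, the unweighted mean of the three per-method values comes out close to, but not exactly equal to, the reported Overall figure of 34.900, which suggests the page may weight the methods differently (for example by number of prompts).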