AI 模型对比
按供应商、类型、上下文大小、基准分数和参考价格对比模型家族。
Reference data snapshot: 2026-05-29
这些数值来自工具数据集的参考估算。做生产决策前,请以 Crazyrouter 主站价格页的实时模型价格为准。
Migrated local data from D:\crazyrouter-tools and image-nextjs planning records
对比模型能力
按供应商和模型类型筛选,再按基准或成本排序。
| 模型 | 供应商 | 类型 | 上下文 | MMLU | HumanEval | Math | 输入 / 1M | 输出 / 1M | 视觉 | 工具 |
|---|---|---|---|---|---|---|---|---|---|---|
| GPT-5.4 | OpenAI | 旗舰 | 270K | 93.5 | 95.8 | 88.0 | $2.5 | $15 | ||
| Gemini 3.1 Pro | 旗舰 | 2M | 93.2 | 96.0 | 89.5 | $1.25 | $10 | |||
| Claude Opus 4.6 | Anthropic | 旗舰 | 200K | 93.0 | 96.5 | 89.0 | $15 | $75 | ||
| Grok 4 | xAI | 推理 | 256K | 93.0 | 96.0 | 93.0 | $5 | $25 | ||
| o3 | OpenAI | 推理 | 200K | 92.3 | 95.2 | 98.6 | $10 | $40 | ||
| Claude Sonnet 4.6 | Anthropic | 旗舰 | 200K | 91.5 | 95.5 | 86.5 | $3 | $15 | ||
| DeepSeek-R1-0528 | DeepSeek | 推理 | 128K | 91.5 | 93.5 | 97.8 | $0.55 | $2.19 | ||
| Qwen3-Max | Qwen | 旗舰 | 128K | 91.0 | 94.0 | 85.0 | $0.4 | $1.6 | ||
| Gemini 2.5 Pro | 推理 | 1M | 90.8 | 94.0 | 86.5 | $1.25 | $10 | |||
| GPT-4.1 | OpenAI | 旗舰 | 1.0M | 90.2 | 93.5 | 80.0 | $3 | $12 | ||
| Llama 4 Maverick | Meta | 开放 | 256K | 89.2 | 91.5 | 80.5 | $0.5 | $1.5 | ||
| GPT-5 mini | OpenAI | 快速 | 270K | 89.0 | 92.0 | 82.0 | $0.25 | $2 | ||
| Gemini 3 Flash | 快速 | 1M | 89.0 | 92.5 | 83.0 | $0.15 | $0.6 | |||
| GPT-4o | OpenAI | 旗舰 | 128K | 88.7 | 90.2 | 76.6 | $2.5 | $10 | ||
| o4-mini | OpenAI | 推理 | 200K | 88.5 | 93.4 | 98.2 | $4 | $16 | ||
| DeepSeek-V3 0324 | DeepSeek | 旗舰 | 128K | 88.5 | 91.0 | 78.5 | $0.27 | $1.1 | ||
| GPT-4.1 mini | OpenAI | 快速 | 1.0M | 87.5 | 91.0 | 76.0 | $0.8 | $3.2 | ||
| Mistral Medium 3 | Mistral | 旗舰 | 128K | 87.0 | 91.5 | 76.0 | $2 | $6 | ||
| QwQ-32B | Qwen | 推理 | 128K | 86.5 | 90.0 | 95.0 | $0.15 | $0.6 | ||
| Claude Haiku 4.5 | Anthropic | 快速 | 200K | 86.0 | 90.0 | 75.0 | $0.8 | $4 | ||
| GPT-4o mini | OpenAI | 快速 | 128K | 82.0 | 87.0 | 70.2 | $0.15 | $0.6 |