deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

DeepSeek R1 Distill Qwen 32B is a dense large language model based on Qwen 2.5 32B and fine-tuned on outputs generated by DeepSeek R1. It outperforms OpenAI's o1-mini across various benchmarks, achieving new state-of-the-art results for dense models. Benchmark results include: AIME 2024: 72.6 | MATH-500: 94.3 | CodeForces Rating: 1691.


Visibility: Public
Price: $0.12 input / $0.18 output per million tokens
Quantization: fp8
Context length: 131,072 tokens
JSON output: supported
Project | License

d66bcfc2f3fd52799f95943264f32ba15ca0003d

2025-01-31T20:01:33+00:00
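
The listed pricing ($0.12 per million input tokens, $0.18 per million output tokens) and the 131,072-token context length can be turned into a quick cost estimate. The following is a minimal sketch, not part of any official client: the names `estimate_cost` and `fits_context` are hypothetical, and the check assumes that input and output tokens share the same context window, which the listing does not state.

```python
from decimal import Decimal

# Values copied from the listing above (USD per million tokens).
INPUT_PRICE_PER_MTOK = Decimal("0.12")
OUTPUT_PRICE_PER_MTOK = Decimal("0.18")
CONTEXT_LENGTH = 131_072  # tokens

TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated request cost in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MTOK
        + output_tokens * OUTPUT_PRICE_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check the request against the listed context length.

    Assumption: prompt and completion tokens count against one shared window.
    """
    return input_tokens + output_tokens <= CONTEXT_LENGTH


if __name__ == "__main__":
    cost = estimate_cost(100_000, 20_000)
    print(f"Estimated cost: ${cost}")
    print(f"Fits in context: {fits_context(100_000, 20_000)}")
```

For example, a request with 100,000 input tokens and 20,000 output tokens would come to $0.0156 at the listed rates. `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding error.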