featured

deepseek-ai/DeepSeek-R1

DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning tasks.


Public
$0.85 / $2.50 per million tokens (input / output)
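As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. It assumes that "Mtoken" means one million tokens and that "in/out" refers to input (prompt) and output (completion) tokens. The function name `estimate_cost` and the token counts in the usage line are hypothetical, not part of any provider API.

```python
# Prices taken from the listing; interpreting them as USD per one million
# tokens is an assumption.
INPUT_PRICE_PER_MTOKEN = 0.85
OUTPUT_PRICE_PER_MTOKEN = 2.50
TOKENS_PER_MTOKEN = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_PRICE_PER_MTOKEN / TOKENS_PER_MTOKEN
    output_cost = output_tokens * OUTPUT_PRICE_PER_MTOKEN / TOKENS_PER_MTOKEN
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative token counts only: 12,000 prompt tokens, 4,000 completion tokens.
    print(f"${estimate_cost(12_000, 4_000):.4f}")
```

Because output tokens cost roughly three times as much as input tokens at these rates, long reasoning traces are likely to dominate the bill for a reasoning model.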
16,000
Project License

cb48aa8cb28c160ec8d853707278e0402c9ad01a

2025-01-22T22:43:37+00:00