Update README.md

2025-06-14 13:43:47 -04:00 · 2025-01-28 23:59:12 +05:30 · 2025-01-28 23:59:12 +05:30 · dbc3a01195
commit dbc3a01195
parent 1108785f81
1 changed files with 7 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -53,6 +53,13 @@ we introduce DeepSeek-R1, which incorporates cold-start data before RL.
 DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. 
 To support the research community, we have open-sourced DeepSeek-R1-Zero, DeepSeek-R1, and six dense models distilled from DeepSeek-R1 based on Llama and Qwen. DeepSeek-R1-Distill-Qwen-32B outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.
 **Key Features**
 - State-of-the-art performance in reasoning tasks
 - Open-source availability of both main models
 - Six dense distilled models based on Llama and Qwen architectures
 - 32,768 token context length support
 - Comprehensive benchmark results across multiple domains
 **NOTE: Before running DeepSeek-R1 series models locally, we kindly recommend reviewing the [Usage Recommendation](#usage-recommendations) section.**