DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed for complex reasoning tasks in mathematics, coding, and language. It is released under the MIT License, which permits both commercial and academic use. The model uses a Mixture of Experts (MoE) architecture with 671 billion total parameters, of which 37 billion are active per token, and supports a context length of up to 128,000 tokens. Its training centers on large-scale reinforcement learning (RL): the DeepSeek-R1-Zero variant was trained with RL and no supervised fine-tuning as a preliminary step, while DeepSeek-R1 adds a small amount of cold-start data before RL. This approach has resulted in performance comparable to leading models such as OpenAI's o1 at lower cost. To support the research community, DeepSeek has also released distilled versions of the model based on Llama and Qwen architectures.

This is a full repo snapshot ZIP file of the DeepSeek R1 code.

Features

  • Mixture of Experts (MoE) Architecture – Features 671 billion total parameters, with 37 billion active parameters per token, optimizing efficiency and performance.
  • 128K Context Length – Supports an extended context window of up to 128,000 tokens, enabling better comprehension of long-form content.
  • Reinforcement Learning Training – Uses large-scale reinforcement learning (RL), rather than supervised fine-tuning, as the main driver of its reasoning capabilities.
  • High Performance – Achieves results comparable to leading models like OpenAI’s o1, while being more cost-efficient.
  • Open-Source & Commercial Use – Released under the MIT License, allowing unrestricted access for both academic and enterprise applications.
  • Math, Coding & Reasoning Capabilities – Excels in mathematics, coding, and logical reasoning, making it suitable for a wide range of text-based reasoning tasks.
  • Distilled Versions Available – Includes optimized versions based on architectures like LLaMA and Qwen, delivering high efficiency.
  • Cloud & Local Deployment – Available via Azure AI Foundry and GitHub, ensuring seamless integration into various platforms.
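
The MoE figures in the feature list (671 billion total parameters, 37 billion active per token) mean that only about 5.5% of the model runs for any single token. The sketch below illustrates the general idea with a toy top-k router in plain Python. It is not DeepSeek-R1's actual routing code: the expert count, the value of k, the gate scores, and all function names are hypothetical and chosen only for illustration.

```python
import math

TOTAL_PARAMS_B = 671   # total parameters in billions (from the feature list)
ACTIVE_PARAMS_B = 37   # parameters active per token, in billions


def active_fraction(total_b, active_b):
    """Share of parameters that take part in one token's forward pass."""
    return active_b / total_b


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def route_top_k(gate_scores, k):
    """Select the k highest-scoring experts and renormalize their weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    """
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k):
    """Run only the selected experts and mix their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in route_top_k(gate_scores, k))


# Toy setup (hypothetical): 8 experts, each one just scales its input.
experts = [lambda x, s=s: s * x for s in range(1, 9)]
gate_scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]

routed = route_top_k(gate_scores, k=2)
output = moe_forward(1.0, experts, gate_scores, k=2)
fraction = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)

print(f"active fraction: {fraction:.1%}")
print("selected experts:", [i for i, _ in routed])
```

In this toy run only 2 of the 8 experts are evaluated for the input, which is the same trade-off the parameter counts describe: a large total capacity with a much smaller per-token compute cost.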

License

MIT License

User Ratings

5 stars: 1 rating; 4, 3, 2, and 1 stars: 0 ratings
ease 5 / 5
features 5 / 5
design 5 / 5
support 5 / 5

User Reviews

  • Amazing open source AI model with super good reasoning abilities

Additional Project Details

Operating Systems

Linux, Android, Mac, Windows

Languages

English, Chinese (Traditional), Chinese (Simplified)

Programming Language

Python

Related Categories

Python Large Language Models (LLM), Python Reinforcement Learning Frameworks, Python Reinforcement Learning Libraries, Python Reinforcement Learning Algorithms, Python AI Models

Registered

2025-02-27