DeepSeek R1

DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed for complex reasoning tasks in mathematics, coding, and language. It is released under the MIT License, which permits both commercial and academic use. The model uses a Mixture of Experts (MoE) architecture with 671 billion total parameters, of which 37 billion are active per token, and supports a context length of up to 128,000 tokens. Its training centers on large-scale reinforcement learning (RL): the companion model DeepSeek-R1-Zero was trained with RL alone, without supervised fine-tuning as a preliminary step, while DeepSeek-R1 adds a small amount of cold-start data before RL. The result is performance comparable to OpenAI's o1 on reasoning tasks while remaining cost-efficient. To support the research community, DeepSeek has also released distilled versions of the model based on architectures such as LLaMA and Qwen.

This is a full repo snapshot ZIP file of the DeepSeek R1 code.

Features

Mixture of Experts (MoE) Architecture – Features 671 billion total parameters, with 37 billion active parameters per token, so only about 5.5% of the weights are used for any given token.
128K Context Length – Supports an extended context window of up to 128,000 tokens, enabling better comprehension of long-form content.
Reinforcement Learning Training – Relies on large-scale reinforcement learning (RL) to develop reasoning capabilities; DeepSeek-R1-Zero was trained with RL and no supervised fine-tuning, and DeepSeek-R1 adds a small cold-start stage before RL.
High Performance – Achieves results comparable to leading reasoning models like OpenAI's o1, while being more cost-efficient.
Open-Source & Commercial Use – Released under the MIT License, allowing unrestricted access for both academic and enterprise applications.
Reasoning & Coding Capabilities – Excels in mathematics, coding, and logical reasoning on text input, making it suitable for a range of reasoning-heavy tasks.
Distilled Versions Available – Includes smaller distilled models based on architectures like LLaMA and Qwen, which are cheaper to run than the full model.
Cloud & Local Deployment – Available via Azure AI Foundry and GitHub for integration into cloud or local workflows.
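
The "active parameters per token" figure in the feature list follows from how a Mixture of Experts layer routes each token to a few experts instead of all of them. The toy sketch below illustrates generic top-k routing only; it is not DeepSeek's actual router, and the expert count, the value of k, the scalar "experts", and the function names `top_k_route` and `moe_forward` are all assumptions made for illustration. The final lines compute the active fraction from the two parameter counts stated above.

```python
import math
import random


def top_k_route(gate_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are a softmax over the selected logits only, so they sum to 1.
    This is one common convention, assumed here for illustration.
    """
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    peak = max(gate_logits[i] for i in top)  # subtract max for stability
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(x, experts, gate_logits, k):
    """Run only the k routed experts and mix their outputs by gate weight."""
    routed = top_k_route(gate_logits, k)
    output = sum(weight * experts[i](x) for i, weight in routed)
    return output, [i for i, _ in routed]


# Hypothetical toy setup: 8 scalar "experts", 2 active per token.
NUM_EXPERTS = 8
TOP_K = 2
experts = [lambda x, s=s: s * x for s in range(1, NUM_EXPERTS + 1)]

rng = random.Random(0)
gate_logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

y, used = moe_forward(3.0, experts, gate_logits, TOP_K)
print(f"experts used for this token: {used} of {NUM_EXPERTS}")

# Active fraction implied by the figures quoted in the description.
TOTAL_PARAMS = 671e9
ACTIVE_PARAMS = 37e9
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"active fraction per token: {active_fraction:.1%}")
```

The experts that are not selected are never evaluated for that token, which is why per-token compute tracks the 37 billion active parameters and not the 671 billion total.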

License

MIT License

User Ratings

5.0 out of 5 stars

Ease: 5 / 5
Features: 5 / 5
Design: 5 / 5
Support: 5 / 5

User Reviews

dappervoid Posted 2025-02-27

Amazing open source AI model with super good reasoning abilities

Additional Project Details

Operating Systems

Linux, Android, Mac, Windows

Languages

English, Chinese (Traditional), Chinese (Simplified)

Programming Language

Python

Related Categories

Python Large Language Models (LLM), Python Reinforcement Learning Frameworks, Python Reinforcement Learning Libraries, Python Reinforcement Learning Algorithms, Python AI Models

Registered

2025-02-27

Similar Business Software

DeepSeek R1

DeepSeek-R1 is an advanced open-source reasoning model developed by DeepSeek, designed to rival OpenAI's Model o1. Accessible via web, app, and API, it excels in complex tasks such as mathematics and coding, demonstrating superior performance on benchmarks like the American Invitational...

QwQ-32B

QwQ-32B is an advanced reasoning model developed by Alibaba Cloud's Qwen team, designed to enhance AI's problem-solving capabilities. With 32 billion parameters, it achieves performance comparable to state-of-the-art models like DeepSeek's R1, which has 671 billion parameters. This efficiency...

Phi-4-reasoning

Phi-4-reasoning is a 14-billion parameter transformer-based language model optimized for complex reasoning tasks, including math, coding, algorithmic problem solving, and planning. Trained via supervised fine-tuning of Phi-4 on carefully curated "teachable" prompts and reasoning demonstrations...