deepseek-r1-distill-qwen-7b

Public

DeepSeek R1 Distill Qwen 7B by deepseek-ai

5.8K Downloads

2 stars

Capabilities

Minimum system memory

4GB

Tags

7B
qwen2

Last updated

Updated on May 24 by lmmy

README

DeepSeek R1 Distill Qwen 7B by deepseek-ai

Supports a context length of 128k tokens.

Distilled from DeepSeek's R1 reasoning model.

Tuned for reasoning and chain-of-thought.
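
Because the model is tuned for chain-of-thought, its raw output usually contains the reasoning trace as well as the final answer. The sketch below shows one way to separate the two. It assumes the trace is delimited by `<think>` and `</think>` tags, as is typical for the DeepSeek R1 family, and that the opening tag may be missing when the chat template already supplied it. The function name `split_reasoning` is hypothetical and not part of any library.

```python
def split_reasoning(text: str) -> tuple[str, str]:
    """Split raw model output into (reasoning, answer).

    Assumes the reasoning trace ends with a closing </think> tag.
    The opening <think> tag is optional, since some chat templates
    emit it as part of the prompt rather than the completion.
    """
    open_tag, close_tag = "<think>", "</think>"
    head, sep, tail = text.partition(close_tag)
    if not sep:
        # No closing tag found: treat the whole output as the answer.
        return "", text.strip()
    # Drop the first opening tag if present, then trim whitespace.
    reasoning = head.replace(open_tag, "", 1).strip()
    return reasoning, tail.strip()


raw = "<think>2 + 2 is 4.</think>The answer is 4."
reasoning, answer = split_reasoning(raw)
print("reasoning:", reasoning)
print("answer:", answer)
```

Splitting on the closing tag rather than matching both tags keeps the helper tolerant of outputs where only `</think>` appears.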

Sources

The underlying model files this model uses