deepseek-r1-distill-qwen-14b

Public

6.9K Downloads

3 stars

Capabilities

Minimum system memory

8GB

Tags

14B
qwen2

Last updated

Updated on May 24 by lmmy

README

DeepSeek R1 Distill Qwen 14B by deepseek-ai

Supports a context length of 128k tokens.

Distilled from DeepSeek's R1 reasoning model.

Tuned for reasoning and chain-of-thought.
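
Because the model is tuned for chain-of-thought, its raw output usually contains the reasoning trace as well as the final answer. The sketch below shows one way to separate the two. It assumes the reasoning is wrapped in `<think>...</think>` tags, which is how the R1 distill models commonly emit it. The helper name `split_reasoning` is hypothetical and not part of any library. Check the output format of your own runtime before relying on this.

```python
import re
from typing import Tuple

# Assumption: the reasoning trace is wrapped in <think>...</think> and
# appears before the final answer. Adjust the pattern if your runtime differs.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> Tuple[str, str]:
    """Return (reasoning, answer) extracted from raw model output.

    If no <think> block is found, reasoning is empty and the whole
    text is treated as the answer.
    """
    match = THINK_BLOCK.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # Everything outside the first <think> block is treated as the answer.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


if __name__ == "__main__":
    raw = "<think>2 + 2 is 4</think>\nThe answer is 4."
    reasoning, answer = split_reasoning(raw)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Keeping the reasoning separate lets an application show only the answer to end users while still logging the trace for debugging.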

Sources

The underlying model files this model uses