Last updated: 2025-01-22 (Wed) 11:35:40
DeepSeek-R1-Distill
- DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models.
| Model | Size (BF16) | Base Model |
|---|---|---|
| DeepSeek-R1-Distill-Qwen-1.5B | 3.55GB | Qwen2.5-Math-1.5B |
| DeepSeek-R1-Distill-Qwen-7B | 15.23GB | Qwen2.5-Math-7B |
| DeepSeek-R1-Distill-Llama-8B | 16.06GB | Llama-3.1-8B |
| DeepSeek-R1-Distill-Qwen-14B | 29.54GB | Qwen2.5-14B |
| DeepSeek-R1-Distill-Qwen-32B | 65.56GB | Qwen2.5-32B |
| DeepSeek-R1-Distill-Llama-70B | 136.87GB | Llama-3.3-70B-Instruct |
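As a rough sanity check on the BF16 sizes listed above: BF16 stores each weight in 2 bytes, so dividing a checkpoint size by 2 gives an approximate weight count. The sketch below assumes 1GB = 10^9 bytes and ignores file metadata; the function name `estimated_params_billion` is made up for this illustration.

```python
# BF16 checkpoint sizes in GB, copied from the table above.
BF16_SIZE_GB = {
    "DeepSeek-R1-Distill-Qwen-1.5B": 3.55,
    "DeepSeek-R1-Distill-Qwen-7B": 15.23,
    "DeepSeek-R1-Distill-Llama-8B": 16.06,
    "DeepSeek-R1-Distill-Qwen-14B": 29.54,
    "DeepSeek-R1-Distill-Qwen-32B": 65.56,
    "DeepSeek-R1-Distill-Llama-70B": 136.87,
}

BYTES_PER_BF16_WEIGHT = 2  # bfloat16 is a 16-bit format


def estimated_params_billion(size_gb: float, bytes_per_gb: float = 1e9) -> float:
    """Approximate weight count in billions (assumption: 1GB = 1e9 bytes, no file overhead)."""
    return size_gb * bytes_per_gb / BYTES_PER_BF16_WEIGHT / 1e9


for name, size_gb in BF16_SIZE_GB.items():
    print(f"{name}: ~{estimated_params_billion(size_gb):.2f}B weights")
```

The estimates come out somewhat above the nominal model names (for example, the 1.5B model works out to roughly 1.8B weights), which is plausible if the nominal figure excludes some parameters such as embeddings; treat the numbers as ballpark only.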
Ollama/Models
https://ollama.com/library/deepseek-r1
| Base | Model | Parameters | Size | Download | Quantization |
|---|---|---|---|---|---|
| Qwen | DeepSeek R1 | 1.5B | 1.1GB | ollama run deepseek-r1:1.5b | Q4_K_M |
| Qwen | DeepSeek R1 (default) | 7B | 4.7GB | ollama run deepseek-r1:7b | Q4_K_M |
| Llama | DeepSeek R1 | 8B | 4.9GB | ollama run deepseek-r1:8b | Q4_K_M |
| Qwen | DeepSeek R1 | 14B | 9.0GB | ollama run deepseek-r1:14b | Q4_K_M |
| Qwen | DeepSeek R1 | 32B | 20GB | ollama run deepseek-r1:32b | Q4_K_M |
| Llama | DeepSeek R1 | 70B | 43GB | ollama run deepseek-r1:70b | Q4_K_M |
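Once one of the tags above has been pulled with `ollama run`, the same model can also be queried from a script through Ollama's local REST API. The following is a minimal sketch, assuming an Ollama server is listening on its default address `http://localhost:11434`; the helper names `build_request` and `generate` are made up for this example.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str, timeout: float = 300.0) -> str:
    """Send the prompt and return the generated text from the JSON reply."""
    with urllib.request.urlopen(build_request(model, prompt), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))["response"]
```

For example, `generate("deepseek-r1:7b", "Why is the sky blue?")` should return the model's answer as a string, provided the `deepseek-r1:7b` tag has already been downloaded.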