Last updated: 2025-03-08 (Sat) 21:08:38
OpenThinker
https://github.com/open-thoughts/open-thoughts
OpenThinker-32B
- a version of Qwen2.5-32B-Instruct fine-tuned on the OpenThoughts-114k dataset.
OpenThinker-7B
- a version of Qwen2.5-7B-Instruct fine-tuned on the OpenThoughts-114k dataset.
Notes
- Our first goal is to curate a reasoning dataset to train state-of-the-art small reasoning models that surpass DeepSeek-R1-Distill-Qwen-32B and DeepSeek-R1-Distill-Qwen-7B on math and code reasoning benchmarks.