最終更新:2025-02-25 (火) 08:44:46 (59d)  

日本語LLM評価
Top / 日本語LLM評価

https://swallow-llm.github.io/evaluation/index.ja.html

https://github.com/swallow-llm/swallow-evaluation

ベンチマーク

比較したモデル

  • Aya Expanse 32B?
  • Aya Expanse 8B
  • C4AI Command-R v0.1?
  • CyberAgentLM2-7B-chat?
  • CyberAgentLM2-7B?
  • CyberAgentLM3-22B-chat?
  • ELYZA-japanese-Llama-2-13b
  • Fugaku-LLM 13B?
  • GPT-3.5 (gpt-3.5-turbo-0125)?
  • GPT-4o (gpt-4o-2024-05-13)?
  • GRIN-MoE?
  • Gemma 2 27B IT?
  • Gemma 2 2B IT?
  • Gemma 2 9B IT?
  • Gemma 2 27B?
  • Gemma 2 2B?
  • Gemma 2 9B?
  • Gemma 2 Baku 2B IT?
  • Gemma 2 Baku 2B?
  • Gemma 2 JPN
  • Japanese Stable LM Base Gamma 7B?
  • Japanese Stable LM Beta 70B?
  • Japanese Stable LM Beta 7B?
  • KARAKURI LM 70B Chat v0.1?
  • KARAKURI LM 8x7B Instruct v0.1?
  • KARAKURI LM 70B v0.1?
  • LLM-jp-13B v2.0?
  • Llama 2 13B
  • Llama 2 70B
  • Llama 2 7B
  • Llama 3 70B Instruct?
  • Llama 3 8B Instruct?
  • Llama 3 70B?
  • Llama 3 8B
  • Llama 3 Swallow 70B Instruct?
  • Llama 3 Swallow 8B Instruct?
  • Llama 3 Swallow 70B?
  • Llama 3 Swallow 8B?
  • Llama 3 Youko 70B Instruct
  • Llama 3 Youko 8B Instruct?
  • Llama 3 Youko 70B?
  • Llama 3 Youko 8B
  • Llama 3 heron brain 70B v0.3?
  • Llama 3 heron brain 8B v0.3?
  • Llama 3.1 405B Instruct?
  • Llama 3.1 70B Instruct?
  • Llama 3.1 8B Instruct?
  • Llama 3.1 70B?
  • Llama 3.1 8B?
  • Llama 3.1 Swallow 70B Instruct v0.1?
  • Llama 3.1 Swallow 8B Instruct v0.1?
  • Llama 3.1 Swallow 8B Instruct v0.2?
  • Llama 3.1 Swallow 70B Instruct v0.3
  • Llama 3.1 Swallow 8B Instruct v0.3?
  • Llama 3.1 Swallow 70B v0.1?
  • Llama 3.1 Swallow 8B v0.1?
  • Llama 3.1 Swallow 8B v0.2?
  • Llama 3.2 1B Instruct?
  • Llama 3.2 3B Instruct?
  • Llama 3.2 1B
  • Llama 3.2 3B?
  • Llama 3.3 70B Instruct?
  • Llama-3-ELYZA-JP-8B?
  • Llama-3.1-70B-Japanese-Instruct-2407
  • Mistral-7B-Instruct-v0.3?
  • Mistral-7B-v0.1?
  • Mistral-7B-v0.2?
  • Mistral-7B-v0.3?
  • Mistral-NeMo-Instruct-2407 (12B)?
  • Mistral-NeMo-Minitron 8B Instruct?
  • Mistral-NeMo-Minitron 8B?
  • Mistral-Nemo-Base-2407 (12B)?
  • Mixtral-8x7B-Instruct-v0.1?
  • Mixtral-8x22B-Instruct-v0.1?
  • Mixtral-8x7B-v0.1?
  • Mixtral-8x22B-v0.1?
  • OLMo-2-1124-13B-Instruct?
  • OLMo-2-1124-7B-Instruct?
  • OLMo-2-1124-13B?
  • OLMo-2-1124-7B?
  • Phi-3-Mini-128K-Instruct?
  • Phi-3.5-MoE Instruct?
  • Qwen1.5-7B?
  • Qwen2-72B-Instruct?
  • Qwen2-7B-Instruct?
  • Qwen2-72B?
  • Qwen2-7B?
  • Qwen2.5-72B-Instruct
  • Qwen2.5-3B-Instruct
  • Qwen2.5-7B-Instruct
  • Qwen2.5-0.5B-Instruct
  • Qwen2.5-0.5B
  • Qwen2.5-72B
  • Qwen2.5-1.5B-Instruct
  • Qwen2.5-1.5B?
  • Qwen2.5-3B?
  • Qwen2.5-7B?
  • RakutenAI-7B-chat?
  • RakutenAI-7B
  • Sarashina2-13B?
  • Sarashina2-70B
  • Sarashina2-7B?
  • Stockmark-100b?
  • Swallow 13B?
  • Swallow 70B?
  • Swallow 7B?
  • Swallow-70b-instruct-v0.1?
  • Swallow-7b-instruct-v0.1?
  • Swallow-MS 7B v0.1?
  • Swallow-MS-7b-instruct-v0.1?
  • Swallow-MX 8x7B v0.1?
  • Tanuki-8x8B-dpo-v1.0?
  • Tanuki-8B-dpo-v1.0?
  • Yi-1.5 34B?
  • Yi-1.5 6B?
  • Yi-1.5 9B?
  • Youri 7B
  • llm-jp-3-13b-instruct?
  • llm-jp-3-13b?
  • llm-jp-3-1.8b-instruct?
  • llm-jp-3-1.8b?
  • llm-jp-3-3.7b-instruct?
  • llm-jp-3-3.7b?

メモ

関連