検索

クイックアクセス

チラ裏

おなかすいた族！

リンク

人気の50件

最終更新:2025-05-14 (水) 21:43:23 (65d)

GGUF
GGUF/変換
Top / GGUF / 変換

モデルのダウンロード

変換

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
pip install -r requirements.txt
python .\convert-hf-to-gguf.py さっき落としてきたモデルのディレクトリ

llama.cpp/convert-hf-to-gguf.py

Hugging Faceモデルをllama.cppで使用できるGGUF形式に変換するスクリプト

--outtype

"--outtype", type=str, choices=["f32", "f16", "bf16", "q8_0", "tq1_0", "tq2_0", "auto"], default="f16",
output format - use f32 for float32, f16 for float16, bf16 for bfloat16, q8_0 for Q8_0, tq1_0 or tq2_0 for ternary, and auto for the highest-fidelity 16-bit float type depending on the first loaded tensor type

llama.cpp/convert_hf_to_gguf_update.py?

主に開発者が新しいモデルをサポートするために使用するスクリプト

quantize

bin/quantize <入力GGUFファイル> <出力GGUFファイル> Q4_K_M

参考

https://zenn.dev/laniakea/articles/63531b0f8d4d32