最終更新:2024-04-05 (金) 11:35:50 (307d)
GGUF
Top / GGUF
GPT-Generated Unified Format?
a binary format that is designed for fast loading and saving of models, and for ease of reading.
https://github.com/ggerganov/ggml/blob/master/docs/gguf.md
量子化
メモ
- K: k-quantメソッドなる新方式による量子化モデル