Last updated: 2026-02-26 (Thu) 14:04:38 (10d)
Qwen3.5-27B
- 17 GB at 4-bit quantization
- Apple M1 Ultra, MLX: 24 tok/s (LM Studio)
- Apple M1 Ultra, Unsloth: 13 tok/s (LM Studio)
- GeForce RTX 3090, Q4_K_XL?: 31 tok/s (LM Studio)
- GeForce RTX 5090, Q4_K_XL?: 60 tok/s (llama-bench)
- Apple M2 Ultra, Q4_K_XL?: 18 tok/s (llama-bench)

https://x.com/gosrum/status/2026450569695830360
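
As a rough sanity check on the 17 GB figure for 4-bit, the sketch below works out the effective bits per weight. It assumes that "27B" means 27 billion parameters and that GB means 10^9 bytes; neither is stated on this page, and the function name is purely illustrative. The result comes out above 4, which is plausibly because quantized files also store scaling factors and keep some tensors at higher precision, though this page does not confirm the breakdown.

```python
def effective_bits_per_weight(file_size_gb: float, n_params_billion: float) -> float:
    """Average number of bits stored per parameter.

    Assumptions (not from the page): GB = 1e9 bytes, and the
    parameter count is n_params_billion * 1e9.
    """
    size_bits = file_size_gb * 1e9 * 8          # file size in bits
    n_params = n_params_billion * 1e9           # total parameter count
    return size_bits / n_params


# 17 GB file, 27 billion parameters (assumed from the model name)
bits = effective_bits_per_weight(17, 27)
print(f"{bits:.2f} bits per weight")
```

By the same arithmetic, a pure 4.0 bits per weight would give 27e9 * 4 / 8 = 13.5 GB, so the remaining 3.5 GB is overhead under these assumptions.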

