Last updated: 2025-03-15 (Sat) 03:13:06 (109d)

AWQ

Activation-aware Weight Quantization for LLM Compression and Acceleration

https://github.com/mit-han-lab/llm-awq